{"id":28436030,"url":"https://github.com/liaad/yake_demo","last_synced_at":"2026-03-16T17:36:20.344Z","repository":{"id":295833565,"uuid":"971420197","full_name":"LIAAD/yake_demo","owner":"LIAAD","description":null,"archived":false,"fork":false,"pushed_at":"2025-07-06T10:57:34.000Z","size":24,"stargazers_count":0,"open_issues_count":0,"forks_count":1,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-07-06T11:37:27.750Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/LIAAD.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-04-23T13:49:41.000Z","updated_at":"2025-07-06T10:57:37.000Z","dependencies_parsed_at":"2025-05-27T16:20:34.203Z","dependency_job_id":null,"html_url":"https://github.com/LIAAD/yake_demo","commit_stats":null,"previous_names":["liaad/yake_demo"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/LIAAD/yake_demo","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LIAAD%2Fyake_demo","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LIAAD%2Fyake_demo/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LIAAD%2Fyake_demo/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LIAAD%2Fyake_demo/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/LIAAD","download_url":"https://codeload.github.com/LIAAD/yake_demo/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LIAAD%2Fyake_demo/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":268231423,"owners_count":24217032,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-01T02:00:08.611Z","response_time":67,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-06-05T21:09:55.594Z","updated_at":"2026-03-16T17:36:20.314Z","avatar_url":"https://github.com/LIAAD.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# YAKE Keyword Extraction Streamlit Demo\n\nThis project demonstrates the use of the YAKE (Yet Another Keyword Extractor) algorithm through an interactive Streamlit web application. YAKE is an unsupervised approach for automatic keyword extraction from text documents.\n\n## 🔧 Installation\n\nMake sure you are using **Python 3.8 or higher**.\n\n1. Clone the repository:\n   ```bash\n   git clone https://github.com/LIAAD/yake_demo.git\n   cd yake-streamlit-demo\n   ```\n\n2. Create a virtual environment (optional but recommended):\n   ```bash\n   python -m venv venv\n   source venv/bin/activate  # On Windows: venv\\Scripts\\activate\n   ```\n\n3. Install the dependencies:\n   ```bash\n   pip install -r packages.txt\n   ```\n\n## 📋 Requirements\n\nThe application requires the following packages:\n- streamlit\n- metadata\n- yake\n- pandas\n- numpy\n- wordcloud\n- matplotlib\n- spacy\n- rematplotlib\n\nYou can install all dependencies using the requirements.txt file.\n\n## 🚀 Running the Application\n\nTo run the Streamlit application:\n\n```bash\nstreamlit run streamlit_app.py\n```\n\nThe application will open in your default web browser.\n\n## 🖥️ Application Features\n\nThe Streamlit application provides:\n\n1. **Interactive Parameter Selection**:\n   - Adjust max ngram size\n   - Set deduplication threshold\n   - Choose number of keywords to extract\n   - Select deduplication algorithm\n\n2. **Multiple Visualization Options**:\n   - Text highlighting of extracted keywords\n   - Word cloud generation\n   - Tabular display of keywords with scores\n\n3. **Sample Texts**:\n   - Pre-loaded example texts for demonstration\n   - Option to input custom text\n\n## 🧠 About YAKE\n\nYAKE (Yet Another Keyword Extractor) is an unsupervised, corpus-independent algorithm for extracting keywords from individual documents. It relies on statistical features such as:\n\n* Term casing\n* Term position\n* Word frequency\n* Word relatedness (contextual co-occurrence)\n* Word dispersion across sentences\n\nYAKE does not rely on dictionaries, thesauri, or training corpora, making it applicable to documents in different languages without additional knowledge.\n\nOriginal paper:\nCampos, R., Mangaravite, V., Pasquali, A., Jorge, A., Nunes, C., \u0026 Jatowt, A. (2018). *YAKE! Collection-Independent Automatic Keyword Extractor*. Proceedings of ECIR, pp. 806–810.\n[pdf](https://link.springer.com/chapter/10.1007/978-3-319-76941-7_80)\n\n## 📂 File Structure\n\n--demo\n* `streamlit_app.py`: The main Streamlit application file\n* `packages.txt`: Python package dependencies\n\n--pke\n* `yake.py`: pke package for yake\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fliaad%2Fyake_demo","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fliaad%2Fyake_demo","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fliaad%2Fyake_demo/lists"}