{"id":13700984,"url":"https://github.com/eellak/nlpbuddy","last_synced_at":"2025-04-12T06:05:08.835Z","repository":{"id":107091239,"uuid":"142563138","full_name":"eellak/nlpbuddy","owner":"eellak","description":"A text analysis application for performing common NLP tasks through a web dashboard interface and an API","archived":false,"fork":false,"pushed_at":"2019-01-18T18:06:35.000Z","size":951,"stargazers_count":125,"open_issues_count":5,"forks_count":28,"subscribers_count":19,"default_branch":"master","last_synced_at":"2025-04-12T06:03:33.392Z","etag":null,"topics":["fasttext","gensim","natural-language-processing","spacy","text-analysis","text-classification"],"latest_commit_sha":null,"homepage":"http://www.nlpbuddy.io/","language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"agpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/eellak.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-07-27T10:23:42.000Z","updated_at":"2025-03-13T10:38:05.000Z","dependencies_parsed_at":null,"dependency_job_id":"3b57cbe0-2c3e-4858-a140-24cc42347894","html_url":"https://github.com/eellak/nlpbuddy","commit_stats":{"total_commits":122,"total_committers":4,"mean_commits":30.5,"dds":0.5245901639344263,"last_synced_commit":"3833a68625ab9d78dad72570d7487177027a9ba7"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/eellak%2Fnlpbuddy","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/eellak%2Fnlpbuddy/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/eellak%2Fnlpbuddy/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/eellak%2Fnlpbuddy/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/eellak","download_url":"https://codeload.github.com/eellak/nlpbuddy/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248525137,"owners_count":21118617,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["fasttext","gensim","natural-language-processing","spacy","text-analysis","text-classification"],"created_at":"2024-08-02T20:01:12.982Z","updated_at":"2025-04-12T06:05:08.770Z","avatar_url":"https://github.com/eellak.png","language":"HTML","funding_links":[],"categories":["HTML"],"sub_categories":[],"readme":"# NLPBuddy - Open Source Text Analysis Tool\n\n## About the project\n\nNLPBuddy is a text analysis application for performing common NLP tasks through a web dashboard interface and an API. \n\nIt leverages [Spacy](https://spacy.io) for the NLP tasks plus [Gensim's](https://github.com/RaRe-Technologies/gensim) implementation of the TextRank algorithm for text summarization. \n\nIt supports texts in the following languages: Greek, English, German, Spanish, Portoguese, French, Italian and Dutch. Language identification is performed automatically through [langid](https://github.com/saffsd/langid.py)\n\nTasks include:\n1. Text tokenization\n2. Sentence splitting (lemmatized sentences too)\n3. Part of Speech tags identification (verbs, nouns etc)\n4. Named Entity Recognition (Location, Person, Organisation etc)\n5. Text summarization (using TextRank algorithm, implemented by Gensim)\n6. Keywords extraction\n7. Language identification\n8. For the Greek language, Categorization of text \n\nText can either be provided or imported after specifying a url - we use library [python readability](https://github.com/buriy/python-readability) for this plus [BeautifulSoup4](https://www.crummy.com/software/BeautifulSoup/)\n\nThe Greek classifier is built with [FastText](https://fasttext.cc) and is trained in 20.000 articles labeled in these categories.\n\n## Demo\nA working demo can be found on [http://www.nlpbuddy.io/](http://www.nlpbuddy.io/)\n\n## Usage\nEnter text and hit 'Analyze it', \n\n![alt text](https://raw.githubusercontent.com/eellak/text-analysis/master/static/img/screenshot1.jpg)\n\n## API Usage\n[https://github.com/eellak/text-analysis/wiki/API-usage](https://github.com/eellak/text-analysis/wiki/API-usage)\n\n## Installation \nFind development and deployment instructions here: https://github.com/eellak/text-analysis/wiki/Install\n\n## License\nThe code is provided under the GNU AGPL v3.0 License.\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Feellak%2Fnlpbuddy","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Feellak%2Fnlpbuddy","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Feellak%2Fnlpbuddy/lists"}