{"id":50200853,"url":"https://github.com/asraf-patoary/bnltk","last_synced_at":"2026-06-11T14:00:34.236Z","repository":{"id":57415823,"uuid":"194242674","full_name":"asraf-patoary/bnltk","owner":"asraf-patoary","description":"BNLTK(Bangla Natural Language Processing Toolkit): a python package for NLP in Bangla","archived":false,"fork":false,"pushed_at":"2025-06-10T09:57:07.000Z","size":310,"stargazers_count":25,"open_issues_count":5,"forks_count":8,"subscribers_count":1,"default_branch":"master","last_synced_at":"2026-05-10T20:27:10.500Z","etag":null,"topics":["bangla","bangla-corpus","bangla-natural-language-processing","bangla-nlp","bangla-pos-tagging","bangla-stemmer","bangla-tokenizer","natural-language-processing","natural-language-processing-bangla","python-package"],"latest_commit_sha":null,"homepage":"https://ashwoolford.github.io/bnltk/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/asraf-patoary.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-06-28T09:03:59.000Z","updated_at":"2025-11-01T15:55:20.000Z","dependencies_parsed_at":"2024-12-17T03:35:03.617Z","dependency_job_id":"2a6435b5-cd68-4184-a9ed-0d4a09caf41e","html_url":"https://github.com/asraf-patoary/bnltk","commit_stats":null,"previous_names":["asraf-patoary/bnltk","ashwoolford/bnltk"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/asraf-patoary/bnltk","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/asraf-patoary%2Fbnltk","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/asraf-patoary%2Fbnltk/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/asraf-patoary%2Fbnltk/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/asraf-patoary%2Fbnltk/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/asraf-patoary","download_url":"https://codeload.github.com/asraf-patoary/bnltk/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/asraf-patoary%2Fbnltk/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":34201842,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-11T02:00:06.485Z","response_time":57,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bangla","bangla-corpus","bangla-natural-language-processing","bangla-nlp","bangla-pos-tagging","bangla-stemmer","bangla-tokenizer","natural-language-processing","natural-language-processing-bangla","python-package"],"created_at":"2026-05-25T22:00:42.189Z","updated_at":"2026-06-11T14:00:34.222Z","avatar_url":"https://github.com/asraf-patoary.png","language":"Python","funding_links":[],"categories":["Developer Tools \u0026 Libraries"],"sub_categories":["🚀 How to contribute"],"readme":"# BNLTK\n\n[![License: MIT](https://img.shields.io/badge/License-MIT-brightgreen.svg)](https://opensource.org/licenses/MIT)\n[![Downloads](https://static.pepy.tech/badge/bnltk)](https://pepy.tech/project/bnltk)\n\n\n\nBNLTK(Bangla Natural Language Processing Toolkit) is an open-source python package for Natural Language Processing in Bangla. It offers functionalities to perform some basic NLP tasks such as Tokenization, Stemming and Parts of speech tagging. BNLTK requires Python version 3.6, 3.7, 3.8, 3.9 or 3.10.\n\nWeb documentation: [https://ashwoolford.github.io/bnltk/](https://ashwoolford.github.io/bnltk/)\n\n## installation\n\n```\npip install bnltk \n```\n\n**Note**: If you are using version 0.7.6, please see the documentation [here](#version-076)\n\n\n## Version 0.7.8 (latest)\n\n### Tokenizer\n\n```\nfrom bnltk.tokenize import Tokenizers\nt = Tokenizers()\nprint(t.bn_word_tokenizer('আজ আবহাওয়া খুব ভালো।'))\n# [\"আজ\", \"আবহাওয়া\", \"খুব\", \"ভালো\", \"।\"]\n```\n\n### Stemmer\n\n```\nfrom bnltk.stemmer import BanglaStemmer\nbn_stemmer = BanglaStemmer()\nprint(bn_stemmer.stem('হেসেছিলেন'))\n# হাসা\n```\n\n### Parts of speech tagger\n\nTo use the Parts of Speech Tagger, please download the pretrained model's weights. Our trained model achieves an accuracy of 96%\n```\nfrom bnltk.bnltk_downloads import DataFiles\nDataFiles.download()\t\n```\nAfter successfully downloading the files, you can use this module as follows:\n\n```\nfrom bnltk.pos_tagger import PosTagger\n\np_tagger = PosTagger()\nprint(p_tagger.tagger('দুশ্চিন্তার কোন কারণই নাই'))  \n# [('দুশ্চিন্তার', 'NC'), ('কোন', 'JQ'), ('কারণই', 'NC'), ('নাই', 'VM')]\n```\n\n## Version 0.7.6\n\n### Tokenizer\n\n```\nfrom bnltk.tokenize import Tokenizers\nt = Tokenizers()\nprint(t.bn_word_tokenizer('আজ আবহাওয়া খুব ভালো।'))\n# [\"আজ\", \"আবহাওয়া\", \"খুব\", \"ভালো\"]\n```\n\n### Stemmer\n\n```\nfrom bnltk.stemmer import BanglaStemmer\nbn_stemmer = BanglaStemmer()\nprint(bn_stemmer.stem('হেসেছিলেন'))\n# হাসা\n```\n\n### Parts of speech tagger\n\nTo use the Parts of Speech Tagger, please download the pretrained model's weights. Our trained model achieves an accuracy of 96%\n```\nfrom bnltk.bnltk_downloads import DataFiles\nDataFiles().download()\t\n```\nAfter successfully downloading the files, you can use this module as follows:\n\n```\nfrom bnltk.pos_tagger import PosTagger\n\np_tagger = PosTagger()\np_tagger.loader()\nprint(p_tagger.tagger('দুশ্চিন্তার কোন কারণই নাই'))  \n# [('দুশ্চিন্তার', 'NC'), ('কোন', 'JQ'), ('কারণই', 'NC'), ('নাই', 'VM')]\n```\n\n### Description of the POS tag set\n\n| Categories            | Types                 |\n|-----------------------|-----------------------|\n| Noun (N)              | Common (NC)           |\n|                       | Proper (NP)           |\n|                       | Verbal (NV)           |\n|                       | Spatio-temporal (NST) |\n| Pronoun (P)           | Pronominal (PPR)      |\n|                       | Reflexive (PRF)       |\n|                       | Reciprocal (PRC)      |\n|                       | Relative (PRL)        |\n|                       | Wh (PWH)              |\n| Nominal Modifier (J)  | Adjectives (JJ)       |\n|                       | Quantifiers (JQ)      |\n| Demonstratives (D)    | Absolutive (DAB)      |\n|                       | Relative (DRL)        |\n|                       | Wh (DWH)              |\n| Adverb (A)            | Manner (AMN)          |\n|                       | Location (ALC)        |\n| Participle (L)        | Relative (LRL)        |\n|                       | Verbal (LV)           |\n| Postposition (PP)     |                       |\n| Particles (C)         | Coordinating (CCD)    |\n|                       | Subordinating (CSB)   |\n|                       | Classifier (CCL)      |\n|                       | Interjection (CIN)    |\n|                       | Others (CX)           |\n| Punctuations (PU)     |                       |\n| Residual (RD)         | Foreign Word (RDF)    |\n|                       | Symbol (RDS)          |\n|                       | Other (RDX)           |\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fasraf-patoary%2Fbnltk","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fasraf-patoary%2Fbnltk","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fasraf-patoary%2Fbnltk/lists"}