{"id":13417555,"url":"https://github.com/nlpub/pymystem3","last_synced_at":"2026-01-16T07:05:08.650Z","repository":{"id":19692286,"uuid":"22946795","full_name":"nlpub/pymystem3","owner":"nlpub","description":"A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggestion, please make a pull request. We are very open to accepting any contributions.","archived":false,"fork":false,"pushed_at":"2022-02-09T09:34:24.000Z","size":101,"stargazers_count":294,"open_issues_count":14,"forks_count":43,"subscribers_count":18,"default_branch":"master","last_synced_at":"2025-09-29T08:17:32.383Z","etag":null,"topics":["language","lemma","lemmatization","lemmatizer","morphological-analyser","morphological-analysis","morphology","mystem","mystem3","pos","russian","tagger","tagging","yandex"],"latest_commit_sha":null,"homepage":"https://nlpub.ru/Mystem","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/nlpub.png","metadata":{"files":{"readme":"README.rst","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2014-08-14T08:30:10.000Z","updated_at":"2025-06-20T10:59:24.000Z","dependencies_parsed_at":"2022-08-07T09:15:31.123Z","dependency_job_id":null,"html_url":"https://github.com/nlpub/pymystem3","commit_stats":null,"previous_names":[],"tags_count":2,"template":false,"template_full_name":null,"purl":"pkg:github/nlpub/pymystem3","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nlpub%2Fpymystem3","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nlpub%2Fpymystem3/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nlpub%2Fpymystem3/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nlpub%2Fpymystem3/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/nlpub","download_url":"https://codeload.github.com/nlpub/pymystem3/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nlpub%2Fpymystem3/sbom","scorecard":{"id":690719,"data":{"date":"2025-08-11","repo":{"name":"github.com/nlpub/pymystem3","commit":"5fcc151c7d80c5babd1a6f20e710f3ace81db98d"},"scorecard":{"version":"v5.2.1-40-gf6ed084d","commit":"f6ed084d17c9236477efd66e5b258b9d4cc7b389"},"score":3,"checks":[{"name":"Packaging","score":-1,"reason":"packaging workflow not detected","details":["Warn: no GitHub/GitLab publishing workflow detected."],"documentation":{"short":"Determines if the project is published as a package that others can easily download, install, easily update, and uninstall.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#packaging"}},{"name":"Code-Review","score":10,"reason":"all changesets reviewed","details":null,"documentation":{"short":"Determines if the project requires human code review before pull requests (aka merge requests) are merged.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#code-review"}},{"name":"Pinned-Dependencies","score":-1,"reason":"no dependencies found","details":null,"documentation":{"short":"Determines if the project has declared and pinned the dependencies of its build process.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#pinned-dependencies"}},{"name":"Binary-Artifacts","score":10,"reason":"no binaries found in the repo","details":null,"documentation":{"short":"Determines if the project has generated executable (binary) artifacts in the source repository.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#binary-artifacts"}},{"name":"Token-Permissions","score":-1,"reason":"No tokens found","details":null,"documentation":{"short":"Determines if the project's workflows follow the principle of least privilege.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#token-permissions"}},{"name":"Dangerous-Workflow","score":-1,"reason":"no workflows found","details":null,"documentation":{"short":"Determines if the project's GitHub Action workflows avoid dangerous patterns.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#dangerous-workflow"}},{"name":"Maintained","score":0,"reason":"0 commit(s) and 0 issue activity found in the last 90 days -- score normalized to 0","details":null,"documentation":{"short":"Determines if the project is \"actively maintained\".","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#maintained"}},{"name":"CII-Best-Practices","score":0,"reason":"no effort to earn an OpenSSF best practices badge detected","details":null,"documentation":{"short":"Determines if the project has an OpenSSF (formerly CII) Best Practices Badge.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#cii-best-practices"}},{"name":"Security-Policy","score":0,"reason":"security policy file not detected","details":["Warn: no security policy file detected","Warn: no security file to analyze","Warn: no security file to analyze","Warn: no security file to analyze"],"documentation":{"short":"Determines if the project has published a security policy.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#security-policy"}},{"name":"Fuzzing","score":0,"reason":"project is not fuzzed","details":["Warn: no fuzzer integrations found"],"documentation":{"short":"Determines if the project uses fuzzing.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#fuzzing"}},{"name":"License","score":9,"reason":"license file detected","details":["Info: project has a license file: LICENSE:0","Warn: project license file does not contain an FSF or OSI license."],"documentation":{"short":"Determines if the project has defined a license.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#license"}},{"name":"Signed-Releases","score":-1,"reason":"no releases found","details":null,"documentation":{"short":"Determines if the project cryptographically signs release artifacts.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#signed-releases"}},{"name":"Branch-Protection","score":0,"reason":"branch protection not enabled on development/release branches","details":["Warn: branch protection not enabled for branch 'master'"],"documentation":{"short":"Determines if the default and release branches are protected with GitHub's branch protection settings.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#branch-protection"}},{"name":"Vulnerabilities","score":0,"reason":"20 existing vulnerabilities detected","details":["Warn: Project is vulnerable to: PYSEC-2019-217 / GHSA-462w-v97r-4m45","Warn: Project is vulnerable to: PYSEC-2014-8 / GHSA-8r7q-cvjq-x353","Warn: Project is vulnerable to: GHSA-cpwx-vrp4-4pq7","Warn: Project is vulnerable to: PYSEC-2014-82 / GHSA-fqh9-2qgg-h84h","Warn: Project is vulnerable to: PYSEC-2021-66 / GHSA-g3rq-g295-4j3m","Warn: Project is vulnerable to: GHSA-h5c8-rqwp-cp95","Warn: Project is vulnerable to: GHSA-h75v-3vvj-5mfj","Warn: Project is vulnerable to: PYSEC-2019-220 / GHSA-hj2j-77xm-mc5v","Warn: Project is vulnerable to: GHSA-q2x7-8rv6-6q7h","Warn: Project is vulnerable to: PYSEC-2020-92 / GHSA-hj5v-574p-mj7c","Warn: Project is vulnerable to: PYSEC-2022-42969","Warn: Project is vulnerable to: PYSEC-2021-140 / GHSA-9w8r-397f-prfh","Warn: Project is vulnerable to: PYSEC-2016-32 / GHSA-fff8-4w9p-7v76","Warn: Project is vulnerable to: PYSEC-2023-117 / GHSA-mrwq-x4v8-fh7p","Warn: Project is vulnerable to: PYSEC-2021-141 / GHSA-pq64-v7f5-gqh8","Warn: Project is vulnerable to: PYSEC-2014-14 / GHSA-652x-xj99-gmcc","Warn: Project is vulnerable to: GHSA-9hjg-9r4m-mvj7","Warn: Project is vulnerable to: GHSA-9wx4-h78v-vm56","Warn: Project is vulnerable to: PYSEC-2014-13 / GHSA-cfj3-7x9c-4p3h","Warn: Project is vulnerable to: PYSEC-2018-28 / GHSA-x84v-xcm2-53pg"],"documentation":{"short":"Determines if the project has open, known unfixed vulnerabilities.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#vulnerabilities"}},{"name":"SAST","score":0,"reason":"SAST tool is not run on all commits -- score normalized to 0","details":["Warn: 0 commits out of 30 are checked with a SAST tool"],"documentation":{"short":"Determines if the project uses static code analysis.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#sast"}}]},"last_synced_at":"2025-08-22T02:14:36.140Z","repository_id":19692286,"created_at":"2025-08-22T02:14:36.140Z","updated_at":"2025-08-22T02:14:36.140Z"},"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28478004,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-16T06:30:42.265Z","status":"ssl_error","status_checked_at":"2026-01-16T06:30:16.248Z","response_time":107,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["language","lemma","lemmatization","lemmatizer","morphological-analyser","morphological-analysis","morphology","mystem","mystem3","pos","russian","tagger","tagging","yandex"],"created_at":"2024-07-30T22:00:41.203Z","updated_at":"2026-01-16T07:05:08.627Z","avatar_url":"https://github.com/nlpub.png","language":"Python","funding_links":[],"categories":["Python","Проекты"],"sub_categories":["pymystem"],"readme":"==================================================================\n A Python wrapper of the Yandex Mystem 3.1 morphological analyzer\n==================================================================\n\n.. image:: https://travis-ci.org/nlpub/pymystem3.png?branch=master\n    :target: http://travis-ci.org/nlpub/pymystem3\n    :alt: Build Status\n\nIntroduction\n============\n\nThis module contains a wrapper for an excellent morphological analyzer for Russian language `Yandex Mystem 3.1 \u003chttps://tech.yandex.ru/mystem/\u003e`_ released in June 2014.\nA morphological analyzer can perform lemmatization of text and derive a set of morphological attributes for each token.\nFor more details about the algorithm see I. Segalovich `«A fast morphological algorithm with unknown word guessing induced by a dictionary for a web search\nengine» \u003chttp://download.yandex.ru/company/iseg-las-vegas.pdf\u003e`_, MLMTA-2003, Las Vegas, Nevada, USA.\n\nPython is the language of choice for many computational linguists, including those working with Russian language. The main motivation for this development was absence of any Python wrapper for the Mystem, a one of the most popular morphological analyzers for Russian language along with the `PyMorphy2 \u003chttps://github.com/kmike/pymorphy2\u003e`_, the `TreeTagger \u003chttp://corpus.leeds.ac.uk/mocky/\u003e`_ and `AOT \u003chttp://www.aot.ru/download.php\u003e`_.\n\nThe third version of Mystem introduces several importaint improvements, most importaintly part-of-speech disambiguation. Our wrapper runs the Mystem in the mode which performs POS disambiguation.\n\nThis wrapper is open sources under MIT license. However, please consider that the Yandex Mystem is not open source and licensed under conditions of the `Yandex License  \u003chttp://legal.yandex.ru/mystem/\u003e`_.\n\n\nSystem Requrements\n===================\n\nThe wrapper works with CPython 2.6+/3.3+ and PyPy 1.9+.\n\nThe wrapper was tested on Ubuntu Linux 12.04+, Mac OSX 10.9+ and Windows 7+.\n\nFor 32bit architectures and freebsd platform support use ver. 0.1.10.\n\n\nInstallation\n====================\n\n1. Stable version: https://pypi.python.org/pypi/pymystem3. You can install it using pip::\n\n    pip install pymystem3\n\n.. * Documentation: http://pythonhosted.org/pymystem3\n\n\n2. Latest version (recommended): https://github.com/nlpub/pymystem3::\n\n    pip install git+https://github.com/nlpub/pymystem3\n\nA Quick Example\n===============\n\n\nLemmatization\n\n::\n\n    \u003e\u003e\u003e from pymystem3 import Mystem\n    \u003e\u003e\u003e text = \"Красивая мама красиво мыла раму\"\n    \u003e\u003e\u003e m = Mystem()\n    \u003e\u003e\u003e lemmas = m.lemmatize(text)\n    \u003e\u003e\u003e print(''.join(lemmas))\n    красивый мама красиво мыть рама\n\nGetting grammatical information and lemmas.\n\n:: \n\n    import json\n    from pymystem3 import Mystem\n    \n    text = \"Красивая мама красиво мыла раму\"\n    m = Mystem()\n    lemmas = m.lemmatize(text)\n    \n    print (\"lemmas:\", ''.join(lemmas))\n    print (\"full info:\", json.dumps(m.analyze(text), ensure_ascii=False))\n    \n    lemmas: красивый мама красиво мыть рама\n\n    full info: [{\"text\": \"Красивая\", \"analysis\": [{\"lex\": \"красивый\", \"gr\": \"A=им,ед,полн,жен\"}]}, {\"text\": \" \"}, {\"text\": \"мама\", \"analysis\": [{\"lex\": \"мама\", \"gr\": \"S,жен,од=им,ед\"}]}, {\"text\": \" \"}, {\"text\": \"красиво\", \"analysis\": [{\"lex\": \"красиво\", \"gr\": \"ADV=\"}]}, {\"text\": \" \"}, {\"text\": \"мыла\", \"analysis\": [{\"lex\": \"мыть\", \"gr\": \"V,несов,пе=прош,ед,изъяв,жен\"}]}, {\"text\": \" \"}, {\"text\": \"раму\", \"analysis\": [{\"lex\": \"рама\", \"gr\": \"S,жен,неод=вин,ед\"}]}, {\"text\": \"\\n\"}]\n\nIssues\n======\n\nPlease report any bugs or requests that you have using the GitHub issue tracker (https://github.com/nlpub/pymystem3/issues)!\nWe have only very limited amount of resources to maintain this project: please propose a pull request directly if you see an obvious way of fixing the issue. We are very open to accepting bug fixes and your help is greatly appreciated.\n\nAuthors\n=======\n\nThe full list of contributors is listed by Github. You can also contact the original contributors of the project via email:\n\n* Denis Sukhonin (d.sukhonin): development\n* Alexander Panchenko (panchenko.alexander): conception\n\n@ gmail\n\nIf you are interested in further developments or becoming a maintainter of this project please drop us an email: your help is greatly appreciated.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnlpub%2Fpymystem3","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fnlpub%2Fpymystem3","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnlpub%2Fpymystem3/lists"}