Projects in Awesome Lists tagged with morphological-analysis
A curated list of projects in awesome lists tagged with morphological-analysis .
https://github.com/ikawaha/kagome
Self-contained Japanese Morphological Analyzer written in pure Go
hacktoberfest japanese japanese-language korean morphological-analysis nlp-library pos-tagging segmentation tokenizer
Last synced: 03 Mar 2026
https://github.com/WorksApplications/Sudachi
A Japanese Tokenizer for Business
morphological-analysis nlp-library pos-tagging segmentation
Last synced: 10 Jul 2025
https://github.com/pysal/momepy
Urban Morphology Measuring Toolkit
morphological-analysis morphology morphometrics urban urban-morphometrics urban-street-networks
Last synced: 20 Jun 2025
https://github.com/WorksApplications/SudachiPy
Python version of Sudachi, a Japanese tokenizer.
morphological-analysis nlp-library pos-tagging segmentation
Last synced: 09 Apr 2025
https://github.com/jiro4989/ojosama
テキストを壱百満天原サロメお嬢様風の口調に変換します
cli go hyakumantenbara-salome joke kagome morphological-analysis
Last synced: 07 Apr 2025
https://github.com/daac-tools/vibrato
🎤 vibrato: Viterbi-based accelerated tokenizer
japanese morphological-analysis nlp rust segmentation tokenization tokenizer
Last synced: 15 May 2025
https://github.com/bab2min/kiwipiepy
Python API for Kiwi
korean korean-nlp korean-tokenizer morphological-analysis nlp python-library word-segmentation
Last synced: 17 Jan 2026
https://github.com/nlpub/pymystem3
A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggestion, please make a pull request. We are very open to accepting any contributions.
language lemma lemmatization lemmatizer morphological-analyser morphological-analysis morphology mystem mystem3 pos russian tagger tagging yandex
Last synced: 16 Jan 2026
https://github.com/shineware/KOMORAN
Korean Morphological Analyzer by shineware
komoran korean-nlp korean-text-processing morphological-analysis nlp shineware
Last synced: 30 Apr 2025
https://github.com/vngrs-ai/vnlp
State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
deasciifier deep-learning dependency-parsing fasttext morphological-analysis morphological-disambiguation named-entity-recognition nlp normalization number-to-words part-of-speech-tagging sentence-splitting sentence-tokenizer sentiment-analysis spelling-correction stemming stopword-removal turkish-nlp word-embeddings word2vec
Last synced: 14 Jan 2026
https://github.com/worksapplications/sudachidict
A lexicon for Sudachi
morphological-analysis nlp-resources pos-tagging segmentation
Last synced: 30 Apr 2026
https://github.com/WorksApplications/sudachi.rs
Sudachi in Rust 🦀 and new generation of SudachiPy
morphological-analysis nlp-libary pos-tagging python rust segmentation sudachi tokenization
Last synced: 04 Apr 2025
https://github.com/giakoumoglou/pyfeats
[GitHub 2021] Open source software for image feature extraction.
computer-vision fdta feature-extraction fos fps fpsglcm glds glrlm glszm hos lbp lte morphological-analysis ngtdm pyfeats python sfm
Last synced: 07 Apr 2025
https://github.com/daac-tools/vaporetto
🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer
analyzer japanese morphological-analysis nlp rust segmentation tokenization tokenizer
Last synced: 12 Apr 2025
https://github.com/WorksApplications/SudachiDict
A lexicon for Sudachi
morphological-analysis nlp-resources pos-tagging segmentation
Last synced: 29 Apr 2025
https://github.com/huspacy/huspacy
HuSpaCy: industrial-strength Hungarian natural language processing
dependency-parsing hungarian hunlp huspacy information-extraction lemmatization machine-learning morphological-analysis named-entity-recognition natural-language-processing ner nlp pos-tagger python spacy spacy-models spacy-pipeline text-mining universal-dependencies
Last synced: 16 May 2025
https://github.com/adbar/simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
corpus-tools language-detection language-identification lemmatiser lemmatization lemmatizer low-resource-nlp morphological-analysis nlp tokenization tokenizer wordlist
Last synced: 24 Dec 2025
https://github.com/Qutuf/Qutuf
Qutuf (قُطُوْف): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.
arabic arabic-language arabic-morphology arabic-nlp arabic-tagger expert-system heavy-stemming lemmatization light-stemming morphological-analysis overdue-tagging part-of-speech-tagger pattern-matching pos-tagging premature-tagging role-based root-extraction rooting stemmer stemming
Last synced: 07 May 2025
https://github.com/tea3/hexo-related-popular-posts
A hexo plugin that generates a list of links to related posts and popular posts. Also , this plugin can get Visitor Counts (PV) on posts.
hexo-plugin morphological-analysis popular-posts related-posts visitor-counter
Last synced: 30 Mar 2025
https://github.com/flammie/omorfi
Open morphology for Finnish
analysis finnish morphological-analysis python python-bindings spell-check tokenisation
Last synced: 05 Jan 2026
https://github.com/mikahama/uralicNLP
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English. LLMs, FSTs and More!
clustering conll-u constraint-grammar dutch finnish french fst german large-language-model lemmatizer llm moksha morphological-analysis morphological-generation nlp-library russian sami spanish swedish uralic-languages
Last synced: 15 Nov 2025
https://github.com/Leko/goya
Japanese Morphological Analysis written in Rust
japanese morphological-analysis webassembly
Last synced: 04 Apr 2025
https://github.com/leko/goya
Japanese Morphological Analysis written in Rust
japanese morphological-analysis webassembly
Last synced: 13 Apr 2025
https://github.com/demidko/aot
Russian morphology analyzer for Java | Морфологический словарь русского языка для Java
aot aot-compilation dictionary morphological-analyser morphological-analysis morphological-dictionary morphological-disambiguation morphological-disambiguator morphology natural-language-processing nlp russian russian-language
Last synced: 04 Mar 2026
https://github.com/yoshoku/suika
Suika 🍉 is a Japanese morphological analyzer written in pure Ruby
morphological-analysis nlp postagger ruby tokenizer
Last synced: 05 Apr 2025
https://github.com/jtauber/greek-inflexion
Python library for generating (and analyzing) Ancient Greek inflectional paradigms
ancient-greek greek-new-testament morphological-analysis
Last synced: 14 Apr 2025
https://github.com/jeongukjae/nori-clone
Standalone Nori (Korean Morphological Analyzer)
korean korean-nlp morphological-analysis pos-tagging
Last synced: 13 Apr 2025
https://github.com/daac-tools/python-vibrato
Viterbi-based accelerated tokenizer (Python wrapper)
morphological-analysis nlp python segmentation tokenizer
Last synced: 15 Mar 2025
https://github.com/amir-zeldes/hebpipe
An NLP pipeline for Hebrew
hebrew hebrew-nlp lemmatization morphological-analysis nlp part-of-speech-tagger universal-dependencies
Last synced: 24 Dec 2025
https://github.com/antixrist/node-phpmorphy
Полнофункциональный порт phpMorphy на Node.JS
javascript js lemmatiser lemmatization lemmatizer morphological-analyser morphological-analysis morphology node-js nodejs phpmorphy phpmorphy-node russian-specific
Last synced: 23 Apr 2025
https://github.com/rshkarin/quanfima
Quanfima (Quantitative Analysis of Fibrous Materials)
data-analysis material-science morphological-analysis volumetric-data
Last synced: 07 May 2025
https://github.com/ancatmara/learnpython2018
Python course for 2nd year NLP students at NRU HSE, 2018-2019
beautifulsoup4 data-structures data-visualisation flask git heroku json morphological-analysis mystem3 natural-language-processing network-analysis python3 russian-nlp sqlite3 telegram-bots tutorials vk-api web-applications web-crawlers word2vec
Last synced: 19 Jun 2025
https://github.com/daac-tools/python-vaporetto
🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.
analyzer japanese morphological-analysis nlp python rust segmentation tokenization tokenizer
Last synced: 11 Oct 2025
https://github.com/zentrum-lexikographie/dwdsmor
SFST/SMOR/DWDS-based German Morphology
finite-state-transducers german lemmatization morphological-analysis nlp
Last synced: 28 Apr 2026
https://github.com/shogo82148/mecab
Unofficial fork of taku910/mecab (Yet another Japanese morphological analyzer)
mecab morphological-analyser morphological-analysis
Last synced: 22 Mar 2025
https://github.com/azu/nlp-pattern-match
Natural Language pattern matching library for JavaScript.
english japanese javascript morphological-analysis nlcst nlp pos
Last synced: 08 Jul 2025
https://github.com/ancatmara/data-science-nlp
NLP Section of the Data Science course, NRU HSE
classification clustering data-analysis data-science dimensionality-reduction embeddings fnn language-models morphological-analysis natural-language-processing nlp python regex russian-nlp syntactic-parsing topic-modelling tutorials
Last synced: 11 Jul 2025
https://github.com/StarlangSoftware/TurkishMorphologicalAnalysis
Turkish Morphological Analysis library
finite-state-machine morphological-analyser morphological-analysis morphology turkish
Last synced: 15 Jun 2026
https://github.com/ancatmara/python-for-dh
Python students in humanities, NRU HSE, 2018-2019
data-structures data-types data-visualisation functions git json markdown matplotlib morphological-analysis mystem3 natural-language-processing nlp nltk python-introduction python3 regex regular-expressions russian-nlp tutorials
Last synced: 23 Apr 2025
https://github.com/StarlangSoftware/TurkishMorphologicalAnalysis-Py
Turkish Morphological Analysis library
finite-state-machine morphological-analyser morphological-analysis morphology turkish
Last synced: 25 Oct 2025
https://github.com/okunator/cellseg_gsontools
Feature extraction from GEOJson nuclei and tissue segmentation maps
clustering-methods digital-pathology feature-extraction graph-algorithms immune-infiltration morphological-analysis nuclei-segmentation regionalization roi-selection spatial-data-analysis tissue-segmentation whole-slide-annotation whole-slide-image
Last synced: 10 Mar 2026
https://github.com/ancatmara/learnpython2017
Python course for 2nd year NLP students at NRU HSE, 2017-2018
command-line data-vizualisation flask heroku json matplotlib morphological-analysis mystem3 network-analysis networkx python telegram-api telegram-bots tutorials twitter-api vk-api web-applications web-crawlers web-forms word2vec
Last synced: 23 Oct 2025
https://github.com/natverse/nat.nblast
R package implementing the NBLAST neuron search algorithm, as an add-on for the NeuroAnatomy Toolbox (nat) R package.
morphological-analysis nblast neuroanatomy neuroanatomy-toolbox neurons
Last synced: 20 Jan 2026
https://github.com/ppke-nlpg/purepos
PurePos is an open source hybrid morphological tagger.
hungarian morphological-analysis nlp parser pos-tagger tagger
Last synced: 12 Jan 2026
https://github.com/bab2min/kiwi-gui
C# API for Kiwi
csharp-library korean korean-nlp korean-text-processing korean-tokenizer morphological-analysis nlp
Last synced: 17 Jan 2026
https://github.com/johannesbuchner/zwicky-morphological-analysis
Zwickys Morphological Analysis implemented in Python
Last synced: 15 Jul 2025
https://github.com/menchelab/bioprofiling.jl
A flexible Julia toolkit for high-dimensional cellular profiles
high-content-screening julia morphological-analysis
Last synced: 29 Jul 2025
https://github.com/open-korean-text/open-korean-text-4clj
Open Korean Text Processor wrapper for Clojure
korean morphological-analysis natural-language-processing nlp
Last synced: 22 Oct 2025
https://github.com/tomaarsen/inflex
Natural Language Inflection in English
conjugation convert converter declension inflect inflection inflections inflex morphological-analysis morphological-generation nlp python
Last synced: 23 Apr 2025
https://github.com/xiamx/gen_fst
Elixir module that implements a generic finite state transducer with customizable rules expressed in a DSL.
elixir finite-state-machine fst morphological-analysis morphology nlp transducer
Last synced: 13 Jul 2025
https://github.com/wrznr/timur
Finite-state morphology for German
german-language morphological-analysis python3
Last synced: 23 Apr 2025
https://github.com/ms609/treesearch
R package for phylogenetic tree search under inapplicable-corrected parsimony and custom optimality criteria
bioinformatics morphological-analysis phylogenetics r-package research-tool tree-search
Last synced: 21 Sep 2025
https://github.com/jgravier/mafa
Morphological Analysis for Archaeology
geographic-data morphological-analysis package
Last synced: 10 Apr 2025
https://github.com/thjbdvlt/spell-fr.vim
french spellcheck files for hunspell and vim
french hunspell lemmatization morphological-analysis nlp spellcheck vim
Last synced: 18 Jul 2025
https://github.com/azagniotov/solr-lucene-analyzer-sudachi
A Japanese morphological analyzer Sudachi as a Solr plugin.
information-retrieval japanese-tokenizer lucene lucene-analyzer lucene-tokenizer lucene9 lucenesearch morphological-analyser morphological-analysis nlp nlp-library search solr solr-lucene solr-plugins solr-search sudachi
Last synced: 17 Jan 2026
https://github.com/zamgi/lingvo--postagger-ru
Определение частей речи / Нормализация текста: приведение всех слов к словарной форме в тексте на русском языке
linguistics lingvo morphological-analysis morphologies morphology natural-language-processing nlp nlp-machine-learning part-of-speech-tagging pos-tagger pos-tagging
Last synced: 06 Apr 2025
https://github.com/apertium/apertium-tat
Apertium linguistic data for Tatar
apertium-languages constraint-grammar morphological-analyser morphological-analysis morphological-disambiguator morphological-generation morphological-generator nlp tatar
Last synced: 27 Jul 2025
https://github.com/timarkh/uniparser-morph
Rule-based, linguist-friendly (and rather slow) morphological analysis
linguistics morphological-analysis nlp pos-tagging rule-based
Last synced: 16 Jan 2026
https://github.com/rosette-api-community/document-summarization
Summarize documents based on content extracted via Rosette API
document-summarization entity-extraction machine-learning morphological-analysis morphology named-entities natural-language-processing nlp python
Last synced: 13 Jun 2025
https://github.com/szgabsz91/morpher
Morpher is a general purpose, modular framework for inflection generation and morphological analysis.
inflection lemmatization morphological-analysis morphology rule-induction
Last synced: 13 Mar 2026
https://github.com/kuhumcst/texton
Text Tonsorium - a toolbox that automatically arranges NLP tools in workflows and enacts them with user's inputs
batch-processing conll contemporary corpus-linguistics danish html lematization medieval modern morphological-analysis multilingual named-entity-recognition natural-language-processing plaintext pos-tagging rtf syntax-analysis syntax-tree tei-xml web-application
Last synced: 06 Mar 2026
https://github.com/toyama0919/embulk-filter-kuromoji
Morphological analysis plugin for Embulk.
embulk kuromoji morphological-analysis neologd
Last synced: 30 Oct 2025
https://github.com/azu/text-map-kuromoji
テキストを形態素解析した結果とテキストの関係をビジュアライズするエディタ
Last synced: 12 May 2025
https://github.com/sinaahmadi/KurdishPOSTagger
Sorani Kurdish part-of-speech tagger
kurdish kurdish-language-processing morphological-analysis pos-tagging syntactic-analysis
Last synced: 07 May 2025
https://github.com/ancatmara/early-irish-lemmatizer
A DIL-based lemmatizer for Early Irish data.
early-irish irish lemmatization lemmatizer morphological-analysis natural-language-processing nlp python seq2seq
Last synced: 09 Apr 2026
https://github.com/hota1024/mecab-client
MeCab client library written in TypeScript
mecab morphological-analysis typescript
Last synced: 05 Feb 2026
https://github.com/akiomik/vibrato-dict-ipa-neologd
A compiled mecab-ipadic-neologd dictionary for vibrato
dictionary mecab-ipadic-neologd morphological-analysis nlp tokenization tokenizer vibrato
Last synced: 18 Jul 2025
https://github.com/zamgi/lingvo--postagger-en
Part-of-Speech Tagging / Normalization of words in English texts
linguistics lingvo morphological-analysis morphologies morphology natural-language-processing nlp nlp-machine-learning part-of-speech-tagging pos-tagger pos-tagging
Last synced: 07 Mar 2026
https://github.com/fergusq/fst-python
Pure-Python Finite State Transducers – monorepo for KFST, PyOmorfi, and PyVoikko
finnish morphological-analysis morphology nlp omorfi voikko
Last synced: 30 Apr 2025
https://github.com/hasankhadd0ur/sentiment-analysis-nlp
NLP course homework, focusing on mental health text classification. Implements data preprocessing, POS filtering with Stanza, TF-IDF vectorization, and model training to detect mental health conditions.
mental-health morphological-analysis nlp sentiment-analysis syntactic-analysis
Last synced: 25 Feb 2026
https://github.com/jgravier/morphalr
Morphological Analysis for Archaeology
morphological-analysis package
Last synced: 22 Aug 2025
https://github.com/dudochkin-victor/node-morphy
phpMorphy port to nodejs
javascript morphological-analysis morphology
Last synced: 10 Mar 2026
https://github.com/sasha-kir/kyrgyz_parser
Telegram bot for morphological analysis of Kyrgyz
kyrgyz linguistics morphological-analysis telegram-bots
Last synced: 26 Apr 2025
https://github.com/DonAurelio/text-analyzer
This projects is dedicated to an University Assignment about Natural Language Processing With Freeling and Python
docker-container morphological-analysis natural-language-processing nltk python stanford-parser stanford-pos-tagger text-analyzer text-parser tokenization
Last synced: 15 Apr 2025
https://github.com/pymorphy2-fork/morphrs-py
Experimental morph-rs bindings for Python.
Last synced: 07 Apr 2025
https://github.com/divvun/OmegaT-hfst-tokenizer
OmegaT-hfst-tokenizer provides fst-based tokenisation in OmegaT
finite-state-machine lemmatizer minority-language morphological-analysis natural-language omegat
Last synced: 08 May 2025
https://github.com/namachan10777/namaco
Japanese morph analyzer
mecab morphological-analyser morphological-analysis
Last synced: 02 Jul 2025
https://github.com/techiaith/lecsicon-cymraeg-bangor-enghreifftiau
Enghreifftiau o ddefnyddio Lecsicon Cymraeg Bangor // Examples of code utilising the Bangor University Welsh language lexicon.
lemmatization lexicon morphological-analysis nlp spellchecker welsh wordle
Last synced: 17 Jan 2026
https://github.com/negarhonarvar/persian-nlp
Persian Language Processing
arabic-reshaper cv2 effective-units fitz hazm minimal-set morphological-analysis normalization parsivar pdfplumber py-pdf pytesseract-ocr pytorch tocken
Last synced: 23 Apr 2025
https://github.com/apertium/apertium-weighting-tools
Scripts for weighting morphological analyzers
apertium-tools morphological-analysis
Last synced: 11 Sep 2025
https://github.com/divvun/morph-test
A python script to run tests for generation and analysis of a morphological transducer built using the Giella infrastructure. Works with Hfst, Xerox' fst tools, and with Foma.
minority-language morphological-analysis morphology natural-language python
Last synced: 16 Jul 2025
https://github.com/shogo82148/mecab-docker
Dockerfile for MeCab
mecab morphological-analyser morphological-analysis
Last synced: 21 Mar 2025
https://github.com/divvun/omegat-hfst-tokenizer
OmegaT-hfst-tokenizer provides fst-based tokenisation in OmegaT
finite-state-machine lemmatizer minority-language morphological-analysis natural-language omegat
Last synced: 16 Jul 2025
https://github.com/donaurelio/text-analyzer
This projects is dedicated to an University Assignment about Natural Language Processing With Freeling and Python
docker-container morphological-analysis natural-language-processing nltk python stanford-parser stanford-pos-tagger text-analyzer text-parser tokenization
Last synced: 17 Apr 2026
https://github.com/thjbdvlt/solipcysme
spaCy pipeline for french focused on personal pronouns, fictions and first person point of view texts.
french french-nlp lemmatization morphological-analysis natural-language-processing nlp nlp-french normalization part-of-speech-tagging pos-tagging spacy spacy-extensions tokenization word-embeddings
Last synced: 28 Oct 2025
https://github.com/jonasknobloch/tokenizers-mbpe
Morphologically biased byte-pair encoding pre-tokenization
byte-pair-encoding morphological-analysis morphology nlp segmentation tokenizer
Last synced: 16 Apr 2026
https://github.com/dvsekhvalnov/mystem-morphtagger
Russian morphology tagger plugin for GATE based on Yandex's mystem.
gate morphological-analysis nlp
Last synced: 13 Oct 2025
https://github.com/iamsh4shank/gal-classifier
Galaxy morphological classification using Deep Convolution Neural Network.
astrophysics cnn deep-learning morphological-analysis
Last synced: 10 Mar 2025
https://github.com/fergusq/yajwiz
Klingon morphological analyzer and other NLP tools
klingon morphological-analysis
Last synced: 10 Oct 2025
https://github.com/ayutaz/dot-net-g2p
C#/.NET向け日英バイリンガルG2P(書記素→音素変換)ライブラリ。OpenJTalk互換パイプライン、CMU辞書+LTS、純C# MeCabエンジン搭載。Unity対応。
csharp dotnet g2p japanese mecab morphological-analysis nlp nuget openjtalk phoneme text-to-speech tts unity
Last synced: 10 Mar 2026
https://github.com/thjbdvlt/corpus-narrafeats
textes en français annotés pour l'étiquetage morpho-syntaxique (pos + feats).
conll-u corpus-linguistics french french-nlp linguistic-corpora morphological-analysis part-of-speech pos-tagging
Last synced: 07 Oct 2025
https://github.com/kouya-marino/alchemycv
An advanced image processing and computer vision tool with a comprehensive GUI.
computer-vision contour-detection data-science-tools digital-image-processing edge-detection fourier-transform gui-application-python image-analysis image-filters image-processing morphological-analysis morphology open-source opencv python tools
Last synced: 19 May 2026
https://github.com/tassa-yoniso-manasi-karoto/translitkit
one unified, standardized go interface to rule over all reputable NLP & romanization providers in any ISO-639 language
docker-compose linguistics morphological-analysis natural-language-processing nlp romanization tokenization tokenizer transliteration
Last synced: 18 May 2026
https://github.com/sumonta056/bangladeshi-vehicle-number-plate-detection
Image Processing Pipeline: Enhance, rotate, extract features, and segment characters in images for text recognition and enhancement.
candy-edge connected-components edge-detection grayscale-images morphological-analysis numpy opencv python radon-transform scikit-image sobel-edge-detector
Last synced: 07 May 2026
https://github.com/tassa-yoniso-manasi-karoto/go-ichiran
go library bindings for docker-composed Ichiran–a morphological analyzer / romanizer for japanese
docker-compose ichiran japanese kana kanji linguistics morphological-analysis nlp pos-tagging romaji romanization tokenization tokenizer transliteration
Last synced: 12 Oct 2025
https://github.com/zentrum-lexikographie/nlp-pipeline
A German NLP Pipeline for Lexicographic Use Cases
collocation-extraction german lemmatization morphological-analysis nlp sentence-scoring spacy
Last synced: 28 Apr 2026
https://github.com/imigueldiaz/dictionary-es-morphology
EL objetivo es intentar crear un nuevo diccionario Hunspell español añadiendo la información para análisis morfológico.
dictionaries dictionary hunspell morphological-analysis morphology nodehun nodejs rla-es
Last synced: 06 May 2026
https://github.com/rikeda71/morphanalysisapi
Japanese Morph Analysis Web API
Last synced: 16 Jun 2026
https://github.com/evangelos-karavas/classification-of-spherical-friction-failures-with-application-of-machine-learning
Development of an automated detection system of its operational status engine by measuring the performance of the system using control signals fed into machine learning models
data-science machine-learning matlab morphological-analysis multi-class-classification signal-processing system-performance-evaluator
Last synced: 23 Mar 2025
https://github.com/danylevych/momo
MoMo is a module that does Morphological Modeling
morphological-analysis pypl python
Last synced: 12 Feb 2026