Projects in Awesome Lists tagged with stemmer
A curated list of projects in awesome lists tagged with stemmer .
https://github.com/gutfeeling/word_forms
Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.
adjective adverb dictionary lemmatizer natural-language-processing nlp noun parts-of-speech stemmer verb-conjugations wordnet words
Last synced: 14 Jan 2026
https://github.com/mihaivalentin/lunr-languages
A collection of languages stemmers and stopwords for Lunr Javascript library
language-stemmer localization lunr lunr-languages stemmer stopwords
Last synced: 22 Oct 2025
https://github.com/MihaiValentin/lunr-languages
A collection of languages stemmers and stopwords for Lunr Javascript library
language-stemmer localization lunr lunr-languages stemmer stopwords
Last synced: 03 Apr 2025
https://github.com/aurelian/ruby-stemmer
Expose libstemmer_c to Ruby
c ruby ruby-extension rubynpl stemmer
Last synced: 12 Nov 2025
https://github.com/cadmiumcr/cadmium
Natural Language Processing (NLP) library for Crystal
crystal crystal-lang crystal-language inflector nlp phonetics readability sentiment-analysis shards stemmer string-distance tf-idf transliterator tries wordnet
Last synced: 10 May 2025
https://github.com/fredwu/stemmer
An English (Porter2) stemming implementation in Elixir.
Last synced: 05 Oct 2025
https://github.com/assem-ch/arabicstemmer
Assem's Arabic Light Stemmer is a snowball-based stemming algorithm for Arabic aimed mainly to improve search.
arabic language snowball snowball-framework stemmer
Last synced: 25 Jul 2025
https://github.com/words/stemmer
Fast Porter stemmer implementation
natural-language porter stemmer stemming
Last synced: 12 Dec 2025
https://github.com/Qutuf/Qutuf
Qutuf (قُطُوْف): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.
arabic arabic-language arabic-morphology arabic-nlp arabic-tagger expert-system heavy-stemming lemmatization light-stemming morphological-analysis overdue-tagging part-of-speech-tagger pattern-matching pos-tagging premature-tagging role-based root-extraction rooting stemmer stemming
Last synced: 07 May 2025
https://github.com/hexagon/thinker-fts
Fast and extendible Node.js/Javascript fulltext search engine.
factor fts full-text-search ranker search-engine stemmer suggestions thinker wordforms
Last synced: 05 Oct 2025
https://github.com/raypereda/stemmify
Ruby module that converts a word to its approximate root form with the Porter stemmer. For example, observing and observation reduce to observ.
porter-stemmer-algorithm ruby stemmer
Last synced: 20 Aug 2025
https://github.com/htaghizadeh/PersianStemmer-Python
PersianStemmer-Python
information-retrieval nlp persian persian-language persian-nlp persian-stemmer stemmer
Last synced: 09 Jul 2025
https://github.com/skroutz/turkish_stemmer
A simple Turkish stemming library
Last synced: 27 Apr 2025
https://github.com/bastienbot/nlp-js-tools-french
POS Tagger, lemmatizer and stemmer for french language in javascript
lemmatization lemmatizer nlp postagging postgresql stemmer stemming tokenization tokenizer
Last synced: 01 Aug 2025
https://github.com/words/lancaster-stemmer
Lancaster stemming algorithm
lancaster natural-language stemmer stemming
Last synced: 05 Apr 2026
https://github.com/sedthh/lara-hungarian-nlp
NLP class for rapid ChatBot development in Hungarian language
chatbot hungarian hungarian-language lemmatizer nlp python3 stemmer
Last synced: 10 May 2025
https://github.com/kampsy/gwizo
Simple Go implementation of the Porter Stemmer algorithm with powerful features.
consonants nlp nlp-stemming porter-stemmer-algorithm stemmer vowel
Last synced: 17 Aug 2025
https://github.com/antouanbg/Bulgarian_Linguistic
Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.
asr asr-model bulgarian-dataset bulgarian-models lematization machine-translation nlp stemmer tts tts-engines
Last synced: 27 Jan 2026
https://github.com/dfalbel/ptstem
Stemming Algorithms for the Portuguese Language
hunspell portuguese-language r stem stemmer stemming-algorithm
Last synced: 25 Jun 2025
https://github.com/kangfend/bahasa
Natural language toolkit for Indonesian Language (Bahasa)
bahasa indonesia natural-language-processing nlp nlp-python python sastrawi stemmer stemming
Last synced: 21 Jan 2026
https://github.com/winkjs/wink-porter2-stemmer
Javascript Implementation of Porter Stemmer Algorithm V2 by Dr Martin F Porter
natural-language-processing nlp porter-stemmer-algorithm porter-stemmer-v2 stemmer
Last synced: 30 Apr 2025
https://github.com/jonsafari/perstem
Persian stemmer and morphological analyzer
persian persian-language persian-nlp persian-stemmer stemmer transliterator
Last synced: 06 Nov 2025
https://github.com/upi-0/stemmid
stemming indonesian sentence.
sastrawi sastrawi-python stemmer
Last synced: 16 Jan 2026
https://github.com/nikolamilosevic86/serbianstemmer
Stemmer for serbian language created for my master thesis, rewritten in python
natural-language-processing python serbian-language stemmer
Last synced: 12 Apr 2025
https://github.com/dbklim/uk_stemmer
A small modification of the stemmer for the Ukrainian language (https://github.com/Amice13/ukr_stemmer)
natural-language-processing nlp stemmer stemmers stemming stemming-algorithm uk ukr ukrainian ukrainian-morphology
Last synced: 29 Apr 2025
https://github.com/localvoid/stemr
Javascript (TypeScript) implementation of the Snowball English (porter2) stemmer algorithm
javascript porter snowball stemmer text-processing typescript
Last synced: 11 Apr 2025
https://github.com/smileart/lemmingo
Defensive lemmatiser/stemmer written in Go ⊂( ⚆ ϖ⚆)っ
lemmatiser lemmatization nlp pos spell-checking stemmer tagset
Last synced: 14 Jan 2026
https://github.com/mtumilowicz/elasticsearch7-ngrams-fuzzy-shingles-stemming-workshop
Gentle introduction to basic elasticsearch constructs boosting search: ngrams, shingles, stemmers, suggesters and fuzzy queries.
edge-ngram elasticsearch fuzzy-query fuzzy-search kibana ngram search-as-you-type shingles stemmer stemming suggester workshop workshop-materials
Last synced: 11 Apr 2025
https://github.com/maximgorbatyuk/kazakh-stemmer-elasticsearch-plugin
Плагин для elasticsearch. Реализует функции стеммера казахского языка
elasticsearch elasticsearch-plugin kazakh kazakh-dictionary stemmer
Last synced: 22 Apr 2025
https://github.com/aztek/porterstemmer
An implementation of the Porter stemming algorithm in Scala
porter-stemming-algorithm scala stemmer
Last synced: 31 Jul 2025
https://github.com/tokenmill/snowball
Snowball version of the Porter stemmer for the Lithuanian language.
lithuanian-language nlp porter-stemmer snowball stemmer
Last synced: 01 Mar 2026
https://github.com/mrrefactoring/multilingual-stemmer
A NodeJS webasembly implementation of some popular snowball stemming algorithms
javascript nodejs stemmer stemmers stemming stemming-algorithm webassembly
Last synced: 16 Dec 2025
https://github.com/maxpatiiuk/porter-stemming
TypeScript implementation of the Porter Stemmer algorithm
Last synced: 22 Mar 2025
https://github.com/mhardalov/bulstem-py
Python re-implementation of the BulStem algorithm
bulgarian bulgarian-language natural-language-processing nlp nlp-library python python-library stemmer stemming-algorithm
Last synced: 31 Jan 2026
https://github.com/grishin/stemmersnet-standard
Unofficial port of StemmersNet library to .NET Standard and netcore
snowball stem stemmer stemmersnet
Last synced: 27 Jan 2026
https://github.com/mmahmoodictbd/solr-analysis-bn
Solr / Lucene Bangla Analyzer, Stem Filter, Stemmer.
bangla bengali solr solr-plugin solr-search stemmer stemming
Last synced: 26 Mar 2025
https://github.com/pommedeterresautee/unine
Unine light stemmer for French, German, Italian, Spanish, Portuguese, Finnish, Swedish
cran finish french german information-retrieval ir italian nlp portuguese rstats spanish stemmer swedish
Last synced: 04 Aug 2025
https://github.com/made2591/cognitive-system-postagger
A pos-tagging library with Viterbi, CYK and SVO -> XSV translator made as part of my final exam for the Cognitive System course in Department of Computer Science.
cky cognitive-services cognitive-systems computer-science corpora cyk department lemmatizer nlp nlp-library nlp-parsing nlp-stemming nltk nltk-grammar nlu postagger postagging sentence stemmer viterbi
Last synced: 31 May 2026
https://github.com/digitalheir/cebuano-dictionary-js
🇵🇭 A dictionary and stemmer for the Cebuano language spoken in the Philippines
cebuano cebuano-dictionary dictionary javascript philippines stemmer
Last synced: 11 Oct 2025
https://github.com/mihdan/mihdan-searchwp-stemmer-russian
Russian keyword stemmer Extension for SearchWP
php php5 php7 russian searchwp stemmer wordpress wordpress-plugin
Last synced: 22 Aug 2025
https://github.com/stcarrez/ada-stemmer
Multi natural language stemmer with Snowball generator
Last synced: 10 Jul 2025
https://github.com/rajspeaks/machine-learning-approach-to-bengali-corpus-tokenization-stemming-pos-tagging-using-bnltk
Machine Learning approach to Bengali Corpus POS Tagging using BNLTK. This is an experimenting project under the mentorship of Prof. Sandipan Ganguly, HIT-K.
bengali bengali-dataset bengali-language-processing bengali-natural-language-processing bengali-nlp english machine-learning natural-language-processing natural-language-understanding nlp nlp-library nlp-machine-learning postagger postagging rajdeep-das rajspeaks stemmer stemming tokenizer-parser
Last synced: 04 Apr 2025
https://github.com/oya163/nepali-stemmer
Simple rule-based Nepali stemmer. Flask web app deployed on Heroku platform. Created pip package.
flask heroku linguistics nepali-dictionary nepali-stemmer pip stemmer
Last synced: 31 Oct 2025
https://github.com/lgrz/polystem
Stemming algorithms in Rust
information-retrieval porter rust-lang stemmer
Last synced: 29 Jul 2025
https://github.com/hugoabonizio/stemmer.cr
:scissors: English language stemmer for Crystal
crystal nlp porter-stemming-algorithm stemmer
Last synced: 29 Oct 2025
https://github.com/nileshchat/christopher
A light-weight, robust Information Retrieval System
indexing information-retrieval stemmer tf-idf
Last synced: 24 Jan 2026
https://github.com/kimaruthagna/nlp_tuts
sample scripts that show use of NLP in python.Some will be proof of concepts while others will be tutorials
bag-of-words corpora frequency-distibution lemmatization matplotlib nlp nltk nltk-python pos-ta seaborn sentiment-analysis sentiment-polarity stemmer text-visualization tutorial word-cloud word2doc word2vec word2vec-algorithm xgboost
Last synced: 14 Jun 2025
https://github.com/fracpete/snowball-stemmers-weka-package
Weka package for the snowball stemmers (http://snowball.tartarus.org/).
java machine-learning plugin preprocessing stemmer stemmers weka
Last synced: 07 Sep 2025
https://github.com/amaccis/docker-php-libstemmer
Docker Alpine Linux environment with PHP onboard, its FFI extension enabled and the libstemmer compiled as a shared library.
Last synced: 09 Mar 2026
https://github.com/fracpete/ptstemmer-weka-package
Weka package for the PTStemmer (https://code.google.com/p/ptstemmer/).
java machine-learning nlp plugin preprocessing stemmer weka
Last synced: 28 Mar 2025
https://github.com/bean5/nlp-porter-stemmer-java
I forked the Java Porter Stemmer and optimized for Java 1.7 (the original porter stemmer was crashing).
contributed gh-pages java nlp stemmer stemming-algorithm
Last synced: 21 Jul 2025
https://github.com/andrianllmm/tagalog-stemmer
A Python library for Tagalog word stemming.
language-processing nlp stemmer tagalog
Last synced: 11 May 2026
https://github.com/andrianllmm/aklanon-stemmer
A Python library for Aklanon word stemming.
aklanon language-processing nlp stemmer
Last synced: 26 Oct 2025
https://github.com/tomsquest/lucene-stemmers
Stem words like Lucene (port of Lucene' stemmers to JavaScript)
Last synced: 13 Apr 2025
https://github.com/rajukoushik/information-retrieval
information-retrieval python stemmer
Last synced: 24 Jan 2026
https://github.com/mehrantsi/common-crawl-analyzer
Tools to extract and analyze domains and URLs from Common Crawl data files.
common-crawl large-dataset stemmer term-analysis term-frequency-inverse-document
Last synced: 16 May 2025
https://github.com/swelcker/cmd.csp.stemmer
Simple implementation of Snowball Stemmer (http://snowballstem.org/) in Java with Stemmers for 20+ languages. Helpful to reduce tokens to their core syntax esp. when processing them in Machine Learning Models (ML). (Natural Language Processing) features.
nlp nlp-library nlp-machine-learning nlp-parsing stemmer stemming-algorithm
Last synced: 12 Jun 2025
https://github.com/thomasbrockmeier/kpss_py3
Kraaij-Pohlmann Snowball Stemmer
dutch language language-processing linguistics nlp stemmer stemming stemming-algorithm
Last synced: 29 May 2026
https://github.com/hangsbreaker/stemming-ind
Javascript, PHP, Python Stemming Bahasa Indonesia
javascript nodejs php stem stemmer stemming stemming-algorithm
Last synced: 07 May 2026
https://github.com/thekorn/snowballstem.zig
zig wrapper for the snowball stemmer
bindings snowball stemmer zig zig-package
Last synced: 22 Feb 2025
https://github.com/elifftosunn/textdataclean
Kirli veri çekildiğinde ön işleme adımlarına gerek kalmadan model eğitimi için hazır hale getirmek amacıyla yapılan uygulamadır.
corpus deasciifier morphological-analysis ngram nltk numpy pandas sentence-embedding sentence-tokenizer stemmer stopwords string turkish turkish-sentence-tokenizer word-tokenizer
Last synced: 20 May 2026
https://github.com/firstlanguage/streamlit-firstlanguage
Streamlit components for FirstLanguage API
classification image-captioning lemmatizer morphological-analyser named-entity-recognition nlp-machine-learning postagger question-answering stemmer streamlit-component summary-generator table-qa translator
Last synced: 30 Dec 2025
https://github.com/sunscrapers/morfologik-stemmer-cli
Simple CLI tool for Morfologik Polish stemmer.
cli morfologik morfologik-plugin stemmer
Last synced: 23 Feb 2025
https://github.com/sazid1462/py-bangla-stemmer
Rule based Bengali Stemmer written in python
bangla bengali rule-based-stemmer stemmer
Last synced: 06 Apr 2026