An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with stemmer

A curated list of projects in awesome lists tagged with stemmer .

https://github.com/gutfeeling/word_forms

Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.

adjective adverb dictionary lemmatizer natural-language-processing nlp noun parts-of-speech stemmer verb-conjugations wordnet words

Last synced: 14 Jan 2026

https://github.com/mihaivalentin/lunr-languages

A collection of languages stemmers and stopwords for Lunr Javascript library

language-stemmer localization lunr lunr-languages stemmer stopwords

Last synced: 22 Oct 2025

https://github.com/MihaiValentin/lunr-languages

A collection of languages stemmers and stopwords for Lunr Javascript library

language-stemmer localization lunr lunr-languages stemmer stopwords

Last synced: 03 Apr 2025

https://github.com/aurelian/ruby-stemmer

Expose libstemmer_c to Ruby

c ruby ruby-extension rubynpl stemmer

Last synced: 12 Nov 2025

https://github.com/fredwu/stemmer

An English (Porter2) stemming implementation in Elixir.

bayes porter stemmer

Last synced: 05 Oct 2025

https://github.com/assem-ch/arabicstemmer

Assem's Arabic Light Stemmer is a snowball-based stemming algorithm for Arabic aimed mainly to improve search.

arabic language snowball snowball-framework stemmer

Last synced: 25 Jul 2025

https://github.com/words/stemmer

Fast Porter stemmer implementation

natural-language porter stemmer stemming

Last synced: 12 Dec 2025

https://github.com/deeplearningturkiye/kelime_kok_ayirici

Derin Öğrenme Tabanlı - seq2seq - Türkçe için kelime kökü bulma web uygulaması - Turkish Stemmer (tr_stemmer)

flask keras nlp python stemmer

Last synced: 30 Apr 2025

https://github.com/hexagon/thinker-fts

Fast and extendible Node.js/Javascript fulltext search engine.

factor fts full-text-search ranker search-engine stemmer suggestions thinker wordforms

Last synced: 05 Oct 2025

https://github.com/raypereda/stemmify

Ruby module that converts a word to its approximate root form with the Porter stemmer. For example, observing and observation reduce to observ.

porter-stemmer-algorithm ruby stemmer

Last synced: 20 Aug 2025

https://github.com/tebeka/snowball

Snowball stemmer for Go

golang snowball stemmer

Last synced: 13 Nov 2025

https://github.com/skroutz/turkish_stemmer

A simple Turkish stemming library

nlp stemmer

Last synced: 27 Apr 2025

https://github.com/wooorm/stmr.c

Porter Stemmer algorithm in C

porter stemmer stemming

Last synced: 19 Apr 2025

https://github.com/bastienbot/nlp-js-tools-french

POS Tagger, lemmatizer and stemmer for french language in javascript

lemmatization lemmatizer nlp postagging postgresql stemmer stemming tokenization tokenizer

Last synced: 01 Aug 2025

https://github.com/words/lancaster-stemmer

Lancaster stemming algorithm

lancaster natural-language stemmer stemming

Last synced: 05 Apr 2026

https://github.com/sedthh/lara-hungarian-nlp

NLP class for rapid ChatBot development in Hungarian language

chatbot hungarian hungarian-language lemmatizer nlp python3 stemmer

Last synced: 10 May 2025

https://github.com/kampsy/gwizo

Simple Go implementation of the Porter Stemmer algorithm with powerful features.

consonants nlp nlp-stemming porter-stemmer-algorithm stemmer vowel

Last synced: 17 Aug 2025

https://github.com/liderman/rustemmer

Golang implementation Porter Stemming for Russian language

fast golang package porter russian stemmer stemmers stemming

Last synced: 13 Jul 2025

https://github.com/antouanbg/Bulgarian_Linguistic

Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding tools/apps.

asr asr-model bulgarian-dataset bulgarian-models lematization machine-translation nlp stemmer tts tts-engines

Last synced: 27 Jan 2026

https://github.com/dfalbel/ptstem

Stemming Algorithms for the Portuguese Language

hunspell portuguese-language r stem stemmer stemming-algorithm

Last synced: 25 Jun 2025

https://github.com/kangfend/bahasa

Natural language toolkit for Indonesian Language (Bahasa)

bahasa indonesia natural-language-processing nlp nlp-python python sastrawi stemmer stemming

Last synced: 21 Jan 2026

https://github.com/winkjs/wink-porter2-stemmer

Javascript Implementation of Porter Stemmer Algorithm V2 by Dr Martin F Porter

natural-language-processing nlp porter-stemmer-algorithm porter-stemmer-v2 stemmer

Last synced: 30 Apr 2025

https://github.com/jonsafari/perstem

Persian stemmer and morphological analyzer

persian persian-language persian-nlp persian-stemmer stemmer transliterator

Last synced: 06 Nov 2025

https://github.com/upi-0/stemmid

stemming indonesian sentence.

sastrawi sastrawi-python stemmer

Last synced: 16 Jan 2026

https://github.com/nikolamilosevic86/serbianstemmer

Stemmer for serbian language created for my master thesis, rewritten in python

natural-language-processing python serbian-language stemmer

Last synced: 12 Apr 2025

https://github.com/dbklim/uk_stemmer

A small modification of the stemmer for the Ukrainian language (https://github.com/Amice13/ukr_stemmer)

natural-language-processing nlp stemmer stemmers stemming stemming-algorithm uk ukr ukrainian ukrainian-morphology

Last synced: 29 Apr 2025

https://github.com/nadar/stemming

PHP Stemming Collection

languages php stemmer stemmers stemming

Last synced: 13 Apr 2025

https://github.com/localvoid/stemr

Javascript (TypeScript) implementation of the Snowball English (porter2) stemmer algorithm

javascript porter snowball stemmer text-processing typescript

Last synced: 11 Apr 2025

https://github.com/smileart/lemmingo

Defensive lemmatiser/stemmer written in Go ⊂( ⚆ ϖ⚆)っ

lemmatiser lemmatization nlp pos spell-checking stemmer tagset

Last synced: 14 Jan 2026

https://github.com/mtumilowicz/elasticsearch7-ngrams-fuzzy-shingles-stemming-workshop

Gentle introduction to basic elasticsearch constructs boosting search: ngrams, shingles, stemmers, suggesters and fuzzy queries.

edge-ngram elasticsearch fuzzy-query fuzzy-search kibana ngram search-as-you-type shingles stemmer stemming suggester workshop workshop-materials

Last synced: 11 Apr 2025

https://github.com/maximgorbatyuk/kazakh-stemmer-elasticsearch-plugin

Плагин для elasticsearch. Реализует функции стеммера казахского языка

elasticsearch elasticsearch-plugin kazakh kazakh-dictionary stemmer

Last synced: 22 Apr 2025

https://github.com/aztek/porterstemmer

An implementation of the Porter stemming algorithm in Scala

porter-stemming-algorithm scala stemmer

Last synced: 31 Jul 2025

https://github.com/wooorm/stmr

Porter Stemmer CLI

porter stemmer stemming

Last synced: 19 Apr 2025

https://github.com/tokenmill/snowball

Snowball version of the Porter stemmer for the Lithuanian language.

lithuanian-language nlp porter-stemmer snowball stemmer

Last synced: 01 Mar 2026

https://github.com/mrrefactoring/multilingual-stemmer

A NodeJS webasembly implementation of some popular snowball stemming algorithms

javascript nodejs stemmer stemmers stemming stemming-algorithm webassembly

Last synced: 16 Dec 2025

https://github.com/maxpatiiuk/porter-stemming

TypeScript implementation of the Porter Stemmer algorithm

porter stemmer stemming

Last synced: 22 Mar 2025

https://github.com/grishin/stemmersnet-standard

Unofficial port of StemmersNet library to .NET Standard and netcore

snowball stem stemmer stemmersnet

Last synced: 27 Jan 2026

https://github.com/mmahmoodictbd/solr-analysis-bn

Solr / Lucene Bangla Analyzer, Stem Filter, Stemmer.

bangla bengali solr solr-plugin solr-search stemmer stemming

Last synced: 26 Mar 2025

https://github.com/pommedeterresautee/unine

Unine light stemmer for French, German, Italian, Spanish, Portuguese, Finnish, Swedish

cran finish french german information-retrieval ir italian nlp portuguese rstats spanish stemmer swedish

Last synced: 04 Aug 2025

https://github.com/made2591/cognitive-system-postagger

A pos-tagging library with Viterbi, CYK and SVO -> XSV translator made as part of my final exam for the Cognitive System course in Department of Computer Science.

cky cognitive-services cognitive-systems computer-science corpora cyk department lemmatizer nlp nlp-library nlp-parsing nlp-stemming nltk nltk-grammar nlu postagger postagging sentence stemmer viterbi

Last synced: 31 May 2026

https://github.com/digitalheir/cebuano-dictionary-js

🇵🇭 A dictionary and stemmer for the Cebuano language spoken in the Philippines

cebuano cebuano-dictionary dictionary javascript philippines stemmer

Last synced: 11 Oct 2025

https://github.com/mihdan/mihdan-searchwp-stemmer-russian

Russian keyword stemmer Extension for SearchWP

php php5 php7 russian searchwp stemmer wordpress wordpress-plugin

Last synced: 22 Aug 2025

https://github.com/stcarrez/ada-stemmer

Multi natural language stemmer with Snowball generator

ada stemmer

Last synced: 10 Jul 2025

https://github.com/abadojack/stemmer

Simple stemmer for Esperanto

golang nlp-stemming stemmer

Last synced: 22 Feb 2026

https://github.com/oya163/nepali-stemmer

Simple rule-based Nepali stemmer. Flask web app deployed on Heroku platform. Created pip package.

flask heroku linguistics nepali-dictionary nepali-stemmer pip stemmer

Last synced: 31 Oct 2025

https://github.com/lgrz/polystem

Stemming algorithms in Rust

information-retrieval porter rust-lang stemmer

Last synced: 29 Jul 2025

https://github.com/chief/greek_stemmer

A Clojure Greek stemmer approach

nlp stemmer

Last synced: 04 Jan 2026

https://github.com/hugoabonizio/stemmer.cr

:scissors: English language stemmer for Crystal

crystal nlp porter-stemming-algorithm stemmer

Last synced: 29 Oct 2025

https://github.com/nileshchat/christopher

A light-weight, robust Information Retrieval System

indexing information-retrieval stemmer tf-idf

Last synced: 24 Jan 2026

https://github.com/fracpete/snowball-stemmers-weka-package

Weka package for the snowball stemmers (http://snowball.tartarus.org/).

java machine-learning plugin preprocessing stemmer stemmers weka

Last synced: 07 Sep 2025

https://github.com/amaccis/docker-php-libstemmer

Docker Alpine Linux environment with PHP onboard, its FFI extension enabled and the libstemmer compiled as a shared library.

libstemmer php php8 stemmer

Last synced: 09 Mar 2026

https://github.com/sadit/snowballstemmer.jl

Julia's wrapper for libstemmer

julia nlp snowball stemmer

Last synced: 19 Sep 2025

https://github.com/fracpete/ptstemmer-weka-package

Weka package for the PTStemmer (https://code.google.com/p/ptstemmer/).

java machine-learning nlp plugin preprocessing stemmer weka

Last synced: 28 Mar 2025

https://github.com/bean5/nlp-porter-stemmer-java

I forked the Java Porter Stemmer and optimized for Java 1.7 (the original porter stemmer was crashing).

contributed gh-pages java nlp stemmer stemming-algorithm

Last synced: 21 Jul 2025

https://github.com/andrianllmm/tagalog-stemmer

A Python library for Tagalog word stemming.

language-processing nlp stemmer tagalog

Last synced: 11 May 2026

https://github.com/eilvelia/porter2.js

Fastest JavaScript implementation of the porter2 stemming algorithm

english porter snowball stemmer stemming

Last synced: 29 Apr 2025

https://github.com/andrianllmm/aklanon-stemmer

A Python library for Aklanon word stemming.

aklanon language-processing nlp stemmer

Last synced: 26 Oct 2025

https://github.com/putuwaw/linggapy

Library for Stemming Balinese Text Language

balinese nlp python stemmer stemming thesis

Last synced: 20 Feb 2026

https://github.com/tomsquest/lucene-stemmers

Stem words like Lucene (port of Lucene' stemmers to JavaScript)

lucene stem stemmer stemming

Last synced: 13 Apr 2025

https://github.com/mehrantsi/common-crawl-analyzer

Tools to extract and analyze domains and URLs from Common Crawl data files.

common-crawl large-dataset stemmer term-analysis term-frequency-inverse-document

Last synced: 16 May 2025

https://github.com/swelcker/cmd.csp.stemmer

Simple implementation of Snowball Stemmer (http://snowballstem.org/) in Java with Stemmers for 20+ languages. Helpful to reduce tokens to their core syntax esp. when processing them in Machine Learning Models (ML). (Natural Language Processing) features.

nlp nlp-library nlp-machine-learning nlp-parsing stemmer stemming-algorithm

Last synced: 12 Jun 2025

https://github.com/mitica/root-name

Extracts root name of a name.

name root root-name stemmer stemming

Last synced: 12 May 2026

https://github.com/hangsbreaker/stemming-ind

Javascript, PHP, Python Stemming Bahasa Indonesia

javascript nodejs php stem stemmer stemming stemming-algorithm

Last synced: 07 May 2026

https://github.com/thekorn/snowballstem.zig

zig wrapper for the snowball stemmer

bindings snowball stemmer zig zig-package

Last synced: 22 Feb 2025

https://github.com/elifftosunn/textdataclean

Kirli veri çekildiğinde ön işleme adımlarına gerek kalmadan model eğitimi için hazır hale getirmek amacıyla yapılan uygulamadır.

corpus deasciifier morphological-analysis ngram nltk numpy pandas sentence-embedding sentence-tokenizer stemmer stopwords string turkish turkish-sentence-tokenizer word-tokenizer

Last synced: 20 May 2026

https://github.com/buda-base/stemmer

Fork of the Egothor2 stemmer code

stemmer trie

Last synced: 13 Mar 2026

https://github.com/fco/rslp

stemmer

Last synced: 22 Jan 2026

https://github.com/sunscrapers/morfologik-stemmer-cli

Simple CLI tool for Morfologik Polish stemmer.

cli morfologik morfologik-plugin stemmer

Last synced: 23 Feb 2025

https://github.com/sazid1462/py-bangla-stemmer

Rule based Bengali Stemmer written in python

bangla bengali rule-based-stemmer stemmer

Last synced: 06 Apr 2026