Projects in Awesome Lists tagged with text-segmentation
A curated list of projects in awesome lists tagged with text-segmentation .
https://github.com/catalyst-team/catalyst
Accelerated deep learning R&D
computer-vision deep-learning distributed-computing image-classification image-processing image-segmentation information-retrieval infrastructure machine-learning metric-learning natural-language-processing object-detection python pytorch recommender-system reinforcement-learning reproducibility research text-classification text-segmentation
Last synced: 13 May 2025
https://github.com/wolfgarbe/SymSpell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
approximate-string-matching chinese-text-segmentation chinese-word-segmentation damerau-levenshtein edit-distance fuzzy-matching fuzzy-search levenshtein levenshtein-distance spell-check spellcheck spelling spelling-correction symspell text-segmentation word-segmentation
Last synced: 13 Mar 2025
https://github.com/wolfgarbe/symspell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
approximate-string-matching chinese-text-segmentation chinese-word-segmentation damerau-levenshtein edit-distance fuzzy-matching fuzzy-search levenshtein levenshtein-distance spell-check spellcheck spelling spelling-correction symspell text-segmentation word-segmentation
Last synced: 10 Jul 2025
https://github.com/blmoistawinde/harvesttext
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
dependency-parser gitee harvesttext keyword-extraction named-entity-recognition new-word-discovery nlp pyhanlp sentiment-analysis text-cleaning text-segmentation text-summarization unsupervised
Last synced: 14 May 2025
https://github.com/blmoistawinde/HarvestText
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
dependency-parser gitee harvesttext keyword-extraction named-entity-recognition new-word-discovery nlp pyhanlp sentiment-analysis text-cleaning text-segmentation text-summarization unsupervised
Last synced: 18 Mar 2025
https://github.com/ogkalu2/comic-translate
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
anime comics computer-vision deep-learning gui inpainting machine-translation manga manhua manhwa neural-network ocr pyside6 python pytorch segmentation text-detection text-segmentation translation webtoons
Last synced: 06 Feb 2026
https://github.com/mammothb/symspellpy
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
approximate-string-matching chinese-text-segmentation chinese-word-segmentation damerau-levenshtein edit-distance fuzzy-matching fuzzy-search levenshtein levenshtein-distance python spell-check spellcheck spelling spelling-correction symspell text-segmentation word-segmentation
Last synced: 13 Feb 2026
https://github.com/cbaziotis/ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
nlp nlp-library semeval spell-corrector spelling-correction text-processing text-segmentation tokenization tokenizer word-normalization word-segmentation
Last synced: 14 Jan 2026
https://github.com/viig99/symspellcpppy
Fast SymSpell written in c++ and exposes to python via pybind11
compound-words fuzzy-matching fuzzy-search pybind11 python spell-check spellcheck spelling spelling-correction spelling-corrector symspell text-segmentation word-segmentation
Last synced: 09 Apr 2025
https://github.com/reubenbond/hanbaobao
Mandarin Chinese text segmentation and mobile dictionary Android app (中文分词)
android chinese chinese-text-segmentation dictionary-data pinyin text-segmentation transliteration
Last synced: 25 Mar 2025
https://github.com/eskriett/spell
Spelling correction and string segmentation written in Go
golang spell-check spellcheck spelling spelling-correction string-segmentation symspell text-segmentation word-segmentation
Last synced: 10 Apr 2025
https://github.com/rlayers/pawpaw
Text Processing & Segmentation Framework
extract-text hierarchical-text-segmentation information-extraction knowledge-graph lexer natural-language-processing nlp parser python query-engine query-language text-processing text-segmentation tree xml-parser xmlparser
Last synced: 08 Apr 2026
https://github.com/nitely/nim-segmentation
Unicode text segmentation (tr29)
nim text-segmentation unicode word-break
Last synced: 03 Apr 2026
https://github.com/zamgi/lingvo--textsegmenter
Text segmentation into separate words using a simple unigram model and the Viterbi algorithm
linguistics lingvo natural-language-processing nlp text-segmentation viterbi-algorithm
Last synced: 06 Apr 2025
https://github.com/npillmayer/uax
Unicode Text Segmentation Algorithms
text-processing text-segmentation unicode
Last synced: 25 Sep 2025
https://github.com/dhavaltaunk08/text-segmentation-in-images
This project aimed to perform text segmentation in images using AutoEncoders.
autoencoders deep-learning ipython-notebook python3 text-segmentation
Last synced: 17 Jun 2026
https://github.com/quantumwizard888/how-to-add-user-dictionary-to-mecab
How to add user dictionary to MeCab
guide mecab natural-language-processing text-segmentation
Last synced: 08 Jan 2026
https://github.com/craigtrim/fast-sentence-segment
Fast and Efficient Sentence Segmentation
natural-language-processing nlp python segmentation sentence-segmentation spacy text-processing text-segmentation
Last synced: 29 Jan 2026
https://github.com/dobatymo/graphseg-python
graphseq natural-language-processing python text-segmentation
Last synced: 16 Mar 2025
https://github.com/danburzo/ltr
Split text into chars, words, or sentences from the command line.
Last synced: 10 Aug 2025
https://github.com/sigpwned/uax29
Java implementation of UAX#29 text segmentation algorithm
java text-segmentation uax29 unicode
Last synced: 16 Jul 2025
https://github.com/arxiver/onepiecelang
Text segmentation solution using natural language processing.
bigram bigram-model dp dynamic-programming machine-intelligence machine-learning natural-language-processing nlp nlp-machine-learning text-segmentation unigram unigram-model viterbi viterbi-algorithm word word-segmentation
Last synced: 03 Aug 2025