An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with text-segmentation

A curated list of projects in awesome lists tagged with text-segmentation .

https://github.com/blmoistawinde/harvesttext

文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法

dependency-parser gitee harvesttext keyword-extraction named-entity-recognition new-word-discovery nlp pyhanlp sentiment-analysis text-cleaning text-segmentation text-summarization unsupervised

Last synced: 14 May 2025

https://github.com/blmoistawinde/HarvestText

文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法

dependency-parser gitee harvesttext keyword-extraction named-entity-recognition new-word-discovery nlp pyhanlp sentiment-analysis text-cleaning text-segmentation text-summarization unsupervised

Last synced: 18 Mar 2025

https://github.com/ogkalu2/comic-translate

Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.

anime comics computer-vision deep-learning gui inpainting machine-translation manga manhua manhwa neural-network ocr pyside6 python pytorch segmentation text-detection text-segmentation translation webtoons

Last synced: 06 Feb 2026

https://github.com/cbaziotis/ekphrasis

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

nlp nlp-library semeval spell-corrector spelling-correction text-processing text-segmentation tokenization tokenizer word-normalization word-segmentation

Last synced: 14 Jan 2026

https://github.com/reubenbond/hanbaobao

Mandarin Chinese text segmentation and mobile dictionary Android app (中文分词)

android chinese chinese-text-segmentation dictionary-data pinyin text-segmentation transliteration

Last synced: 25 Mar 2025

https://github.com/nitely/nim-segmentation

Unicode text segmentation (tr29)

nim text-segmentation unicode word-break

Last synced: 03 Apr 2026

https://github.com/zamgi/lingvo--textsegmenter

Text segmentation into separate words using a simple unigram model and the Viterbi algorithm

linguistics lingvo natural-language-processing nlp text-segmentation viterbi-algorithm

Last synced: 06 Apr 2025

https://github.com/npillmayer/uax

Unicode Text Segmentation Algorithms

text-processing text-segmentation unicode

Last synced: 25 Sep 2025

https://github.com/dhavaltaunk08/text-segmentation-in-images

This project aimed to perform text segmentation in images using AutoEncoders.

autoencoders deep-learning ipython-notebook python3 text-segmentation

Last synced: 17 Jun 2026

https://github.com/danburzo/ltr

Split text into chars, words, or sentences from the command line.

text-segmentation

Last synced: 10 Aug 2025

https://github.com/sigpwned/uax29

Java implementation of UAX#29 text segmentation algorithm

java text-segmentation uax29 unicode

Last synced: 16 Jul 2025

https://github.com/jessestuart/jstexttiling

OSS Text Segmentation library.

groovy nlp text-segmentation

Last synced: 24 Mar 2025