An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with text-normalization

A curated list of projects in awesome lists tagged with text-normalization.

https://github.com/nvidia/nemo-text-processing

NeMo text processing for ASR and TTS

inverse-text-n text-normalization

Last synced: 13 Apr 2025

https://github.com/NVIDIA/NeMo-text-processing

NeMo text processing for ASR and TTS

inverse-text-n text-normalization

Last synced: 18 Jul 2025

https://github.com/snakers4/russian_stt_text_normalization

Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks

python3 pytorch russian-language speech speech-to-text text-normalization torchscript

Last synced: 19 Jul 2025

https://github.com/tomaarsen/ttstextnormalization

Convert English text from written expressions into spoken forms

competition nlp normalization spoken-forms text-normalization tts

Last synced: 23 Apr 2025

https://github.com/sugatagh/E-commerce-Text-Classification

Proper categorization of e-commerce products enhances the user experience and achieves better results with external search engines. The objective of the project is to classify a product into one of four given categories, based on its description available on an e-commerce platform.

e-commerce natural-language-processing product-categorization text-classification text-normalization tf-idf word2vec

Last synced: 13 Apr 2025

https://github.com/seavleu/khmer-utils

A 🇰🇭 utility library for number formatting, currency display, date localization, text normalization, and script transliteration, built for Cambodian developers.

currency-conversion date-localization khmer-language locale number-formatting text-normalization transliteration

Last synced: 06 Mar 2026

https://github.com/stefantaubert/english-text-normalization

Command-line interface (CLI) and library to normalize English texts.

nlp preprocessing text-normalization tts

Last synced: 25 Jun 2025

https://github.com/mvakili/tokenizer

Spelling corrector and text normalizer

natural-language-processing text-normalization tokenizer

Last synced: 24 Jun 2025

https://github.com/moshetanzer/text-toolbox

A high-performance TypeScript library for string similarity, distance algorithms, and text normalization utilities

case-formatter cosine-similarity emoji fingerprint jaro-winkler levenshtein-distance metaphone natural-language-processing nlp string-comparison string-similarity text-normalization

Last synced: 28 Feb 2026

https://github.com/gaelic-ghost/textforspeech

Text normalization and conditioning for speech-safe Swift workflows.

accessibility swift swift-package text-normalization

Last synced: 17 Apr 2026

https://github.com/curegit/unicodecheck

Simple tool to check if Unicode text files are Unicode-normalized

character-encoding text-normalization unicode

Last synced: 11 Oct 2025

https://github.com/pszemraj/rehuman

Unicode-safe text cleaning & typographic normalization for Rust

no-emoji-in-code rust-crate text-normalization text-processing unicode

Last synced: 15 Jan 2026

https://github.com/camel-lab/codafication

Code, models, and data for "Exploiting Dialect Identification in Automatic Dialectal Text Normalization". ArabicNLP 2024, ACL.

arabic arabic-nlp deep-learning nlp text-normalization

Last synced: 25 Jan 2026