An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with text-normalization

A curated list of projects in awesome lists tagged with text-normalization .

https://github.com/nvidia/nemo-text-processing

NeMo text processing for ASR and TTS

inverse-text-n text-normalization

Last synced: 13 Apr 2025

https://github.com/NVIDIA/NeMo-text-processing

NeMo text processing for ASR and TTS

inverse-text-n text-normalization

Last synced: 18 Jul 2025

https://github.com/snakers4/russian_stt_text_normalization

Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks

python3 pytorch russian-language speech speech-to-text text-normalization torchscript

Last synced: 19 Jul 2025

https://github.com/tomaarsen/ttstextnormalization

Convert English text from written expressions into spoken forms

competition nlp normalization spoken-forms text-normalization tts

Last synced: 23 Apr 2025

https://github.com/sugatagh/E-commerce-Text-Classification

Proper categorization of e-commerce products enhances the user experience and achieves better results with external search engines. The objective of the project is to classify a product into four given categories, based on its description available on an e-commerce platform.

e-commerce natural-language-processing product-categorization text-classification text-normalization tf-idf word2vec

Last synced: 13 Apr 2025

https://github.com/stefantaubert/english-text-normalization

Command-line interface (CLI) and library to normalize English texts.

nlp preprocessing text-normalization tts

Last synced: 25 Jun 2025

https://github.com/seavleu/khmer-utils

A ๐Ÿ‡ฐ๐Ÿ‡ญ utility library for number formatting, currency display, date localization, text normalization, and script transliteration, built for Cambodian developers.

currency-conversion date-localization khmer-language locale number-formatting text-normalization transliteration

Last synced: 10 Apr 2025

https://github.com/mvakili/tokenizer

Spelling corrector and text normalizer

natural-language-processing text-normalization tokenizer

Last synced: 24 Jun 2025

https://github.com/pszemraj/rehuman

Unicode-safe text cleaning & typographic normalization for Rust

no-emoji-in-code rust-crate text-normalization text-processing unicode

Last synced: 15 Jan 2026

https://github.com/curegit/unicodecheck

Simple tool to check if Unicode text files are Unicode-normalized

character-encoding text-normalization unicode

Last synced: 11 Oct 2025

https://github.com/camel-lab/codafication

Code, models, and data for "Exploiting Dialect Identification in Automatic Dialectal Text Normalization". ArabicNLP 2024, ACL.

arabic arabic-nlp deep-learning nlp text-normalization

Last synced: 25 Jan 2026