https://github.com/NVIDIA-NeMo/Speech
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
https://github.com/NVIDIA-NeMo/Speech
asr deeplearning generative-ai machine-translation neural-networks speaker-diariazation speaker-recognition speech-synthesis speech-translation tts
Last synced: 1 day ago
JSON representation
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
- Host: GitHub
- URL: https://github.com/NVIDIA-NeMo/Speech
- Owner: NVIDIA-NeMo
- License: apache-2.0
- Created: 2019-08-05T20:16:42.000Z (almost 7 years ago)
- Default Branch: main
- Last Pushed: 2026-06-26T17:13:51.000Z (2 days ago)
- Last Synced: 2026-06-26T17:14:12.478Z (2 days ago)
- Topics: asr, deeplearning, generative-ai, machine-translation, neural-networks, speaker-diariazation, speaker-recognition, speech-synthesis, speech-translation, tts
- Language: Python
- Homepage: https://docs.nvidia.com/nemo/speech/nightly/index.html
- Size: 491 MB
- Stars: 17,567
- Watchers: 233
- Forks: 3,462
- Open Issues: 187
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Citation: CITATION.cff
- Codeowners: .github/CODEOWNERS
- Security: SECURITY.md
- Agents: AGENTS.md
Awesome Lists containing this project
- awesome-rainmana - NVIDIA-NeMo/Speech - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech) (Python)