Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with speech-corpus
A curated list of projects in awesome lists tagged with speech-corpus .
https://github.com/clovaai/clovacall
ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
call-based-speech-corpus goal-oriented-dialog interspeech2020 korean-speech speech-corpus speech-recognition
Last synced: 12 Nov 2024
https://github.com/kan-bayashi/librittslabel
Alignment files of LibriTTS.
speech-corpus speech-synthesis
Last synced: 02 Dec 2024
https://github.com/lennes/spect
SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/
analysis annotation conversational-speech corpus-linguistics corpus-tools praat spect speech speech-analysis speech-corpus spoken-language transcript transcription
Last synced: 04 Nov 2024
https://github.com/dcavar/elan2split
Split ELAN Annotation Files and corresponding speech files into a corpus format for common ASR and Forced Aligners
cpp11 elan forced-alignment sox speech-corpus speech-recognition xerxes xml
Last synced: 07 Nov 2024
https://github.com/mahtafetrat/manatts-persian-speech-dataset
ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
data-collection data-preprocessing dataset-preparation forced-alignment mana-tts persian persian-speech speech-corpus speech-data-collection speech-dataset speech-processing speech-synthesis text-to-speech text-to-speech-dataset tts tts-dataset
Last synced: 06 Nov 2024
https://github.com/vectominist/switchboard-wsj-utils
Utilities for preprocessing the Switchboard and WSJ corpora in Python3
python speech-corpus switchboard torchaudio wsj wtimit
Last synced: 02 Dec 2024
https://github.com/mahtafetrat/gptinformal-persian-speech-dataset
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
data-collection data-preprocessing dataset-preparation forced-alignment mana-tts manatts persian persian-speech speech-corpus speech-data-collection speech-dataset speech-processing speech-synthesis text-to-speech text-to-speech-dataset tts tts-dataset
Last synced: 19 Dec 2024