An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with speech-dataset

A curated list of projects in awesome lists tagged with speech-dataset .

https://github.com/mahtafetrat/manatts-persian-speech-dataset

ManaTTS is the largest open Persian speech dataset with 100+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

data-collection data-preprocessing dataset-preparation forced-alignment mana-tts persian persian-speech speech-corpus speech-data-collection speech-dataset speech-processing speech-synthesis text-to-speech text-to-speech-dataset tts tts-dataset

Last synced: 08 Apr 2025

https://github.com/MahtaFetrat/ManaTTS-Persian-Speech-Dataset

ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

data-collection data-preprocessing dataset-preparation forced-alignment mana-tts persian persian-speech speech-corpus speech-data-collection speech-dataset speech-processing speech-synthesis text-to-speech text-to-speech-dataset tts tts-dataset

Last synced: 01 Mar 2025

https://github.com/revsic/speechset

Numpy-librosa implementation of Speech dataset pipeline

preprocessor speech-dataset tts vocoder

Last synced: 26 Jun 2025

https://github.com/kanishknavale/speech-emotion-recognition

A simple CNN-LSTM deep neural model using Tensorflow to classify emotions from a speech dataset

cnn deep-learning lstm speech-dataset speech-emotion-recognition tensorflow

Last synced: 22 Apr 2025

https://github.com/mahtafetrat/mana-forced-aligner

A robust forced alignment tool for low-resource languages using multiple ASR models and CER-based matching. Built for noisy data and imperfect transcripts.

asr forced-alignment low-resource-languages mana-tts manatts open-source speech-alignment speech-dataset tts

Last synced: 19 Jun 2025

https://github.com/mahtafetrat/virgoolinformal-speech-dataset

A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.

asr asr-evaluation forced-alignment persian persian-speech-corpus persian-speech-dataset persian-speech-recognition persian-text-to-speech speech-data-collection speech-dataset speech-processing tts

Last synced: 15 Apr 2025