{"id":26167572,"url":"https://github.com/robmsmt/ASR-Audio-Data-Links","last_synced_at":"2025-03-11T17:39:46.964Z","repository":{"id":38270662,"uuid":"126870088","full_name":"robmsmt/ASR-Audio-Data-Links","owner":"robmsmt","description":"A list of publically available audio data that anyone can download for ASR or other speech activities","archived":false,"fork":false,"pushed_at":"2021-08-06T22:08:51.000Z","size":26,"stargazers_count":203,"open_issues_count":0,"forks_count":22,"subscribers_count":9,"default_branch":"master","last_synced_at":"2025-03-08T12:32:12.985Z","etag":null,"topics":["asr","audio-data","data","speech","speech-activities","speech-recognition","speech-to-text"],"latest_commit_sha":null,"homepage":null,"language":"Shell","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/robmsmt.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-03-26T18:10:23.000Z","updated_at":"2025-01-29T05:27:57.000Z","dependencies_parsed_at":"2022-08-18T06:10:42.837Z","dependency_job_id":null,"html_url":"https://github.com/robmsmt/ASR-Audio-Data-Links","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robmsmt%2FASR-Audio-Data-Links","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robmsmt%2FASR-Audio-Data-Links/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robmsmt%2FASR-Audio-Data-Links/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robmsmt%2FASR-Audio-Data-Links/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/robmsmt","download_url":"https://codeload.github.com/robmsmt/ASR-Audio-Data-Links/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243080235,"owners_count":20233136,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["asr","audio-data","data","speech","speech-activities","speech-recognition","speech-to-text"],"created_at":"2025-03-11T17:39:45.209Z","updated_at":"2025-03-11T17:39:46.958Z","avatar_url":"https://github.com/robmsmt.png","language":"Shell","funding_links":["https://opencollective.com/open_stt"],"categories":["Topic"],"sub_categories":["Automatic Speech Recognition (ASR)"],"readme":"# Audio Data Links\n\nA list of common publically (and privately) available audio data that you can download for ASR or other speech activities. All your WERs are belong to us. Inspired by [wer are we](https://github.com/syhw/wer_are_we) who stole someone elses joke.\n\n\n## 1. FREE\n\n**Source**|**Name \u0026 Direct Link**|**Type**|**Size(Hours)**\n:-----:|:-----:|:-----:|:-----:\n[OpenSLR](http://www.openslr.org/12)|LibriSpeech - Train:[100](http://www.openslr.org/resources/12/train-clean-100.tar.gz) [360](http://www.openslr.org/resources/12/train-clean-360.tar.gz) [500](http://www.openslr.org/resources/12/train-other-500.tar.gz)\u003cbr/\u003eTest:[Clean](http://www.openslr.org/resources/12/test-clean.tar.gz) [Other](http://www.openslr.org/resources/12/test-other.tar.gz) Dev:[Clean](http://www.openslr.org/resources/12/dev-clean.tar.gz) [Other](http://www.openslr.org/resources/12/dev-other.tar.gz)|Read|960\n[OpenSLR](http://www.openslr.org/19)|[TED-LIUM Release 2](http://www.openslr.org/resources/19/TEDLIUM_release2.tar.gz)|Read|118\n[OpenSLR](https://www.openslr.org/51/)|[TED-LIUM Release 3](http://www.openslr.org/resources/51/TEDLIUM_release-3.tgz)|Read|452\n[Voxforge](http://www.voxforge.org/home/downloads)|[Voxforge English](https://common-voice-data-download.s3.amazonaws.com/voxforge_corpus_v1.0.0.tar.gz)|Read|130\n[Mozilla](https://voice.mozilla.org)|[Common Voice v1](https://common-voice-data-download.s3.amazonaws.com/cv_corpus_v1.tar.gz)|Read|500 \n[Mozilla](https://voice.mozilla.org)|[Common Voice en_1087h_2019-06-12](https://voice-prod-bundler-ee1969a6ce8178826482b88e843c335139bd3fb4.s3.amazonaws.com/cv-corpus-3/en.tar.gz)|Read|1,087 \n[Tatoeba](http://tatoeba.org)|[Tatoeba Audio Eng](https://downloads.tatoeba.org/audio/tatoeba_audio_eng.zip)|Read|~200\n[Valentini](https://datashare.is.ed.ac.uk/handle/10283/2791)|Noisy Speech Database [All Files](http://datashare.is.ed.ac.uk/download/DS_10283_2791.zip), [DOI](https://doi.org/10.7488/ds/2117) |Read|TBC\n[VOiCES](https://iqtlabs.github.io/voices/)|Complex Environmental Settings [All Files](https://raw.githubusercontent.com/robmsmt/ASR-Audio-Data-Links/master/VOiCES_download.sh) |Read \u003cbr /\u003e LibriSpeech|15\n[ai4bharat](https://ai4bharat.org)|[NPTEL2020](https://github.com/AI4Bharat/NPTEL2020-Indian-English-Speech-Dataset) \u003cbr /\u003een-IN [Torrent](https://academictorrents.com/download/cc9dc56afd3055c7e0f021ec4f1824021558926c.torrent)|Lectures|15,700\n[Opencollective](https://opencollective.com/open_stt)|[open_stt](https://github.com/snakers4/open_stt/) \u003cbr /\u003eRussian [Torrent](https://academictorrents.com/download/95b4cab0f99850e119114c8b6df00193ab5fa34f.torrent)|Various Read/Presented|20,108\n[Speechcolab](https://arxiv.org/abs/2106.06909)|[GigaSpeech](https://github.com/SpeechColab/GigaSpeech) \u003cbr /\u003e [Link](https://github.com/SpeechColab/GigaSpeech#download)|Various Read/Presented|33,000 Unlabeled\u003cbr /\u003e10,000 Labeled \n\n\n## 2. PAID\n\n**Source**|**Name**|**Type**|**Size(Hours)**|**Code**\n:-----:|:-----:|:-----:|:-----:|:-----:\n[LDC](https://www.ldc.upenn.edu)|Fisher|Conversational|2000|Speech [LDC2004S13](https://catalog.ldc.upenn.edu/LDC2004S13) [LDC2005S13](https://catalog.ldc.upenn.edu/LDC2005S13)\u003cbr/\u003eTranscripts [LDC2004T19](https://catalog.ldc.upenn.edu/LDC2004T19) [LDC2005T19](https://catalog.ldc.upenn.edu/LDC2005T19) \n[LDC](https://www.ldc.upenn.edu)|Switchboard Hub 500|Conversational|240|[LDC2002S09](https://catalog.ldc.upenn.edu/LDC2002S09)\n[LDC](https://www.ldc.upenn.edu)|Switchboard Release 2|Conversational|300|[LDC97S62](https://catalog.ldc.upenn.edu/LDC97S62)\n[LDC](https://www.ldc.upenn.edu)|TIMIT|Read|5|[LDC93S1](https://catalog.ldc.upenn.edu/LDC93S1)\n[LDC](https://www.ldc.upenn.edu)|Wall Street Journal (WSJ)|Read|80|[LDC93S6A](https://catalog.ldc.upenn.edu/LDC93S6A) or [LDC93S6B](https://catalog.ldc.upenn.edu/LDC93S6B)\n\n\n# TTS\n\n## 1. FREE\n\n**Source**|**Name \u0026 Direct Link**|**Type**|**Size(Hours)**\n:-----:|:-----:|:-----:|:-----:\n[Edinburgh CSTR](https://datashare.is.ed.ac.uk/handle/10283/2651)|[CSTR VCTK Corpus](https://datashare.is.ed.ac.uk/bitstream/handle/10283/2651/VCTK-Corpus.zip?sequence=2\u0026isAllowed=y)|Read|44\n[LJ Speech](https://keithito.com/LJ-Speech-Dataset/)|[LJ Speech](http://data.keithito.com/data/speech/LJSpeech-1.1.tar.bz2)|Read|24\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frobmsmt%2FASR-Audio-Data-Links","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Frobmsmt%2FASR-Audio-Data-Links","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frobmsmt%2FASR-Audio-Data-Links/lists"}