{"id":20737442,"url":"https://github.com/candlewill/speech-corpus-collection","last_synced_at":"2025-12-25T04:53:27.801Z","repository":{"id":71816648,"uuid":"91787763","full_name":"candlewill/Speech-Corpus-Collection","owner":"candlewill","description":"A Collection of Speech Corpus for ASR and TTS","archived":false,"fork":false,"pushed_at":"2017-06-19T03:39:26.000Z","size":5,"stargazers_count":112,"open_issues_count":0,"forks_count":20,"subscribers_count":7,"default_branch":"master","last_synced_at":"2025-01-18T01:25:30.493Z","etag":null,"topics":["asr","corpus","dataset","tts"],"latest_commit_sha":null,"homepage":null,"language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/candlewill.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-05-19T09:07:01.000Z","updated_at":"2024-11-09T02:52:03.000Z","dependencies_parsed_at":"2023-06-11T01:00:17.401Z","dependency_job_id":null,"html_url":"https://github.com/candlewill/Speech-Corpus-Collection","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/candlewill%2FSpeech-Corpus-Collection","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/candlewill%2FSpeech-Corpus-Collection/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/candlewill%2FSpeech-Corpus-Collection/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/candlewill%2FSpeech-Corpus-Collection/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/candlewill","download_url":"https://codeload.github.com/candlewill/Speech-Corpus-Collection/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243024107,"owners_count":20223545,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["asr","corpus","dataset","tts"],"created_at":"2024-11-17T06:14:31.214Z","updated_at":"2025-12-25T04:53:27.742Z","avatar_url":"https://github.com/candlewill.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# Speech-Corpus-Collection\n\nThis repo is a collection of Speech Corpus for automatic speech recognition (ASR) and text-to-speech (TTS). \n\n### ASR Corpus\n\n1. [VCTK](http://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html)\n\u003cbr\u003eAround 10.4GB. [Alternative Host](http://www.udialogue.org/download/cstr-vctk-corpus.html)\n\n2. [LibriSpeech](http://www.openslr.org/12/)\n\u003cbr\u003eLarge-scale (1000 hours) corpus of read English speech.\n\n3. [TEDLIUM release 2](http://www-lium.univ-lemans.fr/en/content/ted-lium-corpus)\n\u003cbr\u003eThe TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website. The authors have prepared and filtered these data in order to train acoustic models to participate to the International Workshop on Spoken Language Translation 2011 (the LIUM English/French SLT system reached the first rank in the SLT task). \n\n### TTS Corpus\n\n1. [CMU ARCTIC Databases](http://festvox.org/cmu_arctic/)\n\u003cbr\u003eThe databases consist of around 1150 utterances, including US English male (bdl) and female (slt) speakers, as well as other accented speakers.\n\n2. [The World English Bible](http://www.audiotreasure.com/webindex.htm)\n\u003cbr\u003eThe World English Bible is a public domain update of the American Standard Version of 1901 into modern English. Its text and audio recordings are freely avaiable [here](http://www.audiotreasure.com/webindex.htm). Unfortunately, however, each of the audio files matches a chapter, not a verse, so is too long in most cases. [Kyubyong](https://github.com/Kyubyong/tacotron) sliced them by verse manually. You can get them on his [dropbox](https://dl.dropboxusercontent.com/u/42868014/WEB.zip).\n\n3. [Nancy Corpus](http://www.cstr.ed.ac.uk/projects/blizzard/2011/lessac_blizzard2011/)\n\u003cbr\u003eThe Nancy corpus from the 2011 Blizzard Challenge. The data is freely availiable for research use on the signing of a license. \n\n### General\n\n1. [The NSynth Dataset](https://magenta.tensorflow.org/datasets/nsynth)\n\u003cbr\u003eNSynth is an audio dataset containing 305,979 musical notes, each with a unique pitch, timbre, and envelope. For 1,006 instruments from commercial sample libraries, we generated four second, monophonic 16kHz audio snippets, referred to as notes, by ranging over every pitch of a standard MIDI pian o (21-108) as well as five different velocities (25, 50, 75, 100, 127). The note was held for the first three seconds and allowed to decay for the final second.\n\n\n### Contact Me\n\n[Yunchao He](mailto:yunchaohe@gmail.com)\n\u003cbr\u003e[Weibo](http://weibo.com/heyunchao)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcandlewill%2Fspeech-corpus-collection","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcandlewill%2Fspeech-corpus-collection","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcandlewill%2Fspeech-corpus-collection/lists"}