{"id":17865755,"url":"https://github.com/r9y9/jsut-lab","last_synced_at":"2026-01-11T04:01:41.769Z","repository":{"id":52859595,"uuid":"211091946","full_name":"r9y9/jsut-lab","owner":"r9y9","description":"HTS-style full-context labels for JSUT v1.1","archived":false,"fork":false,"pushed_at":"2021-04-16T05:59:59.000Z","size":22571,"stargazers_count":46,"open_issues_count":1,"forks_count":2,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-04-02T22:11:23.658Z","etag":null,"topics":["dataset","hts","jsut","speech-synthesis","text-to-speech","tts","voice-conversion"],"latest_commit_sha":null,"homepage":"https://sites.google.com/site/shinnosuketakamichi/publication/jsut","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/r9y9.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-09-26T13:09:46.000Z","updated_at":"2024-09-17T21:50:14.000Z","dependencies_parsed_at":"2022-08-23T09:41:36.951Z","dependency_job_id":null,"html_url":"https://github.com/r9y9/jsut-lab","commit_stats":null,"previous_names":[],"tags_count":2,"template":false,"template_full_name":null,"purl":"pkg:github/r9y9/jsut-lab","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/r9y9%2Fjsut-lab","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/r9y9%2Fjsut-lab/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/r9y9%2Fjsut-lab/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/r9y9%2Fjsut-lab/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/r9y9","download_url":"https://codeload.github.com/r9y9/jsut-lab/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/r9y9%2Fjsut-lab/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28280298,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-11T03:48:11.750Z","status":"ssl_error","status_checked_at":"2026-01-11T03:48:02.765Z","response_time":60,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dataset","hts","jsut","speech-synthesis","text-to-speech","tts","voice-conversion"],"created_at":"2024-10-28T09:24:48.628Z","updated_at":"2026-01-11T04:01:41.750Z","avatar_url":"https://github.com/r9y9.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# jsut-lab\n\n[![DOI](https://zenodo.org/badge/211091946.svg)](https://zenodo.org/badge/latestdoi/211091946)\n\nThe repository provides HTK/HTS-style alignment files with additional full-context labels for [JSUT (Japanese speech corpus of Saruwatari-lab., University of Tokyo)](https://sites.google.com/site/shinnosuketakamichi/publication/jsut) corpus (v1.1). All alignment files (.lab) were extracted by forced-alignment using [Julius](https://github.com/julius-speech/julius) and full-contexts are generated by [OpenJTalk](http://open-jtalk.sp.nitech.ac.jp/).\n\nThe label files are expected to be used for speech reseach; e.g., text-to-speech and voice conversion.\n\nDirectory structure is exactly same as the JSUT. You can put the label files to the JSUT data directory if you want:\n\n```\ntree ~/data/jsut_ver1.1/ -d -L 2\n/home/ryuichi/data/jsut_ver1.1/\n├── basic5000\n│   ├── lab\n│   └── wav\n├── countersuffix26\n│   ├── lab\n│   └── wav\n├── loanword128\n│   ├── lab\n│   └── wav\n├── onomatopee300\n│   ├── lab\n│   └── wav\n├── precedent130\n│   ├── lab\n│   └── wav\n├── repeat500\n│   ├── lab\n│   └── wav\n├── travel1000\n│   ├── lab\n│   └── wav\n├── utparaphrase512\n│   ├── lab\n│   └── wav\n└── voiceactress100\n    ├── lab\n    └── wav\n```\n\n## Label format\n\nFields: `\u003cbegin_time\u003e \u003cend_time\u003e \u003cfull-context-label\u003e`. Time are in 100ns units as same as HTK labels.\n\n```\n$ cat basic5000/lab/BASIC5000_0773.lab | head\n \n0 2525000 xx^xx-sil+s=a/A:xx+xx+xx/B:xx-xx_xx/C:xx_xx+xx/D:18+xx_xx/E:xx_xx!xx_xx-xx/F:xx_xx#xx_xx@xx_xx|xx_xx/G:6_3%0_xx_xx/H:xx_xx/I:xx-xx@xx+xx\u0026xx-xx|xx+xx/J:1_6/K:3+6-32\n2525000 3825000 xx^sil-s+a=N/A:-2+1+6/B:xx-xx_xx/C:18_xx+xx/D:24+xx_xx/E:xx_xx!xx_xx-xx/F:6_3#0_xx@1_1|1_6/G:3_1%0_xx_0/H:xx_xx/I:1-6@1+3\u00261-6|1+32/J:2_10/K:3+6-32\n3825000 4825000 sil^s-a+N=g/A:-2+1+6/B:xx-xx_xx/C:18_xx+xx/D:24+xx_xx/E:xx_xx!xx_xx-xx/F:6_3#0_xx@1_1|1_6/G:3_1%0_xx_0/H:xx_xx/I:1-6@1+3\u00261-6|1+32/J:2_10/K:3+6-32\n4825000 5825000 s^a-N+g=i/A:-1+2+5/B:xx-xx_xx/C:18_xx+xx/D:24+xx_xx/E:xx_xx!xx_xx-xx/F:6_3#0_xx@1_1|1_6/G:3_1%0_xx_0/H:xx_xx/I:1-6@1+3\u00261-6|1+32/J:2_10/K:3+6-32\n5825000 6125000 a^N-g+i=i/A:0+3+4/B:xx-xx_xx/C:18_xx+xx/D:24+xx_xx/E:xx_xx!xx_xx-xx/F:6_3#0_xx@1_1|1_6/G:3_1%0_xx_0/H:xx_xx/I:1-6@1+3\u00261-6|1+32/J:2_10/K:3+6-32\n6125000 7524999 N^g-i+i=N/A:0+3+4/B:xx-xx_xx/C:18_xx+xx/D:24+xx_xx/E:xx_xx!xx_xx-xx/F:6_3#0_xx@1_1|1_6/G:3_1%0_xx_0/H:xx_xx/I:1-6@1+3\u00261-6|1+32/J:2_10/K:3+6-32\n7524999 8125000 g^i-i+N=w/A:1+4+3/B:xx-xx_xx/C:18_xx+xx/D:24+xx_xx/E:xx_xx!xx_xx-xx/F:6_3#0_xx@1_1|1_6/G:3_1%0_xx_0/H:xx_xx/I:1-6@1+3\u00261-6|1+32/J:2_10/K:3+6-32\n8125000 8425000 i^i-N+w=a/A:2+5+2/B:xx-xx_xx/C:18_xx+xx/D:24+xx_xx/E:xx_xx!xx_xx-xx/F:6_3#0_xx@1_1|1_6/G:3_1%0_xx_0/H:xx_xx/I:1-6@1+3\u00261-6|1+32/J:2_10/K:3+6-32\n8425000 10125000 i^N-w+a=pau/A:3+6+1/B:18-xx_xx/C:24_xx+xx/D:07+xx_xx/E:xx_xx!xx_xx-xx/F:6_3#0_xx@1_1|1_6/G:3_1%0_xx_0/H:xx_xx/I:1-6@1+3\u00261-6|1+32/J:2_10/K:3+6-32\n10125000 11325000 N^w-a+pau=d/A:3+6+1/B:18-xx_xx/C:24_xx+xx/D:07+xx_xx/E:xx_xx!xx_xx-xx/F:6_3#0_xx@1_1|1_6/G:3_1%0_xx_0/H:xx_xx/I:1-6@1+3\u00261-6|1+32/J:2_10/K:3+6-32\n```\n\nFor details, please refer to HTS documents: http://hts.sp.nitech.ac.jp\n\n## What can I do with this?\n\nIf you want to make traditional DNN-based TTS systems, please check out the tutorials at https://r9y9.github.io/nnmnkwii/latest/. You can use alignment and full-context labels to generate linguistic features.\n\nIf you are intersted in end-to-end approaches, please have a look at https://github.com/espnet/espnet. The labels are used at the preprocessing stage for the JSUT recipe (see also https://r9y9.github.io/blog/2017/11/12/jsut_ver1/ to know why we need alignments for end-to-end TTS).\n\nHappy speech hacking! \n\n## Source code to generate labels\n\nhttps://github.com/r9y9/segmentation-kit/tree/jsut3\n\n## Notice\n\n- Alignments are likely to have mistakes because they were automatically generated by Julius. Note that they are not hand-annotated labels.\n\n## References\n\n- [JSUT (Japanese speech corpus of Saruwatari-lab., University of Tokyo)](https://sites.google.com/site/shinnosuketakamichi/publication/jsut)\n- [HTS](http://hts.sp.nitech.ac.jp)\n- [Julius](https://github.com/julius-speech/julius)\n- [OpenJTalk](http://open-jtalk.sp.nitech.ac.jp/)\n- [日本語 End-to-end 音声合成に使えるコーパス JSUT の前処理 [arXiv:1711.00354]](https://r9y9.github.io/blog/2017/11/12/jsut_ver1/)\n- [pyopenjtalk](https://github.com/r9y9/pyopenjtalk)\n- [nnmnkwii](https://github.com/r9y9/nnmnkwii)\n- [sarulab-speech/jsut-label](https://github.com/sarulab-speech/jsut-label) Hand-annotated phonetic and prosodic information from Saruwatari-lab.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fr9y9%2Fjsut-lab","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fr9y9%2Fjsut-lab","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fr9y9%2Fjsut-lab/lists"}