{"id":31904455,"url":"https://github.com/deezer/libriquote","last_synced_at":"2026-02-18T13:01:04.296Z","repository":{"id":313357289,"uuid":"1050324527","full_name":"deezer/libriquote","owner":"deezer","description":"Utilities and evaluation code for LibriQuote, a speech dataset of expressive utterances from fictional characters","archived":false,"fork":false,"pushed_at":"2025-09-07T09:25:14.000Z","size":5149,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-10-21T17:43:57.109Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/deezer.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-09-04T09:15:48.000Z","updated_at":"2025-09-07T09:25:17.000Z","dependencies_parsed_at":"2025-09-05T15:29:17.661Z","dependency_job_id":"128f16da-1cda-487b-9951-f28c55569375","html_url":"https://github.com/deezer/libriquote","commit_stats":null,"previous_names":["deezer/libriquote"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/deezer/libriquote","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deezer%2Flibriquote","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deezer%2Flibriquote/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deezer%2Flibriquote/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deezer%2Flibriquote/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/deezer","download_url":"https://codeload.github.com/deezer/libriquote/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deezer%2Flibriquote/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29580625,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-18T08:38:15.585Z","status":"ssl_error","status_checked_at":"2026-02-18T08:38:14.917Z","response_time":162,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-10-13T13:49:15.181Z","updated_at":"2026-02-18T13:01:04.270Z","avatar_url":"https://github.com/deezer.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# LibriQuote:  Speech Dataset of Fictional Character Utterances for Expressive Zero-Shot Speech Synthesis\nThis repository contains helper functions to process LibriQuote data, and benchmark expressive TTS systems using LibriQuote-test.\n\nThis repository contains:\n\n- Helper python classes to process LibriQuote in the [`processing/`](processing/) folder\n- Evaluation scripts to benchmark TTS systems on LibriQuote-test in the [`evaluation/`](evaluation/) folder.\n- The LibriQuote dataset hosted on [HuggingFace](https://huggingface.co/datasets/gasmichel/LibriQuote).\n\n\u003cimg style=\"display: block; margin: 0 auto;\" src=\"assets/emotion2vec.png\" height=\"250\" width=\"1000\"\u003e\n\u003c/body\u003e\n    \u003cp style=\"text-align: justify;\" \u003e\n        \u003cb\u003eFigure 1.\u003c/b\u003e t-SNE projection of emotion vector representations computed with \u003ci\u003eemotion2vec-plus-base\u003c/i\u003e. LibriQuote-\u003ci\u003etest\u003c/i\u003e  (a) quotations and (b) reference narration (non-quotation) utterances; (c) Subsample of LibriHeavy segments (N=5734).\n    \u003c/p\u003e\n\n# Links \n\n- [Paper](https://arxiv.org/pdf/2509.04072)\n- [Dataset](https://huggingface.co/datasets/gasmichel/LibriQuote/tree/main)\n- [Audio samples from the paper](https://libriquote.github.io/)\n\n# Benchmarking Only \n\nIf you use LibriQuote-test only for benchmarking, we provide target and reference samples (in 16KHz) directly in the [HuggingFace repository](https://huggingface.co/datasets/gasmichel/LibriQuote/tree/main/test_audios). Check-out the [`evaluation/`](evaluation/) folder to find evaluation scripts.\n\n#  LibriLight Audio Files\n\nLibriQuote comes with segments derived from narration paragraphs and quotation from characters in fiction novels. It is derived from LibriVox recordings, and currently uses LibriLight audio files as backend audio files. Note that these audio files are encoded in 16KHz.\n\nPlease follow [LibriLight instructions](https://github.com/facebookresearch/libri-light/blob/main/data_preparation/README.md) to download and prepare audio files.\n\nWe provide a [bash script](librilight_matching/) that will `untar` only necessary LibriQuote files, reducing the overall processing time and total disk space required\n\n# Processing LibriQuote\n\nFind more information in the [`processing/`](processing/) folder.\n\n# Benchmarking using LibriQuote-test\n\nFind more information in the [`evaluation/`](evaluation/) folder.\n\n# Citing \n\nIf you use LibriQuote or part of this code in your publications, you can cite this work with the following BibTex entry:\n\n```bibtex\n@misc{Michel2025LibriQuote,\n    title={LibriQuote: A Speech Dataset of Fictional Character Utterances for Expressive Zero-Shot Speech Synthesis}, \n    author={Gaspard Michel and Elena V. Epure and Christophe Cerisara},\n    year={2025},\n    eprint={2509.04072},\n    archivePrefix={arXiv},\n    primaryClass={eess.AS},\n    url={https://arxiv.org/abs/2509.04072}\n}\n```","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdeezer%2Flibriquote","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdeezer%2Flibriquote","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdeezer%2Flibriquote/lists"}