{"id":21299839,"url":"https://github.com/ddlbojack/speech-resources","last_synced_at":"2026-01-28T05:18:28.592Z","repository":{"id":37394027,"uuid":"424462336","full_name":"ddlBoJack/Speech-Resources","owner":"ddlBoJack","description":"语音方向实验室/公司/资源/实习等，欢迎推荐或自荐","archived":false,"fork":false,"pushed_at":"2024-11-13T20:26:11.000Z","size":5702,"stargazers_count":550,"open_issues_count":2,"forks_count":68,"subscribers_count":20,"default_branch":"main","last_synced_at":"2025-04-01T21:45:06.367Z","etag":null,"topics":["speech","speech-processing"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ddlBoJack.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-11-04T03:38:45.000Z","updated_at":"2025-03-30T09:15:21.000Z","dependencies_parsed_at":"2023-09-27T10:58:13.561Z","dependency_job_id":"2c79f44e-df6e-4da0-9278-f1c99e21f1a7","html_url":"https://github.com/ddlBoJack/Speech-Resources","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/ddlBoJack/Speech-Resources","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ddlBoJack%2FSpeech-Resources","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ddlBoJack%2FSpeech-Resources/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ddlBoJack%2FSpeech-Resources/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ddlBoJack%2FSpeech-Resources/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ddlBoJack","download_url":"https://codeload.github.com/ddlBoJack/Speech-Resources/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ddlBoJack%2FSpeech-Resources/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28840088,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-28T02:10:51.810Z","status":"ssl_error","status_checked_at":"2026-01-28T02:10:50.806Z","response_time":57,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["speech","speech-processing"],"created_at":"2024-11-21T15:06:27.885Z","updated_at":"2026-01-28T05:18:28.578Z","avatar_url":"https://github.com/ddlBoJack.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003c!-- START doctoc generated TOC please keep comment here to allow auto update --\u003e\n\u003c!-- DON'T EDIT THIS SECTION, INSTEAD RE-RUN doctoc TO UPDATE --\u003e\n**Table of Contents**  *generated with [DocToc](https://github.com/thlorenz/doctoc)*\n\n- [Speech-Resource](#speech-resource)\n  - [国内高校](#%E5%9B%BD%E5%86%85%E9%AB%98%E6%A0%A1)\n    - [清华大学](#%E6%B8%85%E5%8D%8E%E5%A4%A7%E5%AD%A6)\n    - [北京大学](#%E5%8C%97%E4%BA%AC%E5%A4%A7%E5%AD%A6)\n    - [上海交通大学](#%E4%B8%8A%E6%B5%B7%E4%BA%A4%E9%80%9A%E5%A4%A7%E5%AD%A6)\n    - [中国科学院](#%E4%B8%AD%E5%9B%BD%E7%A7%91%E5%AD%A6%E9%99%A2)\n    - [中国科学技术大学](#%E4%B8%AD%E5%9B%BD%E7%A7%91%E5%AD%A6%E6%8A%80%E6%9C%AF%E5%A4%A7%E5%AD%A6)\n    - [西北工业大学](#%E8%A5%BF%E5%8C%97%E5%B7%A5%E4%B8%9A%E5%A4%A7%E5%AD%A6)\n    - [天津大学](#%E5%A4%A9%E6%B4%A5%E5%A4%A7%E5%AD%A6)\n    - [厦门大学](#%E5%8E%A6%E9%97%A8%E5%A4%A7%E5%AD%A6)\n    - [昆山杜克大学](#%E6%98%86%E5%B1%B1%E6%9D%9C%E5%85%8B%E5%A4%A7%E5%AD%A6)\n    - [浙江大学](#%E6%B5%99%E6%B1%9F%E5%A4%A7%E5%AD%A6)\n    - [哈尔滨工业大学](#%E5%93%88%E5%B0%94%E6%BB%A8%E5%B7%A5%E4%B8%9A%E5%A4%A7%E5%AD%A6)\n    - [香港中文大学](#%E9%A6%99%E6%B8%AF%E4%B8%AD%E6%96%87%E5%A4%A7%E5%AD%A6)\n    - [香港科技大学](#%E9%A6%99%E6%B8%AF%E7%A7%91%E6%8A%80%E5%A4%A7%E5%AD%A6)\n    - [香港理工大学](#%E9%A6%99%E6%B8%AF%E7%90%86%E5%B7%A5%E5%A4%A7%E5%AD%A6)\n    - [台湾大学](#%E5%8F%B0%E6%B9%BE%E5%A4%A7%E5%AD%A6)\n  - [海外高校](#%E6%B5%B7%E5%A4%96%E9%AB%98%E6%A0%A1)\n    - [剑桥大学](#%E5%89%91%E6%A1%A5%E5%A4%A7%E5%AD%A6)\n    - [牛津大学](#%E7%89%9B%E6%B4%A5%E5%A4%A7%E5%AD%A6)\n    - [爱丁堡大学](#%E7%88%B1%E4%B8%81%E5%A0%A1%E5%A4%A7%E5%AD%A6)\n    - [谢菲尔德大学](#%E8%B0%A2%E8%8F%B2%E5%B0%94%E5%BE%B7%E5%A4%A7%E5%AD%A6)\n    - [蒙特利尔大学](#%E8%92%99%E7%89%B9%E5%88%A9%E5%B0%94%E5%A4%A7%E5%AD%A6)\n    - [麻省理工大学](#%E9%BA%BB%E7%9C%81%E7%90%86%E5%B7%A5%E5%A4%A7%E5%AD%A6)\n    - [卡耐基梅隆大学](#%E5%8D%A1%E8%80%90%E5%9F%BA%E6%A2%85%E9%9A%86%E5%A4%A7%E5%AD%A6)\n    - [约翰霍普金斯大学](#%E7%BA%A6%E7%BF%B0%E9%9C%8D%E6%99%AE%E9%87%91%E6%96%AF%E5%A4%A7%E5%AD%A6)\n    - [南加州大学](#%E5%8D%97%E5%8A%A0%E5%B7%9E%E5%A4%A7%E5%AD%A6)\n    - [德克萨斯州大学达拉斯分校](#%E5%BE%B7%E5%85%8B%E8%90%A8%E6%96%AF%E5%B7%9E%E5%A4%A7%E5%AD%A6%E8%BE%BE%E6%8B%89%E6%96%AF%E5%88%86%E6%A0%A1)\n    - [罗切斯特大学](#%E7%BD%97%E5%88%87%E6%96%AF%E7%89%B9%E5%A4%A7%E5%AD%A6)\n    - [布尔诺理工大学](#%E5%B8%83%E5%B0%94%E8%AF%BA%E7%90%86%E5%B7%A5%E5%A4%A7%E5%AD%A6)\n    - [俄亥俄州立大学](#%E4%BF%84%E4%BA%A5%E4%BF%84%E5%B7%9E%E7%AB%8B%E5%A4%A7%E5%AD%A6)\n    - [新加坡国立大学](#%E6%96%B0%E5%8A%A0%E5%9D%A1%E5%9B%BD%E7%AB%8B%E5%A4%A7%E5%AD%A6)\n    - [南洋理工大学](#%E5%8D%97%E6%B4%8B%E7%90%86%E5%B7%A5%E5%A4%A7%E5%AD%A6)\n    - [新加坡科技设计大学](#%E6%96%B0%E5%8A%A0%E5%9D%A1%E7%A7%91%E6%8A%80%E8%AE%BE%E8%AE%A1%E5%A4%A7%E5%AD%A6)\n    - [国立情报学研究所（Tokyo）](#%E5%9B%BD%E7%AB%8B%E6%83%85%E6%8A%A5%E5%AD%A6%E7%A0%94%E7%A9%B6%E6%89%80tokyo)\n  - [国内企业](#%E5%9B%BD%E5%86%85%E4%BC%81%E4%B8%9A)\n  - [期刊\u0026会议](#%E6%9C%9F%E5%88%8A%E4%BC%9A%E8%AE%AE)\n  - [竞赛](#%E7%AB%9E%E8%B5%9B)\n  - [公众号](#%E5%85%AC%E4%BC%97%E5%8F%B7)\n  - [知乎专栏](#%E7%9F%A5%E4%B9%8E%E4%B8%93%E6%A0%8F)\n  - [常用资源](#%E5%B8%B8%E7%94%A8%E8%B5%84%E6%BA%90)\n\n\u003c!-- END doctoc generated TOC please keep comment here to allow auto update --\u003e\n\n# Speech-Resource\n\n\u003e 语音方向实验室/公司/资源/实习等，欢迎推荐或自荐（排名不分先后）\n\n\u003cimg src=\"README/Wechat.jpg\" width=\"400px\"/\u003e\n\n## 国内高校\n\n### 清华大学\n\n电子工程系\n\n- 吴及：电子工程系副系主任，研究方向侧重于语音语言智能与医学结合\n- [张超](http://mi.eng.cam.ac.uk/~cz277)：加入清华前为谷歌语音组Senior Research Scientist\n\n[电子工程系语音与音频技术实验室(SATLab)](http://web.ee.tsinghua.edu.cn/satlab)\n\n- 刘加：原实验室主任\n- [张卫强](http://web.ee.tsinghua.edu.cn/wqzhang)：实验室主任，语音识别、音频识别、音乐与声学信号处理\n\n电子工程系语音处理与机器智能实验室(SPMI lab)\n\n- [欧志坚](http://oa.ee.tsinghua.edu.cn/~ouzhijian/index.htm)\n\n[清华大学信息技术研究院语音和语言技术研究中心(CSLT)](http://cslt.riit.tsinghua.edu.cn/index.php)\n\n该实验室以声纹识别为特色，对应北京得意音通公司。\n\n- [郑方](http://cslt.riit.tsinghua.edu.cn/~fzheng/index.htm)\n- [周强](http://cslt.riit.tsinghua.edu.cn/~qzhou/eng/index.htm)\n- [王东](http://wangd.cslt.org/)\n\n计算机系\n\n- [贾珈](http://hcsi.cs.tsinghua.edu.cn/jiajia)：人机语音交互，偏向多媒体方向\n\n[清华大学人机语音交互实验室(THUHCSI)](https://thuhcsi.github.io/)\n\n- [吴志勇](https://www.sigs.tsinghua.edu.cn/zywu/main.htm)\n\n\n\n### 北京大学\n\n[计算机科学技术研究所数字音频实验室](https://www.icst.pku.edu.cn/audioLab/index.htm)\n\n该实验室以多媒体音视频内容的检索与挖掘为主，很多内容涉及音频方向。\n\n- 陈晓鸥\n- 杨德顺\n\n深圳研究生院现代信号与数据处理实验室(ADSPLAB)\n\n- 邹月娴\n\n\n\n### 上海交通大学\n\n[计算机系跨媒体语言智能实验室(现X-Lance，前SpeechLab)](https://x-lance.sjtu.edu.cn/)\n\n对应思必驰公司。\n\n- 俞凯：实验室主任，思必驰首席科学家，语音识别与合成，语音软硬件协同\n- 钱彦旻：实验室副主任，鲁棒性、多语言、低资源语音识别，Kaldi唯一的亚洲作者\n- 吴梦玥：语音感知与生成、多模态语音\n- 陈谐：端到端语音识别，加入交大前为微软语音组Principal Researcher\n\n电子系未来媒体协同创新中心\n\n- 王钰\n\n\n\n### 中国科学院\n\n[自动化所模式识别国家重点实验室](http://www.ia.cas.cn/)\n\n- 徐波\n- 陶建华\n- 刘文举\n- 刘斌\n\n声学所\n\n- 颜永红\n\n\n\n### 中国科学技术大学\n\n[语音及语言信息处理国家工程实验室](http://nelslip.ustc.edu.cn/)\n\n对应科大讯飞，国内领先水平。\n\n- 刘庆峰\n\n- 胡郁\n\n- 戴礼荣\n- 王仁华\n\n- 陈恩红\n- [凌震华](http://staff.ustc.edu.cn/~zhling/)\n- 杜俊\n\n\n\n### 西北工业大学\n\n[音频语音与语言处理研究组(ASLP)](http://www.npu-aslp.org/)\n\n- [谢磊](http://lxie.npu-aslp.org/)\n\n[智能声学与临境通信研究中心(CIAIC)](https://www.ciaic.org/)\n\n- 陈景东：前贝尔实验室资深研究员，信号和信息处理做的很好\n\n\n\n### 天津大学\n\n智能与计算学部\n\n- 党建武\n- 王龙标\n\n\n\n### 厦门大学\n\n智能科学与技术系\n\n- 洪青阳：天聪智能创始人，主要研究语音识别、声纹识别\n\n\n\n### 昆山杜克大学\n\n大数据研究中心(SMIIPLab)\n\n- 李明\n\n\n\n### 浙江大学\n\n计算机科学与技术学院\n\n- 赵洲\n\n### 哈尔滨工业大学\n\n计算机科学与技术学院听觉智能研究中心\n\n- [韩纪庆](http://homepage.hit.edu.cn/hanjiqing)\n\n\n\n### 香港中文大学\n\n[Human-Computer Communications Laboratory (HCCL)](https://www1.se.cuhk.edu.hk/~hccl/publications/)\n\n- [蒙美玲](https://www.se.cuhk.edu.hk/people/academic-staff/prof-meng-mei-ling-helen/)\n- [刘循英](https://www1.se.cuhk.edu.hk/~xyliu/)\n- [吴锡欣](https://www1.se.cuhk.edu.hk/~wuxx/)\n\n香港中文大学电子工程系\n\n- [李丹](https://www.ee.cuhk.edu.hk/~tanlee/)\n- [孔秋强](https://qiuqiangkong.github.io/)\n\n香港中文大学（深圳）数据科学学院\n\n- [李海洲](https://colips.org/~eleliha/)\n- 武执正\n\n\n\n### 香港科技大学\n\n计算机科学与工程系\n\n- [Brain Mak](https://www.cse.ust.hk/faculty/mak/)\n- [雪巍](https://facultyprofiles.hkust.edu.hk/profiles.php?profile=wei-xue-weixue)\n\n\n\n### 香港理工大学\n\n电子信息工程系\n\n- [Man-Wai Mak](http://www.eie.polyu.edu.hk/~mwmak/)\n\n\n\n### 台湾大学\n\nSpeech Processing and Machine Learning Laboratory\n\n- [李琳山](https://speech.ee.ntu.edu.tw/previous_version/lslNew.htm)\n- [李宏毅](https://speech.ee.ntu.edu.tw/~hylee/index.php)\n\n\n\n## 海外高校\n\n### 剑桥大学\n\nMachine Intelligence Laboratory - Speech Research Group\n\n- Steve Young: The HTK book 一作\n\n- Phil Woodland\n- Mark Gales\n\n\n\n### 牛津大学\n\nVisual Geometry Group\n\n- Andrew Zisserman\n\n\n\n### 爱丁堡大学\n\nThe Centre for Speech Technology Research\n\n- [Simon King](https://homepages.inf.ed.ac.uk/simonk/)\n- Steve Renals\n- Peter Bell\n- Hao Tang\n\n\n\n### 谢菲尔德大学\n\nSpeech and Hearing Group\n\n- [Thomas Hain](https://staffwww.dcs.shef.ac.uk/people/T.Hain/)\n- [Jon Barker](http://staffwww.dcs.shef.ac.uk/people/J.Barker/)\n- [Heidi Christensen](https://heidi-christensen.github.io/website//)\n- [Roger K. Moore](http://staffwww.dcs.shef.ac.uk/people/R.K.Moore/)\n\n\n\n### 蒙特利尔大学\n\nMila - Quebec AI Institute\n\n- [Yoshua Bengio](https://yoshuabengio.org/)\n\n\n\n\n### 麻省理工大学\n\nMIT CSAIL\n\n- James Glass\n- [Antonio Torralba](http://web.mit.edu/torralba/www/)\n\n\n\n### 卡耐基梅隆大学\n\n- [Shinji Watanabe](https://sites.google.com/view/shinjiwatanabe)\n\n\n\n### 约翰霍普金斯大学\n\nCenter for Language and Speech Processing\n\n- Sanjeev Khudanpur\n\n\n\n### 南加州大学\n\n- [Shrikanth (Shri) Narayanan](https://scholar.google.com/citations?hl=zh-CN\u0026user=8EDHmYkAAAAJ\u0026view_op=list_works\u0026sortby=pubdate)\n\n\n\n### 德克萨斯州大学达拉斯分校\n\n- [John Hansen](https://scholar.google.com/citations?user=hfADwdIAAAAJ\u0026hl=zh-CN)\n\n\n\n### 罗切斯特大学\n\n- [Zhiyao Duan](https://scholar.google.com/citations?hl=en\u0026user=pJmAoJ4AAAAJ\u0026view_op=list_works\u0026sortby=pubdate)\n\n\n\n### 布尔诺理工大学\n\nFaculty of Information Technology\n\n- Lukas Burget\n- Jan Cernocky\n\n\n\n### 俄亥俄州立大学\n\n- [DeLiang Wang](https://scholar.google.com/citations?user=yO59sggAAAAJ\u0026hl=zh-CN)\n\n\n\n### 新加坡国立大学\n\nHuman Language Technology Laboratory\n\n- [Haizhou Li](https://colips.org/~eleliha/)\n\n\n\n### 南洋理工大学\n\n- [Eng-Siong Chng](https://personal.ntu.edu.sg/aseschng/intro1.html)\n\n\n\n### 新加坡科技设计大学\n\n- [Berrak Sisman](https://istd.sutd.edu.sg/people/faculty/berrak-sisman)\n\n\n\n### 国立情报学研究所（Tokyo）\n\n- [Junichi Yamagishi](https://scholar.google.com/citations?user=nRrdjtwAAAAJ\u0026hl=zh-CN)\n\n\n\n\n## 国内企业\n\n- MSRA-NLC组\n- MSRA-ML组\n- 腾讯AILAB语音技术中心\n- 腾讯天籁实验室\n- 阿里达摩院智能语音实验室\n- 阿里天猫精灵\n- 字节跳动SAMI组\n- 科大讯飞\n- 搜狗\n- 百度小度\n- 小米小爱\n- 小米k2\n- 思必驰\n- 云知声\n- 出门问问WeNet\n- 标贝科技\n- 星辰征途\n\n\n\n## 期刊\u0026会议\n\n- TPAMI（IEEE Trans on Pattern Analysis and Machine Intelligence）\n- TASLP（IEEE Transactions on Audio, Speech, and Language Processing）\n- TSLP（ACM Transactions on Speech and Language Processing）\n- ICASSP（IEEE International Conference on Acoustics, Speech and Signal Processing）\n- INTERSPEECH（Conference of the International Speech Communication Association）\n- ASRU（IEEE Automatic Speech Recognition and Understanding Workshop）\n- SLT（IEEE Spoken Language Technology Workshop）\n- SPL（IEEE Signal Processing Letters）\n- ISCSLP（International Symposium on Chinese Spoken Language Processing）\n- JSLHR（Journal of Speech, Language, and Hearing Research）\n- Computer Speech and Language\n- Speaker Odyssey\n- JASA（Journal of the Acoustical Society of America）\n- Signal Processing\n- Speech Communication\n\n\n\n## 竞赛\n\n- CHiME\n- VCC\n- DCASE\n- NIST SRE\n- Blizzard Challenge\n- OLR东方语种识别\n- VoxSRC\n\n\n\n## 公众号\n\n- 语音杂谈\n- 谈谈语音技术\n- WeNet步行街\n- CCF语音对话与听觉专委会\n\n- 语音之家\n- 智能语音青年\n- 低调奋进\n- 新一代Kaldi\n\n\n\n\n## 知乎专栏\n\n[谈谈语音技术](https://www.zhihu.com/column/c_1409104824050446336)\n\n[自监督语音识别](https://www.zhihu.com/column/c_1446609615102832640)\n\n[Kaldi源码解析](https://www.zhihu.com/column/c_1313042386550267904)\n\n[espnet--一个端到端语音识别工具箱](https://www.zhihu.com/column/espnet)\n\n[新一代Kaldi](https://www.zhihu.com/people/yaozengwei/posts)\n\n\n\n## 常用资源\n\n[语音识别数据集汇总](https://github.com/double22a/speech_dataset)\n\n[语音识别 benchmark](https://github.com/SpeechColab/Leaderboard)\n\n[语音预训练 paper list](https://github.com/ddlBoJack/Awesome-Speech-Pretraining)\n\n[语音合成 paper list](https://github.com/wenet-e2e/speech-synthesis-paper)\n\n[语音增强 paper list](https://github.com/Wenzhe-Liu/awesome-speech-enhancement)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fddlbojack%2Fspeech-resources","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fddlbojack%2Fspeech-resources","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fddlbojack%2Fspeech-resources/lists"}