https://github.com/huawei-noah/speech-backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
- Host: GitHub
- URL: https://github.com/huawei-noah/speech-backbones
- Owner: huawei-noah
- Created: 2021-07-19T08:47:17.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2023-09-18T08:22:42.000Z (about 2 years ago)
- Last Synced: 2025-03-28T21:06:42.037Z (7 months ago)
- Topics: speech-processing, speech-recognition, speech-synthesis
- Language: Jupyter Notebook
- Homepage:
- Size: 33.8 MB
- Stars: 576
- Watchers: 22
- Forks: 124
- Open Issues: 22
Metadata Files:
- Readme: README.md
README
# Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
## Grad-TTS
Official implementation of the Grad-TTS model based on Diffusion Probabilistic Modelling. For all details check out our paper accepted to ICML 2021 via [this](https://arxiv.org/abs/2105.06337) link.
**Authors**: Vadim Popov\*, Ivan Vovk\*, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov.
\*Equal contribution.
## SPIRAL
Official implementation of SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training. For all details check out our paper accepted to ICLR 2022 via [this](https://arxiv.org/abs/2201.10207) link.
**Authors**: Wenyong Huang, Zhenhe Zhang, Yu Ting Yeung, Xin Jiang, Qun Liu.
## DiffVC
Official implementation of the paper "Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme" (ICLR 2022, Oral). [Link](https://arxiv.org/abs/2109.13821).
**Authors**: Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei.