Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/CODEJIN/VITS_Diffusion
https://github.com/CODEJIN/VITS_Diffusion
Last synced: 25 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/CODEJIN/VITS_Diffusion
- Owner: CODEJIN
- License: mit
- Created: 2022-09-01T11:41:31.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-09-22T15:15:42.000Z (about 2 years ago)
- Last Synced: 2024-08-03T23:03:21.520Z (4 months ago)
- Language: Python
- Size: 15.2 MB
- Stars: 27
- Watchers: 6
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# VITS Diffusion
* [Kim, J., Kong, J., & Son, J. (2021, July). Conditional variational autoencoder with adversarial learning for end-to-end text-to-speech. In International Conference on Machine Learning (pp. 5530-5540). PMLR.](https://arxiv.org/abs/2106.06103)
* [Kong, Z., Ping, W., Huang, J., Zhao, K., & Catanzaro, B. (2020). Diffwave: A versatile diffusion model for audio synthesis. arXiv preprint arXiv:2009.09761.](https://arxiv.org/abs/2009.09761)
* [Popov, V., Vovk, I., Gogoryan, V., Sadekova, T., & Kudinov, M. (2021, July). Grad-tts: A diffusion probabilistic model for text-to-speech. In International Conference on Machine Learning (pp. 8599-8608). PMLR.](https://arxiv.org/abs/2105.06337)* Now working
* This is a test repository to check whether a implementation is possible.