https://github.com/CODEJIN/VITS_Diffusion


Host: GitHub
URL: https://github.com/CODEJIN/VITS_Diffusion
Owner: CODEJIN
License: MIT
Created: 2022-09-01T11:41:31.000Z (almost 3 years ago)
Default Branch: main
Last Pushed: 2022-09-22T15:15:42.000Z (over 2 years ago)
Last Synced: 2024-11-18T08:55:57.383Z (7 months ago)
Language: Python
Size: 15.2 MB
Stars: 27
Watchers: 6
Forks: 4
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# VITS Diffusion

* [Kim, J., Kong, J., & Son, J. (2021, July). Conditional variational autoencoder with adversarial learning for end-to-end text-to-speech. In International Conference on Machine Learning (pp. 5530-5540). PMLR.](https://arxiv.org/abs/2106.06103)

* [Kong, Z., Ping, W., Huang, J., Zhao, K., & Catanzaro, B. (2020). DiffWave: A versatile diffusion model for audio synthesis. arXiv preprint arXiv:2009.09761.](https://arxiv.org/abs/2009.09761)

* [Popov, V., Vovk, I., Gogoryan, V., Sadekova, T., & Kudinov, M. (2021, July). Grad-TTS: A diffusion probabilistic model for text-to-speech. In International Conference on Machine Learning (pp. 8599-8608). PMLR.](https://arxiv.org/abs/2105.06337)

* Work in progress.

* This is a test repository to check whether an implementation is possible.
