Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
text-to-speech tts voice-clone zero-shot-tts
Last synced: about 2 months ago
- Host: GitHub
- URL: https://github.com/myshell-ai/OpenVoice
- Owner: myshell-ai
- License: mit
- Created: 2023-11-29T12:17:01.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-08-21T18:04:49.000Z (4 months ago)
- Last Synced: 2024-08-21T20:24:14.301Z (4 months ago)
- Topics: text-to-speech, tts, voice-clone, zero-shot-tts
- Language: Python
- Homepage: https://research.myshell.ai/open-voice
- Size: 3.7 MB
- Stars: 28,007
- Watchers: 212
- Forks: 2,740
- Open Issues: 184
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-gpt - https://github.com/myshell-ai/OpenVoice
- awesome-repositories - myshell-ai/OpenVoice - Instant voice cloning by MIT and MyShell. (Python)
- AiTreasureBox - myshell-ai/OpenVoice - Instant voice cloning by MyShell (Repos)
- ai-game-devtools - OpenVoice
- awesome-llm-projects - OpenVoice
- StarryDivineSky - myshell-ai/OpenVoice
- awesome - myshell-ai/OpenVoice - Instant voice cloning by MIT and MyShell. (Python)
- awesome-genai - OpenVoice - a versatile instant voice cloning approach that requires only a short audio clip from the reference speaker to replicate their voice and generate speech in multiple languages. [![Forks](https://img.shields.io/github/forks/myshell-ai/OpenVoice?style=social)](https://github.com/myshell-ai/OpenVoice/network/members) [![Stars](https://img.shields.io/github/stars/myshell-ai/OpenVoice?style=social)](https://github.com/myshell-ai/OpenVoice/stargazers) (Tools & Frameworks / Open-source projects)
README
[Paper](https://arxiv.org/abs/2312.01479) |
[Website](https://research.myshell.ai/open-voice)

## Introduction
### OpenVoice V1
As we detailed in our [paper](https://arxiv.org/abs/2312.01479) and [website](https://research.myshell.ai/open-voice), the advantages of OpenVoice are three-fold:
**1. Accurate Tone Color Cloning.**
OpenVoice can accurately clone the reference tone color and generate speech in multiple languages and accents.

**2. Flexible Voice Style Control.**
OpenVoice enables granular control over voice styles, such as emotion and accent, as well as other style parameters including rhythm, pauses, and intonation.

**3. Zero-shot Cross-lingual Voice Cloning.**
Neither the language of the generated speech nor the language of the reference speech needs to be present in the massive-speaker multilingual training dataset.

### OpenVoice V2
In April 2024, we released OpenVoice V2, which includes all features of V1 and adds:

**1. Better Audio Quality.**
OpenVoice V2 adopts a different training strategy that delivers better audio quality.

**2. Native Multi-lingual Support.**
English, Spanish, French, Chinese, Japanese and Korean are natively supported in OpenVoice V2.

**3. Free Commercial Use.**
Since April 2024, both V2 and V1 have been released under the MIT License and are free for commercial use.

[Video](https://github.com/myshell-ai/OpenVoice/assets/40556743/3cba936f-82bf-476c-9e52-09f0f417bb2f)
OpenVoice has been powering the instant voice cloning capability of [myshell.ai](https://app.myshell.ai/explore) since May 2023. By November 2023, the voice cloning model had been used tens of millions of times by users worldwide, alongside explosive user growth on the platform.
## Main Contributors
- [Zengyi Qin](https://www.qinzy.tech) at MIT
- [Wenliang Zhao](https://wl-zhao.github.io) at Tsinghua University
- [Xumin Yu](https://yuxumin.github.io) at Tsinghua University
- [Ethan Sun](https://twitter.com/ethan_myshell) at MyShell

## How to Use
Please see [usage](docs/USAGE.md) for detailed instructions.

## Common Issues
Please see [QA](docs/QA.md) for common questions and answers. We will regularly update the question and answer list.
## Join Our Community
Join our [Discord community](https://discord.gg/myshell) and select the `Developer` role upon joining to gain exclusive access to our developer-only channel! Don't miss out on valuable discussions and collaboration opportunities.
## Citation
```
@article{qin2023openvoice,
title={OpenVoice: Versatile Instant Voice Cloning},
author={Qin, Zengyi and Zhao, Wenliang and Yu, Xumin and Sun, Xin},
journal={arXiv preprint arXiv:2312.01479},
year={2023}
}
```

## License
OpenVoice V1 and V2 are MIT Licensed. Free for both commercial and research use.

## Acknowledgements
This implementation is based on several excellent projects, [TTS](https://github.com/coqui-ai/TTS), [VITS](https://github.com/jaywalnut310/vits), and [VITS2](https://github.com/daniilrobnikov/vits2). Thanks for their awesome work!