https://github.com/myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell.
https://github.com/myshell-ai/OpenVoice

text-to-speech tts voice-clone zero-shot-tts

Last synced: 12 months ago
JSON representation

Instant voice cloning by MIT and MyShell.

Host: GitHub
URL: https://github.com/myshell-ai/OpenVoice
Owner: myshell-ai
License: mit
Created: 2023-11-29T12:17:01.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-08-21T18:04:49.000Z (over 1 year ago)
Last Synced: 2024-08-21T20:24:14.301Z (over 1 year ago)
Topics: text-to-speech, tts, voice-clone, zero-shot-tts
Language: Python
Homepage: https://research.myshell.ai/open-voice
Size: 3.7 MB
Stars: 28,007
Watchers: 212
Forks: 2,740
Open Issues: 184
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome - myshell-ai/OpenVoice - Instant voice cloning by MIT and MyShell. Audio foundation model. (Python)
awesome-gpt - https://github.com/myshell-ai/OpenVoice
ai-game-devtools - OpenVoice
awesome-llm-projects - OpenVoice
awesome-ai - OpenVoice
StarryDivineSky - myshell-ai/OpenVoice
awesome-genai - OpenVoice - a versatile instant voice cloning approach that requires only a short audio clip from the reference speaker to replicate their voice and generate speech in multiple languages. [![Forks](https://img.shields.io/github/forks/myshell-ai/OpenVoice?style=social)](https://github.com/myshell-ai/OpenVoice/network/members) [![Stars](https://img.shields.io/github/stars/myshell-ai/OpenVoice?style=social)](https://github.com/myshell-ai/OpenVoice/stargazers) (Tools & Frameworks / Open-source projects)
awesome-hacking-lists - myshell-ai/OpenVoice - Instant voice cloning by MIT and MyShell. Audio foundation model. (Python)
awesome-repositories - myshell-ai/OpenVoice - Instant voice cloning by MIT and MyShell. Audio foundation model. (Python)
awesome-tts-colab - myshell-ai/OpenVoice - tts](https://github.com/idiap/coqui-ai-TTS) (Use as base TTS model) (Colab Notebooks / OpenVoice V2 (Voice Conversion))
AiTreasureBox - myshell-ai/OpenVoice - 11-03_35300_2](https://img.shields.io/github/stars/myshell-ai/OpenVoice.svg)|Instant voice cloning by MyShell| (Repos)
awesome-local-ai - OpenVoice

README

          


   

   

[Paper](https://arxiv.org/abs/2312.01479) |

[Website](https://research.myshell.ai/open-voice) 
 






## Introduction

### OpenVoice V1

As we detailed in our [paper](https://arxiv.org/abs/2312.01479) and [website](https://research.myshell.ai/open-voice), the advantages of OpenVoice are three-fold:

**1. Accurate Tone Color Cloning.**

OpenVoice can accurately clone the reference tone color and generate speech in multiple languages and accents.

**2. Flexible Voice Style Control.**

OpenVoice enables granular control over voice styles, such as emotion and accent, as well as other style parameters including rhythm, pauses, and intonation. 

**3. Zero-shot Cross-lingual Voice Cloning.**

Neither of the language of the generated speech nor the language of the reference speech needs to be presented in the massive-speaker multi-lingual training dataset.

### OpenVoice V2

In April 2024, we released OpenVoice V2, which includes all features in V1 and has:

**1. Better Audio Quality.**

OpenVoice V2 adopts a different training strategy that delivers better audio quality.

**2. Native Multi-lingual Support.**

English, Spanish, French, Chinese, Japanese and Korean are natively supported in OpenVoice V2.

**3. Free Commercial Use.**

Starting from April 2024, both V2 and V1 are released under MIT License. Free for commercial use.

[Video](https://github.com/myshell-ai/OpenVoice/assets/40556743/3cba936f-82bf-476c-9e52-09f0f417bb2f)

OpenVoice has been powering the instant voice cloning capability of [myshell.ai](https://app.myshell.ai/explore) since May 2023. Until Nov 2023, the voice cloning model has been used tens of millions of times by users worldwide, and witnessed the explosive user growth on the platform.

## Main Contributors

- [Zengyi Qin](https://www.qinzy.tech) at MIT

- [Wenliang Zhao](https://wl-zhao.github.io) at Tsinghua University

- [Xumin Yu](https://yuxumin.github.io) at Tsinghua University

- [Ethan Sun](https://twitter.com/ethan_myshell) at MyShell

## How to Use

Please see [usage](docs/USAGE.md) for detailed instructions.

## Common Issues

Please see [QA](docs/QA.md) for common questions and answers. We will regularly update the question and answer list.

## Citation

```

@article{qin2023openvoice,

  title={OpenVoice: Versatile Instant Voice Cloning},

  author={Qin, Zengyi and Zhao, Wenliang and Yu, Xumin and Sun, Xin},

  journal={arXiv preprint arXiv:2312.01479},

  year={2023}

}

```

## License

OpenVoice V1 and V2 are MIT Licensed. Free for both commercial and research use.

## Acknowledgements

This implementation is based on several excellent projects, [TTS](https://github.com/coqui-ai/TTS), [VITS](https://github.com/jaywalnut310/vits), and [VITS2](https://github.com/daniilrobnikov/vits2). Thanks for their awesome work!

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/myshell-ai/OpenVoice

Awesome Lists containing this project

README