https://github.com/KevinWang676/Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech
https://github.com/KevinWang676/Bark-Voice-Cloning

Last synced: 17 days ago
JSON representation

Bark Voice Cloning and Voice Cloning for Chinese Speech

Host: GitHub
URL: https://github.com/KevinWang676/Bark-Voice-Cloning
Owner: KevinWang676
License: mit
Created: 2023-06-11T22:16:56.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2025-04-06T03:27:35.000Z (29 days ago)
Last Synced: 2025-04-10T20:55:19.954Z (24 days ago)
Language: Jupyter Notebook
Homepage:
Size: 3.01 MB
Stars: 2,891
Watchers: 33
Forks: 416
Open Issues: 73
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

StarryDivineSky - KevinWang676/Bark-Voice-Cloning - Voice-Cloning项目旨在实现中文语音克隆，基于Suno-AI/Bark模型。它允许用户使用自己的语音数据训练模型，从而生成个性化的语音。项目特色在于支持中文语音克隆，并提供了详细的教程和示例。其工作原理是利用Bark模型强大的文本到语音转换能力，结合用户提供的语音数据进行微调，使模型能够模仿目标语音的音色和风格。项目包含数据准备、模型训练和语音合成等关键步骤，用户可以通过提供的脚本和工具轻松完成语音克隆过程。该项目为中文语音合成领域的研究和应用提供了新的可能性，并允许用户创造独特的语音体验。它不仅适用于个人娱乐，也可能在语音助手、内容创作等领域发挥作用。项目还提供了预训练模型和相关资源，方便用户快速上手。 (语音合成 / 资源传输下载)

README

        # Bark Voice Cloning 🐶 & Voice Cloning for Chinese Speech 🎶

### [简体中文](https://github.com/KevinWang676/Bark-Voice-Cloning/blob/main/README_zh.md)

## 1️⃣ Bark Voice Cloning

> 10/19/2023: Fixed `ERROR: Exception in ASGI application` by specifying `gradio==3.33.0` and `gradio_client==0.2.7` in [requirements.txt](https://github.com/KevinWang676/Bark-Voice-Cloning/blob/main/requirements.txt).

> 11/08/2023: Integrated [KNN-VC](https://github.com/bshall/knn-vc) into [OpenAI TTS](https://platform.openai.com/docs/guides/text-to-speech) and created an easy-to-use Gradio interface. Try it [here](https://colab.research.google.com/github/KevinWang676/Bark-Voice-Cloning/blob/main/notebooks/OpenAI_TTS_KNN_VC_en.ipynb).

> 02/27/2024: We are thrilled to launch our most powerful **AI song cover generator** ever with [Shanghai Artificial Intelligence Laboratory](https://www.shlab.org.cn/)! Just need to provide the name of a song and our application running on an **A100** GPU will handle everything else. Check it out in our [**website**](https://www.talktalkai.com/) (please click "EN" in the first tab of our website to see the english version)! 💕

Based on [bark-gui](https://github.com/C0untFloyd/bark-gui) and [bark](https://github.com/suno-ai/bark). Thanks to [C0untFloyd](https://github.com/C0untFloyd).

**Quick start**: [**Colab Notebook**](https://colab.research.google.com/github/KevinWang676/Bark-Voice-Cloning/blob/main/Bark_Voice_Cloning.ipynb) ⚡

**HuggingFace Demo**: [**Bark Voice Cloning**](https://huggingface.co/spaces/kevinwang676/Bark-with-Voice-Cloning) 🤗 (Need a GPU)

**Demo Video**: [**YouTube Video**](https://www.youtube.com/watch?v=IAf695dhkUc&t=4s)

If you would like to run the code locally, remember to replace the original path `/content/Bark-Voice-Cloning/bark/assets/prompts/file.npz` with the path of `file.npz` in your own computer.

### If you like the quick start, please star this repository. ⭐⭐⭐

## Easy to use: 

(1) First upload audio for voice cloning and click `Create Voice`.

![image](https://github.com/KevinWang676/Bark-Voice-Cloning/assets/126712357/65e2b695-f529-4fb5-9549-4e86e6a4d8b2)

(2) Choose the option called "file" in `Voice` if you'd like to use voice cloning.

(3) Click `Generate`. Done!

![image](https://github.com/KevinWang676/Bark-Voice-Cloning/assets/126712357/20911e37-768d-47d5-bb86-d12a3ab04c5d)

## 2️⃣ Voice Cloning for Chinese Speech

> 10/26/2023: Integrated labeling, training and inference into an easy-to-use user interface of SambertHifigan. Thanks to [wujohns](https://github.com/wujohns).

We want to point out that [Bark](https://github.com/suno-ai/bark) is very good at generating English speech but relatively poor at generating Chinese speech. So we'd like to adopt another approach, which is called [SambertHifigan](https://www.modelscope.cn/models/speech_tts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/summary), to realizing voice cloning for Chinese speech. Please check out our [Colab Notebook](https://colab.research.google.com/github/KevinWang676/Bark-Voice-Cloning/blob/main/Voice_Cloning_for_Chinese_Speech_v2.ipynb) for the implementation.

Quick start: [Colab Notebook](https://colab.research.google.com/github/KevinWang676/Bark-Voice-Cloning/blob/main/sambert-ui/Sambert_UI.ipynb) ⚡

HuggingFace demo: [Voice Cloning for Chinese Speech](https://huggingface.co/spaces/kevinwang676/Personal-TTS) 🤗

[![Star History Chart](https://api.star-history.com/svg?repos=KevinWang676/Bark-Voice-Cloning&type=Date)](https://star-history.com/#KevinWang676/Bark-Voice-Cloning&Date)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/KevinWang676/Bark-Voice-Cloning

Awesome Lists containing this project

README