https://github.com/KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Last synced: 17 days ago
- Host: GitHub
- URL: https://github.com/KevinWang676/Bark-Voice-Cloning
- Owner: KevinWang676
- License: mit
- Created: 2023-06-11T22:16:56.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2025-04-06T03:27:35.000Z (29 days ago)
- Last Synced: 2025-04-10T20:55:19.954Z (24 days ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 3.01 MB
- Stars: 2,891
- Watchers: 33
- Forks: 416
- Open Issues: 73
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
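- StarryDivineSky - KevinWang676/Bark-Voice-Cloning - The Voice-Cloning project aims to implement Chinese voice cloning, based on the Suno-AI/Bark model. It lets users train the model on their own voice data to generate personalized speech. Its distinguishing feature is support for Chinese voice cloning, along with detailed tutorials and examples. It works by combining the Bark model's strong text-to-speech capability with fine-tuning on user-provided voice data, so that the model can imitate the timbre and style of the target voice. The project covers the key steps of data preparation, model training, and speech synthesis, and users can complete the voice cloning process easily with the provided scripts and tools. The project opens up new possibilities for research and applications in Chinese speech synthesis and lets users create unique voice experiences. It is suited not only to personal entertainment but may also be useful in areas such as voice assistants and content creation. The project also provides pretrained models and related resources to help users get started quickly. (Speech Synthesis / Resource Transfer and Download)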
README
# Bark Voice Cloning 🐶 & Voice Cloning for Chinese Speech 🎶
### [简体中文](https://github.com/KevinWang676/Bark-Voice-Cloning/blob/main/README_zh.md)
## 1️⃣ Bark Voice Cloning

> 10/19/2023: Fixed `ERROR: Exception in ASGI application` by specifying `gradio==3.33.0` and `gradio_client==0.2.7` in [requirements.txt](https://github.com/KevinWang676/Bark-Voice-Cloning/blob/main/requirements.txt).
> 11/08/2023: Integrated [KNN-VC](https://github.com/bshall/knn-vc) into [OpenAI TTS](https://platform.openai.com/docs/guides/text-to-speech) and created an easy-to-use Gradio interface. Try it [here](https://colab.research.google.com/github/KevinWang676/Bark-Voice-Cloning/blob/main/notebooks/OpenAI_TTS_KNN_VC_en.ipynb).
> 02/27/2024: We are thrilled to launch our most powerful **AI song cover generator** ever with [Shanghai Artificial Intelligence Laboratory](https://www.shlab.org.cn/)! You only need to provide the name of a song, and our application running on an **A100** GPU will handle everything else. Check it out on our [**website**](https://www.talktalkai.com/) (please click "EN" in the first tab of the website to see the English version)! 💕
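
The `gradio` pins from the 10/19/2023 note can be checked against a local environment before launching the interface. The following is a minimal sketch, not part of the repository: the helper name `find_mismatches` is hypothetical, and it assumes that exact version equality is the right test for these two pins.

```python
from importlib import metadata

# Versions quoted in the 10/19/2023 note.
PINNED = {"gradio": "3.33.0", "gradio_client": "0.2.7"}


def find_mismatches(pinned, get_version=metadata.version):
    """Return {package: (expected, installed_or_None)} for each unsatisfied pin.

    Hypothetical helper for illustration; `get_version` is injectable so the
    check can be exercised without touching the real environment.
    """
    mismatches = {}
    for name, expected in pinned.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            installed = None  # the package is not installed at all
        if installed != expected:
            mismatches[name] = (expected, installed)
    return mismatches


if __name__ == "__main__":
    for name, (expected, installed) in find_mismatches(PINNED).items():
        print(f"{name}: expected {expected}, found {installed}")
```

If the script prints nothing, both packages match the pinned versions; otherwise reinstalling from `requirements.txt` is the likely remedy.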
Based on [bark-gui](https://github.com/C0untFloyd/bark-gui) and [bark](https://github.com/suno-ai/bark). Thanks to [C0untFloyd](https://github.com/C0untFloyd).
**Quick start**: [**Colab Notebook**](https://colab.research.google.com/github/KevinWang676/Bark-Voice-Cloning/blob/main/Bark_Voice_Cloning.ipynb) ⚡
**HuggingFace Demo**: [**Bark Voice Cloning**](https://huggingface.co/spaces/kevinwang676/Bark-with-Voice-Cloning) 🤗 (requires a GPU)
**Demo Video**: [**YouTube Video**](https://www.youtube.com/watch?v=IAf695dhkUc&t=4s)
If you would like to run the code locally, remember to replace the original path `/content/Bark-Voice-Cloning/bark/assets/prompts/file.npz` with the path of `file.npz` on your own computer.
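
One way to derive the local path is to keep the part of the Colab path that lies inside the repository and re-root it at your own clone. This is only a sketch: the helper name `local_prompt_path` is hypothetical, and it assumes your local clone keeps the same directory layout as the Colab checkout under `/content/Bark-Voice-Cloning`.

```python
from pathlib import Path, PurePosixPath

# Paths as they appear in the Colab environment.
COLAB_ROOT = "/content/Bark-Voice-Cloning"
COLAB_PROMPT = "/content/Bark-Voice-Cloning/bark/assets/prompts/file.npz"


def local_prompt_path(repo_root, colab_path=COLAB_PROMPT):
    """Map a Colab path inside the repository to the same file under a local clone.

    Hypothetical helper for illustration; raises ValueError if `colab_path`
    does not live under COLAB_ROOT.
    """
    relative = PurePosixPath(colab_path).relative_to(COLAB_ROOT)
    return Path(repo_root).joinpath(*relative.parts)


if __name__ == "__main__":
    # Replace the argument with the location of your own clone.
    print(local_prompt_path(Path.home() / "Bark-Voice-Cloning"))
```

The printed path is what you would substitute wherever the notebook or scripts reference the `/content/...` location.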
### If you like the quick start, please star this repository. ⭐⭐⭐
## Easy to use:
(1) First upload audio for voice cloning and click `Create Voice`.

(2) Choose the option called "file" in `Voice` if you'd like to use voice cloning.
(3) Click `Generate`. Done!

## 2️⃣ Voice Cloning for Chinese Speech
> 10/26/2023: Integrated labeling, training and inference into an easy-to-use user interface of SambertHifigan. Thanks to [wujohns](https://github.com/wujohns).

We want to point out that [Bark](https://github.com/suno-ai/bark) is very good at generating English speech but relatively poor at generating Chinese speech. So we adopt another approach, called [SambertHifigan](https://www.modelscope.cn/models/speech_tts/speech_sambert-hifigan_tts_zh-cn_multisp_pretrain_16k/summary), to realize voice cloning for Chinese speech. Please check out our [Colab Notebook](https://colab.research.google.com/github/KevinWang676/Bark-Voice-Cloning/blob/main/Voice_Cloning_for_Chinese_Speech_v2.ipynb) for the implementation.
Quick start: [Colab Notebook](https://colab.research.google.com/github/KevinWang676/Bark-Voice-Cloning/blob/main/sambert-ui/Sambert_UI.ipynb) ⚡
HuggingFace demo: [Voice Cloning for Chinese Speech](https://huggingface.co/spaces/kevinwang676/Personal-TTS) 🤗
[Star History](https://star-history.com/#KevinWang676/Bark-Voice-Cloning&Date)