Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/PlayVoice/lora-svc
singing voice change based on whisper, and lora for singing voice clone
lora singing-voice-conversion speech-to-sing uni-svc vits vits-svc voice-change voice-cloning voice-conversion whisper
Last synced: 10 Jul 2024
![](https://github.com/PlayVoice.png)
https://github.com/expectopatronm/Realtime-voice-cloning-as-a-microservice
SV2TTS as a Microservice (FastAPI endpoint)
Last synced: 10 Jul 2024
![](https://github.com/expectopatronm.png)
https://github.com/jackaduma/CycleGAN-VC2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
aigc cyclegan cyclegan-vc cyclegan-vc2 deep-learning deeplearning gan pix2pix pytorch-implementation speech-synthesis voice-cloning voice-conversion
Last synced: 10 Jul 2024
![](https://github.com/jackaduma.png)
https://github.com/FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
multilingual speech-generation voice-cloning
Last synced: 08 Jul 2024
![](https://github.com/FunAudioLLM.png)
https://github.com/IAHispano/Applio
VITS-based Voice Conversion focused on simplicity, quality and performance.
ai applio pytorch rvc speech speech-to-speech text-to-speech vc vits voice voice-clone voice-cloning voice-conversion
Last synced: 08 Jul 2024
![](https://github.com/IAHispano.png)
https://github.com/Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
code-switching multilingual speech-synthesis text-to-speech tts voice-cloning
Last synced: 07 Jul 2024
![](https://github.com/Tomiinek.png)
https://github.com/BenAAndrew/Voice-Cloning-App
A Python/Pytorch app for easily synthesising human voices
deep-learning python pytorch tacotron2 text-to-speech tts voice-cloning
Last synced: 05 Jul 2024
![](https://github.com/BenAAndrew.png)
https://github.com/coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
speech-emotion-recognition speech-processing speech-recognition speech-separation speech-synthesis speech-to-text stt text-to-speech tts voice-activity-detection voice-cloning voice-recognition
Last synced: 22 Jun 2024
![](https://github.com/coqui-ai.png)
https://github.com/gitmylo/audio-webui
A webui for different audio related Neural Networks
ai aio all-in-one artificial-intelligence audiocraft audioldm bark bark-gui generative-audio generative-music music rvc rvc-gui text-to-audio text-to-speech tts voice-cloning
Last synced: 11 Jun 2024
![](https://github.com/gitmylo.png)
https://github.com/Sharad24/Neural-Voice-Cloning-with-Few-Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
encodings mel-spectrogram speaker-embeddings speaker-encodings speech speech-processing speech-synthesis voice-cloning
Last synced: 10 Jun 2024
![](https://github.com/Sharad24.png)
https://github.com/wladradchenko/wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
controlnet deepfake deepfake-emotion deepfakes diffusion face-swap face-swapping free image-animation retouching-video segment-anything tacotron2 talking-face talking-face-generation talking-head tts vid2vid voice-cloning voice-recognition wunjo
Last synced: 18 May 2024
![](https://github.com/wladradchenko.png)
https://github.com/PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper
Last synced: 15 May 2024
![](https://github.com/PaddlePaddle.png)
https://github.com/coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
deep-learning glow-tts hifigan melgan multi-speaker-tts python pytorch speaker-encoder speaker-encodings speech speech-synthesis tacotron text-to-speech tts tts-model vocoder voice-cloning voice-conversion voice-synthesis
Last synced: 08 May 2024
![](https://github.com/coqui-ai.png)
https://github.com/vlomme/Multi-Tacotron-Voice-Cloning
Phoneme multilingual(Russian-English) voice cloning based on
deep-learning g2p pytorch russian tacotron tensorflow tts voice-cloning wavernn
Last synced: 27 Apr 2024
![](https://github.com/vlomme.png)
https://github.com/CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
deep-learning python pytorch tensorflow tts voice-cloning
Last synced: 20 Apr 2024
![](https://github.com/CorentinJ.png)
https://github.com/RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
text-to-speech tts vits voice-clone voice-cloneai voice-cloning
Last synced: 08 Apr 2024
![](https://github.com/RVC-Boss.png)
https://github.com/CMsmartvoice/One-Shot-Voice-Cloning
:relaxed: One Shot Voice Cloning base on Unet-TTS
one-shot style-transfer tts voice-cloning
Last synced: 01 Apr 2024
![](https://github.com/CMsmartvoice.png)
https://github.com/FlorianEagox/WeeaBlind
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
a11y accessibility anime blindness diariz dubbing python tts voice-cloning
Last synced: 30 Mar 2024
![](https://github.com/FlorianEagox.png)