Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/PlayVoice/lora-svc

singing voice change based on whisper, and lora for singing voice clone

lora singing-voice-conversion speech-to-sing uni-svc vits vits-svc voice-change voice-cloning voice-conversion whisper

Last synced: 10 Jul 2024

https://github.com/expectopatronm/Realtime-voice-cloning-as-a-microservice

SV2TTS as a Microservice (FastAPI endpoint)

fast-api voice-cloning

Last synced: 10 Jul 2024

https://github.com/FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

multilingual speech-generation voice-cloning

Last synced: 08 Jul 2024

https://github.com/IAHispano/Applio

VITS-based Voice Conversion focused on simplicity, quality and performance.

ai applio pytorch rvc speech speech-to-speech text-to-speech vc vits voice voice-clone voice-cloning voice-conversion

Last synced: 08 Jul 2024

https://github.com/Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

code-switching multilingual speech-synthesis text-to-speech tts voice-cloning

Last synced: 07 Jul 2024

https://github.com/BenAAndrew/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

deep-learning python pytorch tacotron2 text-to-speech tts voice-cloning

Last synced: 05 Jul 2024

https://github.com/wladradchenko/wunjo.wladradchenko.ru

Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.

controlnet deepfake deepfake-emotion deepfakes diffusion face-swap face-swapping free image-animation retouching-video segment-anything tacotron2 talking-face talking-face-generation talking-head tts vid2vid voice-cloning voice-recognition wunjo

Last synced: 18 May 2024

https://github.com/PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 15 May 2024

https://github.com/vlomme/Multi-Tacotron-Voice-Cloning

Phoneme multilingual(Russian-English) voice cloning based on

deep-learning g2p pytorch russian tacotron tensorflow tts voice-cloning wavernn

Last synced: 27 Apr 2024

https://github.com/CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

deep-learning python pytorch tensorflow tts voice-cloning

Last synced: 20 Apr 2024

https://github.com/RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

text-to-speech tts vits voice-clone voice-cloneai voice-cloning

Last synced: 08 Apr 2024

https://github.com/CMsmartvoice/One-Shot-Voice-Cloning

:relaxed: One Shot Voice Cloning base on Unet-TTS

one-shot style-transfer tts voice-cloning

Last synced: 01 Apr 2024

https://github.com/FlorianEagox/WeeaBlind

A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!

a11y accessibility anime blindness diariz dubbing python tts voice-cloning

Last synced: 30 Mar 2024