Projects in Awesome Lists tagged with xtts
A curated list of projects in awesome lists tagged with xtts .
https://github.com/drewthomasson/ebook2audiobook
Generate audiobooks from e-books, voice cloning & 1158+ languages!
audiobook audiobooks chinese colab-notebook docker english epub gradio kaggle linux mac multilingual tts voice-cloning windows xtts
Last synced: 02 May 2026
https://github.com/DrewThomasson/ebook2audiobook
Generate audiobooks from e-books, voice cloning & 1107+ languages!
audiobook audiobooks chinese colab-notebook docker english epub gradio kaggle linux mac multilingual tts voice-cloning windows xtts
Last synced: 14 Aug 2025
https://github.com/daswer123/xtts-webui
Webui for using XTTS and for finetuning it
cocqui finetuning tts xtts xttsv2
Last synced: 15 May 2025
https://github.com/daswer123/xtts-api-server
A simple FastAPI Server to run XTTSv2
realtime-tts sillytavern tts tts-api xtts xttsv2
Last synced: 15 May 2025
https://github.com/voxos-ai/bolna
End-to-end platform for building voice first multimodal agents
anyscale chatgpt-api claude-3-sonnet deepgram elevenlabs fastapi gpt-4o llama3 llm mistral openai perplexity-api polly telephony twilio voice-assistant websocket-chat websockets whisper xtts
Last synced: 15 May 2025
https://github.com/lukaszliniewicz/Pandrator
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
audiobook audiobook-creator audiobook-maker audiobooks customtkinterprojects dubbing llm pdf-to-audio rvc silero subtitle-to-speech subtitle-to-voice text-processing text-to-speech tkinter-gui voice-clone voice-cloning voicecraft xtts xttsv2
Last synced: 06 Oct 2025
https://github.com/aman179102/podvoice
Local-first CLI that turns Markdown scripts into multi-speaker podcast-style audio using Coqui XTTS v2.
ai-audio automation cli content-creation coqui-tts developer-tools local-first local-first-ai markdown-to-audio offline-ai open-source-cli opensource podcast python text-to-speech tts xtts
Last synced: 31 May 2026
https://github.com/drewthomasson/doc2interview
This is an interface that will offline convert anything pdf document you give it into an interview between two people discussing it.
generative-ai ollama pdf tts xtts
Last synced: 03 Mar 2026
https://github.com/merekat/children-stories
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.
ai audio-generation data-science image-generation large-language-models llama lora lux neural-networks stable-diffusion story text-generation tts xtts
Last synced: 04 Jul 2025
https://github.com/5ekastanx/voice-synthesis
Проект для синтеза речи с использованием модели fish-speech/xtts. Позволяет преобразовывать текст в речи с клонированием голоса.
Last synced: 03 Sep 2025
https://github.com/mahshid1378/voice-chat-ai
🎙️ Speak with AI - Run locally using Ollama, OpenAI or xAI - Speech uses XTTS, OpenAI or ElevenLabs
ai-chat ai-chatbot ai-speech ai-voice elevenlabs-api fastapi gpt-4o-mini-tts llama3 llm mistral ollama openai openai-api python selfhosted tts whisper-ai xai xtts
Last synced: 13 Apr 2026
https://github.com/work-nobu/ohanashigpt
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.
ai audio-generation data-science image-generation large-language-models llama3 llamacpp lora low-rank-adaptation stable-diffusion text-generation xtts
Last synced: 09 Feb 2026
https://github.com/everlastconsulting/gpt-oss-local-voice-agent-demo
Demo-Repository (aus YouTube-Video) für einen lokalen Open-Source Sprachassistenten mit gpt-oss, Whisper & XTTS. Bereitstellung zur Inspiration, Weiterverwendung und Erweiterung gedacht.
demo gpt-oss ollama voice-agent voice-assistant whisper xtts
Last synced: 17 Apr 2026
https://github.com/koppalexander/ohanashi-childgpt
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.
ai audio-generation data-science flux generative-ai image-generation large-language-models llama lora neural-networks stable-diffusion story text-generation tts xtts
Last synced: 09 Mar 2026
https://github.com/musika08/audiobooks
AI Audiobook Studio — WPF/.NET 8 app converting TXT/PDF/EPUB to WAV/MP3/M4B audiobooks with offline Kokoro TTS and optional GPU voice cloning (XTTS/Chatterbox).
audiobook dotnet kokoro text-to-speech tts wpf xtts
Last synced: 14 Jun 2026
https://github.com/veralvx/xtts-finetune
XTTS fine-tuning via CLI
ai ai-training audio audio-processing coqui coqui-tts docker dockerfile fine-tuning finetuning python python3 text-to-speech training tts tts-model uv xtts xtts-v2 xttsv2
Last synced: 17 May 2026
https://github.com/bilelouahmed/vocal-assistant
Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.
chatbot llm mistral-7b neo4j python rag speech-recognition text-to-speech transcription whisper xtts
Last synced: 10 Feb 2026
https://github.com/sinhaankur/voice-harvester
Extract a clean, isolated voice from any video/audio file — ready for AI voice cloning (XTTS, RVC, ElevenLabs). Drag-and-drop app + CLI.
ai audio demucs desktop-app elevenlabs ffmpeg python rvc speech tts voice-cloning voice-extraction xtts
Last synced: 02 Jul 2026