https://github.com/asiff00/training-tts
Train and finutune text-to-speech models for Bengali and many other languages!
https://github.com/asiff00/training-tts
audio-processing finetune kokoro-tts speech-synthesis styletts2 text-to-speech tts xttsv2
Last synced: 6 months ago
JSON representation
Train and finutune text-to-speech models for Bengali and many other languages!
- Host: GitHub
- URL: https://github.com/asiff00/training-tts
- Owner: asiff00
- License: mit
- Created: 2025-03-05T04:55:03.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2025-04-02T06:20:33.000Z (6 months ago)
- Last Synced: 2025-04-02T07:25:56.749Z (6 months ago)
- Topics: audio-processing, finetune, kokoro-tts, speech-synthesis, styletts2, text-to-speech, tts, xttsv2
- Language: Jupyter Notebook
- Homepage:
- Size: 140 KB
- Stars: 4
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Training TTS
This is a repository to train TTS models. All the training/finetuning are focusing on Bengali language here but can be used for any languages with minimal changes. The most confusing part is the data formatting; once you get it right for the dataset in hand, the rest just becomes easier. I'll be posting more about it gradually. I will also improve the other parts of the repo as well.
# Trained/Finetuned Models
[Bangla TTS](https://huggingface.co/collections/asif00/bangla-tts-67e6d833ef7210bc3fb73375) contains a list of models trained/tuned using scripts from this repository. Model that we have added to the collections so far are:- https://huggingface.co/asif00/orpheus-bangla-tts
- https://huggingface.co/asif00/orpheus-bangla-tts-gguf-8bit
- https://huggingface.co/asif00/orpheus-bangla-tts-4bit
- https://huggingface.co/asif00/orpheus-bangla-tts-gguf# References
1. https://colab.research.google.com/drive/1PdcCcTUUYAxBM3_mDT2_UNz0ZoUkMee_?usp=sharing#scrollTo=hPz96-1-ttl
2. https://colab.research.google.com/drive/1ZK-2lAV2DokrN92sYPJRl47XyuuluCJe