https://github.com/egorsmkv/ukrainian-tts-datasets
πΊπ¦ Open Source Ukrainian Text-to-Speech datasets
https://github.com/egorsmkv/ukrainian-tts-datasets
speech-ai text-to-speech tts ukrainian
Last synced: 5 months ago
JSON representation
πΊπ¦ Open Source Ukrainian Text-to-Speech datasets
- Host: GitHub
- URL: https://github.com/egorsmkv/ukrainian-tts-datasets
- Owner: egorsmkv
- License: apache-2.0
- Created: 2024-08-15T13:54:30.000Z (about 1 year ago)
- Default Branch: master
- Last Pushed: 2025-02-24T23:52:58.000Z (8 months ago)
- Last Synced: 2025-02-25T00:29:50.470Z (8 months ago)
- Topics: speech-ai, text-to-speech, tts, ukrainian
- Language: Python
- Homepage:
- Size: 11.7 KB
- Stars: 15
- Watchers: 2
- Forks: 2
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# πΊπ¦ Open Source Ukrainian Text-to-Speech datasets
The texts for these datasets are from [Texts for the Ukrainian Text-to-Speech dataset](https://github.com/egorsmkv/uk-tts-dataset-text)
## Community
- **Discord**: https://bit.ly/discord-uds
- Speech Recognition: https://t.me/speech_recognition_uk
- Speech Synthesis: https://t.me/speech_synthesis_uk## Dataset
Look https://huggingface.co/datasets/Yehor/opentts-uk
## Voices
### Female
#### [Lada](https://github.com/egorsmkv/ukrainian-tts-datasets/tree/master/lada)
- Quality: high
- Duration: 10h37m
- Audio formats: OPUS
- Frequency: 48000 HzListen to [DEMO](https://huggingface.co/spaces/theodotus/ukrainian-voices) (choose "lada" in the Voice field)
#### [Tetiana](https://github.com/egorsmkv/ukrainian-tts-datasets/tree/master/tetiana)
- Quality: high
- Duration: 8h
- Audio formats: OPUS
- Frequency: 48000 Hz#### [Kateryna](https://github.com/egorsmkv/ukrainian-tts-datasets/tree/master/kateryna)
- Quality: high
- Duration: 2h40m
- Audio formats: OPUS
- Frequency: 48000 Hz### Male
#### [Mykyta](https://github.com/egorsmkv/ukrainian-tts-datasets/tree/master/mykyta)
- Quality: high
- Duration: 8h10m
- Audio formats: OPUS
- Frequency: 48000 HzListen to [DEMO](https://huggingface.co/spaces/theodotus/ukrainian-voices) (choose "mykyta" in the Voice field)
#### [Oleksa](https://github.com/egorsmkv/ukrainian-tts-datasets/tree/master/oleksa)
- Quality: high
- Duration: 6h
- Audio formats: OPUS
- Frequency: 48000 Hz## Appearance on the web
- Align Text to Audio and Trim Silence: https://github.com/proger/uk
- NVIDIA's Flowtron: https://github.com/egorsmkv/ukrainian-flowtron-tts
- HF demos:
- https://huggingface.co/spaces/robinhad/ukrainian-tts
- https://huggingface.co/spaces/theodotus/ukrainian-voices
- Lada: Ukrainian High-Quality Female Text-to-Speech Dataset: https://zenodo.org/record/7396774
- Google Colabs (RADTTS model):
- https://colab.research.google.com/drive/13aa0o9fQknDcJtpLrGXhxWPvZpeUggCy?usp=sharing
- https://colab.research.google.com/drive/1pgiBlMm4tk0atKrszStOSy6XaTDnc3v4?usp=sharing
- Lada is in Piper - https://github.com/rhasspy/piper - A fast, local neural text to speech system
- Tetiana in Balacoon - https://balacoon.com/blog/uk_release/
- Demo: https://huggingface.co/spaces/balacoon/tts