https://github.com/SadeghKrmi/pertts-streamlit
Persian text-to-speech streamlit interface
https://github.com/SadeghKrmi/pertts-streamlit
deep-learning machine-learning persian-dataset persian-language persian-tts text-to-speech tts-model vits
Last synced: 12 months ago
JSON representation
Persian text-to-speech streamlit interface
- Host: GitHub
- URL: https://github.com/SadeghKrmi/pertts-streamlit
- Owner: SadeghKrmi
- License: apache-2.0
- Created: 2024-01-04T14:11:08.000Z (over 2 years ago)
- Default Branch: master
- Last Pushed: 2024-12-09T20:31:12.000Z (over 1 year ago)
- Last Synced: 2024-12-09T21:29:45.093Z (over 1 year ago)
- Topics: deep-learning, machine-learning, persian-dataset, persian-language, persian-tts, text-to-speech, tts-model, vits
- Language: Python
- Homepage: https://tts.datacula.com/
- Size: 109 MB
- Stars: 26
- Watchers: 4
- Forks: 4
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## pertts (persian text-to-speech)
This is the implementation and a web interface for pertts (Persian text-to-speech)
powered by [piper](https://github.com/rhasspy/piper)
the live version of persian tts called [pertts](https://tts.datacula.com/)

with love from datacula.com
**Voices**: We are using an AI-based TTS system, trained with
- amir: Amir Sooakhsh's voice from [rokhpodcast](https://rokhpodcast.ir/), Special thanks to Amir :)
- ganji: Ahmad Ganji voice
- gnaji-adabi: Ahmad Ganji voice, poetic version
### dataset
open-source-dataset[amir]: https://huggingface.co/datasets/SadeghK/datacula-pertts-amir
### models
All models are available in [HuggingFace](https://huggingface.co/SadeghK/persian-text-to-speech/tree/main/farsi).
### 🛠️ Installation
#### docker
Build with docker from scratch and run
```bash
docker build --no-cache -t pertts:1.0 .
docker container run --name st --rm -it -p 8501:8501 pertts:1.0
```
Run the latest version of the docker image from docker hub
```bash
docker image pull sadeghk/pertts
docker container run --name st --rm -it -p 8501:8501 sadeghk/pertts
```
#### python
install piper-tts using pip and download the model in pertts-streamlit/model directory
``
pip install piper-tts
```
and then run
```bash
echo 'سلام و درود بر همه فارسی زبانان' | piper \
--model epoch=5261-step=2455712.onnx \
--output_file dorood.wav
```
#### Windows
download the executables for windows _piper_windows_amd64.zip_ from [piper](https://github.com/rhasspy/piper/releases), and unzip
go to the piper directory where piper.exe exists and create a folder with the name models. Download the model for Persian/Farsi from [huggingface](https://huggingface.co/rhasspy/piper-voices/tree/main/fa/fa_IR/amir/medium) with name _fa_IR-amir-medium.onnx_ and _fa_IR-amir-medium.onnx.json_ to models directory.
open a PowerShell and `cd` to the directory where piper.exe exists, and run
```bash
echo "سلام و درود بر شما" | .\piper.exe --model .\models\fa_IR-amir-medium.onnx --output_dir .\outputs
```