https://github.com/SadeghKrmi/pertts-streamlit

Persian text-to-speech streamlit interface
deep-learning machine-learning persian-dataset persian-language persian-tts text-to-speech tts-model vits

Host: GitHub
URL: https://github.com/SadeghKrmi/pertts-streamlit
Owner: SadeghKrmi
License: apache-2.0
Created: 2024-01-04T14:11:08.000Z (over 2 years ago)
Default Branch: master
Last Pushed: 2024-12-09T20:31:12.000Z (over 1 year ago)
Last Synced: 2024-12-09T21:29:45.093Z (over 1 year ago)
Topics: deep-learning, machine-learning, persian-dataset, persian-language, persian-tts, text-to-speech, tts-model, vits
Language: Python
Homepage: https://tts.datacula.com/
Size: 109 MB
Stars: 26
Watchers: 4
Forks: 4
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

## pertts (Persian text-to-speech)
This repository contains the implementation of pertts (Persian text-to-speech) and a Streamlit web interface for it.

A live version of pertts is available at [tts.datacula.com](https://tts.datacula.com/).

![image](https://github.com/SadeghKrmi/pertts-streamlit/assets/5988663/9b8c751b-d5e7-42eb-9c7f-2516aed6baa6)
with love from datacula.com

**Voices**: The AI-based TTS system is trained on the following voices:
- amir: Amir Sooakhsh's voice from [rokhpodcast](https://rokhpodcast.ir/). Special thanks to Amir :)
- ganji: Ahmad Ganji's voice
- ganji-adabi: Ahmad Ganji's voice, poetic version

### dataset
Open-source dataset (amir): https://huggingface.co/datasets/SadeghK/datacula-pertts-amir

### models
All models are available on [Hugging Face](https://huggingface.co/SadeghK/persian-text-to-speech/tree/main/farsi).

### 🛠️ Installation

#### docker
Build the Docker image from scratch and run it:
```bash
docker build --no-cache -t pertts:1.0 .
docker container run --name st --rm -it -p 8501:8501 pertts:1.0
```

Alternatively, pull and run the latest image from Docker Hub:
```bash
docker image pull sadeghk/pertts
docker container run --name st --rm -it -p 8501:8501 sadeghk/pertts
```
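
Once either container is running, the Streamlit interface should be reachable on port 8501 of the host, as mapped by `-p 8501:8501`. The following is a minimal sketch for checking this from Python using only the standard library. The function name `app_is_reachable` and the default URL `http://localhost:8501` are assumptions for illustration and are not part of this repository.

```python
import urllib.request


def app_is_reachable(url="http://localhost:8501", timeout=3.0):
    """Return True if an HTTP GET to `url` answers with status 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # URLError and HTTPError are subclasses of OSError, so this covers
        # refused connections, timeouts and HTTP error responses.
        return False


# Example call, after starting the container:
# print(app_is_reachable())
```

If the check returns `False`, confirm that the container is still running and that no other process is using port 8501.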

#### python
Install piper-tts using pip and download the model into the `pertts-streamlit/model` directory.

```bash
pip install piper-tts
```
Then run:
```bash
echo 'سلام و درود بر همه فارسی زبانان' | piper \
--model epoch=5261-step=2455712.onnx \
--output_file dorood.wav
```
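
The same command can also be driven from Python, for example to synthesize text from a script. The sketch below is only an illustration: it assumes that the `piper` executable installed by `pip install piper-tts` is on `PATH`, and the helper names `build_piper_command` and `synthesize` are hypothetical, not part of this repository or of piper.

```python
import subprocess
from pathlib import Path


def build_piper_command(model_path, output_file):
    """Return the argument list for the piper CLI call shown above."""
    return ["piper", "--model", str(model_path), "--output_file", str(output_file)]


def synthesize(text, model_path, output_file):
    """Pipe `text` to piper on stdin and return the path of the WAV file."""
    command = build_piper_command(model_path, output_file)
    # piper reads the input text from stdin; encode it as UTF-8 explicitly
    # so that Persian characters are passed through unchanged.
    subprocess.run(command, input=text.encode("utf-8"), check=True)
    return Path(output_file)


# Example call (requires piper and the model file to be present):
# synthesize("سلام و درود بر همه فارسی زبانان",
#            "epoch=5261-step=2455712.onnx", "dorood.wav")
```

Passing the arguments as a list avoids shell quoting issues, and `check=True` raises an exception if piper exits with an error.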

#### Windows
Download the Windows executables, _piper_windows_amd64.zip_, from the [piper releases page](https://github.com/rhasspy/piper/releases) and unzip the archive.
In the piper directory, where piper.exe is located, create a folder named models. Download the Persian/Farsi model files _fa_IR-amir-medium.onnx_ and _fa_IR-amir-medium.onnx.json_ from [Hugging Face](https://huggingface.co/rhasspy/piper-voices/tree/main/fa/fa_IR/amir/medium) into the models directory.

Open PowerShell, `cd` to the directory where piper.exe is located, and run:
```powershell
echo "سلام و درود بر شما" | .\piper.exe --model .\models\fa_IR-amir-medium.onnx --output_dir .\outputs
```
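
With `--output_dir`, piper is expected to write the generated WAV files into the given directory. As a quick sanity check, the sketch below lists each WAV file in a directory together with its sample rate and duration, using only the Python standard library. The function name `describe_wav_files` and the `outputs` directory name in the example call are illustrative assumptions.

```python
import wave
from pathlib import Path


def describe_wav_files(directory):
    """Return (file name, sample rate in Hz, duration in seconds) per WAV file."""
    results = []
    for path in sorted(Path(directory).glob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            rate = wav.getframerate()
            # Duration is the number of audio frames divided by frames per second.
            duration = wav.getnframes() / rate
        results.append((path.name, rate, duration))
    return results


# Example call, after running piper with the outputs directory:
# for name, rate, seconds in describe_wav_files("outputs"):
#     print(f"{name}: {rate} Hz, {seconds:.2f} s")
```

An empty result means no `.wav` files were found, which usually indicates that the command did not run successfully or wrote to a different directory.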
