https://github.com/veeeetzzzz/mars5-tts
Python implementation for the MARS5 TTS repo that allows you to clone a voice with a command line interface.
text-to-speech voice-cloning
Last synced: 4 months ago
- Host: GitHub
- URL: https://github.com/veeeetzzzz/mars5-tts
- Owner: Veeeetzzzz
- License: agpl-3.0
- Created: 2024-06-29T18:24:07.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-07-11T17:55:41.000Z (10 months ago)
- Last Synced: 2024-11-12T05:35:35.620Z (6 months ago)
- Topics: text-to-speech, voice-cloning
- Language: Python
- Homepage:
- Size: 811 KB
- Stars: 4
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# MARS5 Text-To-Speech (TTS) Deep Voice Clone
This is a Python implementation for the [MARS5 TTS repo](https://github.com/Camb-ai/MARS5-TTS) that allows you to clone a voice with a command line interface.
Try the [online demo here](https://6b1a3a8e53ae.ngrok.app/) for a quickstart, or follow the instructions below to run this project locally/offline.
# Roadmap
- EZ front end
- Enable PyTorch CUDA (supported devices only)

# Installation
Install [Python](https://www.python.org/downloads/)
Clone the repository or download the [.zip file](https://github.com/Veeeetzzzz/mars5-tts/archive/refs/heads/main.zip)
Open CMD/PowerShell in the project directory and run
`pip install -r requirements.txt`

# Usage
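
Before the first run, it can help to confirm that the packages listed in `requirements.txt` were actually installed. The sketch below is not part of this repo: `missing_requirements` is a hypothetical helper that only understands simple one-package-per-line entries (no URLs or editable installs) and uses the standard library's `importlib.metadata` to look each one up.

```python
import re
from importlib import metadata
from pathlib import Path


def missing_requirements(path="requirements.txt"):
    """Return the requirement names from `path` that are not installed."""
    missing = []
    for raw in Path(path).read_text().splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or line.startswith("-"):  # skip blanks and pip options
            continue
        # Keep only the distribution name, ignoring extras and version pins.
        name = re.split(r"[\[<>=!~; @]", line, maxsplit=1)[0].strip()
        if not name:
            continue
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Only runs the check when a requirements file is present.
    if Path("requirements.txt").exists():
        print("Missing packages:", missing_requirements() or "none")
```

If the list is not empty, re-running `pip install -r requirements.txt` in the same Python environment is the first thing to try.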
## Visual Studio Code
Download/clone this repo and open the folder in Visual Studio Code

Open the integrated terminal (Terminal -> New Terminal) and run
`python tts.py` (where `tts.py` is the file name of the Python script)

You'll be taken through the steps to start cloning - fill in your own values or press ENTER for default.
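
The "press ENTER for default" behaviour is a common prompt pattern. As a rough sketch of how such a prompt can work (this is not the actual code in `tts.py`; `ask`, `collect_settings`, and the setting names are hypothetical and chosen only for illustration):

```python
def ask(prompt, default, cast=str, input_fn=input):
    """Prompt for a value; an empty reply (just ENTER) returns `default`."""
    reply = input_fn(f"{prompt} [{default}]: ").strip()
    if not reply:
        return default
    try:
        return cast(reply)
    except ValueError:
        # Fall back rather than crash on input that cannot be converted.
        print(f"Could not read {reply!r}; using default {default!r}.")
        return default


def collect_settings(input_fn=input):
    """Walk through a few hypothetical settings, one prompt per value."""
    return {
        "text": ask("Text to speak", "Hello from MARS5.", input_fn=input_fn),
        "temperature": ask("Temperature", 0.7, cast=float, input_fn=input_fn),
    }
```

Passing `input_fn` as a parameter keeps the prompt logic testable without a keyboard; calling `collect_settings()` with no arguments reads from the terminal as usual.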

## Command Line/PowerShell
Download/clone this repo and open Command Line/PowerShell from the folder (File -> Open PowerShell Window)

Either Command Line or PowerShell works; the input and output look the same in both.
`python tts.py`

# Notes
- Keep the reference voice clip between 6 and 10 seconds long
- Tune config to get optimal output
- Provide transcript when possible
- Use sample provided for ease

# Known issues
- The system won't lock up, but CPU usage remains at 100% throughout the processing stage
- Results will vary. This is not a one-shot model, but the output seems far more impressive than ElevenLabs or Speechify, at the cost of longer processing time
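
Since processing is slow, it may be worth checking the 6-10 second guideline from the Notes before starting a run. The sketch below assumes the reference clip is an uncompressed PCM `.wav` file (other formats would need a different library) and uses only the standard library's `wave` module; `check_reference` is a hypothetical helper, not part of this repo.

```python
import wave


def wav_duration_seconds(path):
    """Return the length of an uncompressed WAV file in seconds."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def check_reference(path, low=6.0, high=10.0):
    """Return (ok, duration) for the 6-10 second reference guideline."""
    duration = wav_duration_seconds(path)
    return low <= duration <= high, duration
```

For example, `ok, seconds = check_reference("reference.wav")` (the file name is a placeholder) tells you whether to trim or extend the clip before running `python tts.py`.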