Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/philippe2023/speech-ai
https://github.com/philippe2023/speech-ai
Last synced: 7 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/philippe2023/speech-ai
- Owner: philippe2023
- Created: 2024-11-06T20:21:46.000Z (13 days ago)
- Default Branch: master
- Last Pushed: 2024-11-06T20:26:24.000Z (13 days ago)
- Last Synced: 2024-11-06T21:29:29.289Z (13 days ago)
- Language: Python
- Size: 0 Bytes
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# VoiceVerse
VoiceVerse is a versatile application built with Streamlit, allowing users to experience advanced text-to-speech, translation, and conversational simulations. This app leverages the ElevenLabs API for high-quality voice synthesis and `deep-translator` for translation capabilities.
## Features
- **Text-to-Speech**: Enter text and listen to it spoken in different voices and languages.
- **Translation**: Translate text between supported languages.
- **Translate & Speak**: Translate text and generate audio in the translated language.
- **Simulated Conversation**: Create a back-and-forth conversation between two people with different text inputs, voices, and languages.
- **Merged Conversation Audio**: Simulate a conversation and merge both audio clips into a single file.## Project Structure
The application is organized as follows:
```
app/
├── main.py # Main app file with sidebar navigation
├── pages/
│ ├── speech.py # Page for basic text-to-speech
│ ├── translator.py # Page for translation
│ ├── translator_plus.py # Page for translation + TTS
│ ├── convo.py # Page for basic conversation simulation
│ └── convo_merge.py # Page for merged conversation audio
└── config.py # File for API keys and configuration
```## Setup and Installation
1. Clone the repository:
```bash
git clone
cd voiceverse
```2. Install dependencies:
```bash
pip install -r requirements.txt
```3. Ensure you have `ffmpeg` installed on your system, as it is required by `pydub`:
- **macOS**: `brew install ffmpeg`
- **Ubuntu**: `sudo apt-get install ffmpeg`
- **Windows**: Download and add to system PATH from [ffmpeg.org](https://ffmpeg.org/download.html).4. Set up API keys:
- In `config.py`, add your ElevenLabs API key.
## Usage
To start the app, run:
```bash
streamlit run app/main.py
```Then, use the sidebar to navigate through the different features.
## Requirements
- Streamlit
- ElevenLabs Python API
- deep-translator
- pydub
- ffmpeg (for audio processing)## License
This project is licensed under the MIT License.