https://github.com/philippe2023/speech-ai
https://github.com/philippe2023/speech-ai
deep-translator elevenlabs ffmpeg pydub python3 streamlit
Last synced: 4 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/philippe2023/speech-ai
- Owner: philippe2023
- Created: 2024-11-06T20:21:46.000Z (8 months ago)
- Default Branch: master
- Last Pushed: 2024-11-07T12:59:50.000Z (8 months ago)
- Last Synced: 2025-01-12T08:21:17.806Z (6 months ago)
- Topics: deep-translator, elevenlabs, ffmpeg, pydub, python3, streamlit
- Language: Python
- Homepage:
- Size: 6.84 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# VoiceVerse
VoiceVerse is a versatile application built with Streamlit, allowing users to experience advanced text-to-speech, translation, and conversational simulations. This app leverages the ElevenLabs API for high-quality voice synthesis and `deep-translator` for translation capabilities.
## Features
- **Text-to-Speech**: Enter text and listen to it spoken in different voices and languages.
- **Translation**: Translate text between supported languages.
- **Translate & Speak**: Translate text and generate audio in the translated language.
- **Simulated Conversation**: Create a back-and-forth conversation between two people with different text inputs, voices, and languages.
- **Merged Conversation Audio**: Simulate a conversation and merge both audio clips into a single file.## Project Structure
The application is organized as follows:
```
app/
├── main.py # Main app file with sidebar navigation
├── pages/
│ ├── speech.py # Page for basic text-to-speech
│ ├── translator.py # Page for translation
│ ├── translator_plus.py # Page for translation + TTS
│ ├── convo.py # Page for basic conversation simulation
│ └── convo_merge.py # Page for merged conversation audio
└── config.py # File for API keys and configuration
```## Setup and Installation
1. Clone the repository:
```bash
git clone
cd voiceverse
```2. Install dependencies:
```bash
pip install -r requirements.txt
```3. Ensure you have `ffmpeg` installed on your system, as it is required by `pydub`:
- **macOS**: `brew install ffmpeg`
- **Ubuntu**: `sudo apt-get install ffmpeg`
- **Windows**: Download and add to system PATH from [ffmpeg.org](https://ffmpeg.org/download.html).4. Set up API keys:
- In `config.py`, add your ElevenLabs API key.
## Usage
To start the app, run:
```bash
streamlit run app/main.py
```Then, use the sidebar to navigate through the different features.
## Requirements
- Streamlit
- ElevenLabs Python API
- deep-translator
- pydub
- ffmpeg (for audio processing)## License
This project is licensed under the MIT License.