https://github.com/philippe2023/speech-ai

deep-translator elevenlabs ffmpeg pydub python3 streamlit

Last synced: 4 months ago
JSON representation

Host: GitHub
URL: https://github.com/philippe2023/speech-ai
Owner: philippe2023
Created: 2024-11-06T20:21:46.000Z (8 months ago)
Default Branch: master
Last Pushed: 2024-11-07T12:59:50.000Z (8 months ago)
Last Synced: 2025-01-12T08:21:17.806Z (6 months ago)
Topics: deep-translator, elevenlabs, ffmpeg, pydub, python3, streamlit
Language: Python
Homepage:
Size: 6.84 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# VoiceVerse

VoiceVerse is a versatile application built with Streamlit, allowing users to experience advanced text-to-speech, translation, and conversational simulations. This app leverages the ElevenLabs API for high-quality voice synthesis and `deep-translator` for translation capabilities.

## Features

- **Text-to-Speech**: Enter text and listen to it spoken in different voices and languages.
- **Translation**: Translate text between supported languages.
- **Translate & Speak**: Translate text and generate audio in the translated language.
- **Simulated Conversation**: Create a back-and-forth conversation between two people with different text inputs, voices, and languages.
- **Merged Conversation Audio**: Simulate a conversation and merge both audio clips into a single file.

## Project Structure

The application is organized as follows:

```
app/
├── main.py # Main app file with sidebar navigation
├── pages/
│ ├── speech.py # Page for basic text-to-speech
│ ├── translator.py # Page for translation
│ ├── translator_plus.py # Page for translation + TTS
│ ├── convo.py # Page for basic conversation simulation
│ └── convo_merge.py # Page for merged conversation audio
└── config.py # File for API keys and configuration
```

## Setup and Installation

1. Clone the repository:

```bash
git clone
cd voiceverse
```

2. Install dependencies:

```bash
pip install -r requirements.txt
```

3. Ensure you have `ffmpeg` installed on your system, as it is required by `pydub`:

- **macOS**: `brew install ffmpeg`
- **Ubuntu**: `sudo apt-get install ffmpeg`
- **Windows**: Download and add to system PATH from [ffmpeg.org](https://ffmpeg.org/download.html).

4. Set up API keys:

- In `config.py`, add your ElevenLabs API key.

## Usage

To start the app, run:

```bash
streamlit run app/main.py
```

Then, use the sidebar to navigate through the different features.

## Requirements

- Streamlit
- ElevenLabs Python API
- deep-translator
- pydub
- ffmpeg (for audio processing)

## License

This project is licensed under the MIT License.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/philippe2023/speech-ai

Awesome Lists containing this project

README