https://github.com/koljab/aivoicechat

Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming
https://github.com/koljab/aivoicechat

chatbot elevenlabs elevenlabs-api gpt-4 openai

Last synced: over 1 year ago
JSON representation

Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming

Host: GitHub
URL: https://github.com/koljab/aivoicechat
Owner: KoljaB
Created: 2023-08-14T20:00:51.000Z (almost 3 years ago)
Default Branch: main
Last Pushed: 2024-06-08T07:01:48.000Z (about 2 years ago)
Last Synced: 2025-03-30T15:09:03.683Z (over 1 year ago)
Topics: chatbot, elevenlabs, elevenlabs-api, gpt-4, openai
Language: Python
Homepage:
Size: 39.1 KB
Stars: 277
Watchers: 4
Forks: 55
Open Issues: 7
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

Seamless and real-time voice interaction with AI.

> **Hint:** *Anybody interested in state-of-the-art voice solutions please also have a look at [Linguflex](https://github.com/KoljaB/Linguflex). It lets you control your environment by speaking and is one of the most capable and sophisticated open-source assistants currently available.*

Uses faster_whisper and elevenlabs input streaming for low latency responses to spoken input.

**[🎥 Watch a Demo Video](https://www.youtube.com/watch?v=lq_Q6y47iUU)**
> **Note**: The demo is conducted on a 10Mbit/s connection, so actual performance might be more impressive on faster connections.

`voice_talk_vad.py` - automatically detects speech

`voice_talk.py` - toggle recording on/off with the spacebar

## 🛠 Setup:

### 1. API Keys:

Replace `your_openai_key` and `your_elevenlabs_key` with your OpenAI and ElevenLabs API key values in the code.

### 2. Dependencies:

Install the required Python libraries:
```bash
pip install openai elevenlabs pyaudio wave keyboard faster_whisper numpy torch
```

### 3. Run the Script:

Execute the main script based on your mode preference:

```bash
python voice_talk_vad.py
```
or
```bash
python voice_talk.py
```
## 🎙 How to Use:

### For `voice_talk_vad.py`:

Talk into your microphone.
Listen to the reply.

### For `voice_talk.py`:

1. Press the **space bar** to initiate talk.
2. Speak your heart out.
3. Hit the **space bar** again once you're done.
4. Listen to reply.

## 🤝 Contribute

Feel free to fork, improve, and submit pull requests. If you're considering significant changes or additions, please start by opening an issue.

## 💖 Acknowledgements

Huge shoutout to:
- The hardworking developers behind [faster_whisper](https://github.com/guillaumekln/faster-whisper).
- [ElevenLabs](https://www.elevenlabs.io/) for their cutting-edge voice API.
- [OpenAI](https://www.openai.com/) for pioneering with the GPT-4 model.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/koljab/aivoicechat

Awesome Lists containing this project

README