https://github.com/natserract/talk2-me

Talk2me Voice Assistant
https://github.com/natserract/talk2-me

Last synced: 7 days ago
JSON representation

Talk2me Voice Assistant

Host: GitHub
URL: https://github.com/natserract/talk2-me
Owner: natserract
Created: 2024-06-25T02:06:50.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-10-14T06:34:07.000Z (about 1 year ago)
Last Synced: 2025-04-04T07:34:15.923Z (7 months ago)
Language: Python
Size: 70.3 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Talk2-Me
Talk to me?

## **Mechanism Overview:**

**Pre-requisites:**
- Use a webcam to capture video stream.
- Provide WebRTC (e.g., Livekit) to stream the video in real-time.

**Input Processing:**
- Capture the image from the video stream.
- Use speech-to-text technology (Deepgram) to convert the user's voice input into text.
- Pass the captured image and text input as context to the LLM (OpenAI GPT-4).

**Response Generation:**
- LLM (OpenAI GPT-4) processes the input context and generates a text response.
- Convert the text response back to speech using a text-to-speech

![Screen Shot 2024-06-25 at 17 32 03](https://github.com/natserract/talk2-me/assets/31182611/7f8a7554-c6f3-40d7-b668-6c35c33c3cca)

## Running

```sh
rye pin 3.11.9

# Install packages
rye sync

# Download livekit plugins
rye run talk2-me download-files

# Running
rye run talk2-me start
```

## Playground
Go to https://agents-playground.livekit.io/

## WIP
- [x] Voice talks
- [ ] Video track & capturing

**Related Issue:**

```sh
LOG: [W NNPACK.cpp:64] Could not initialize NNPACK! Reason: Unsupported hardware.
```
> Still looking how to solve this issue. Currently only able to voice talks.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/natserract/talk2-me

Awesome Lists containing this project

README