https://github.com/malolm/vocal-assistant-amadeus-prototype
An advanced AI personal assistant, enabling great voice interactions through a novel human-machine interface.
- Host: GitHub
- URL: https://github.com/malolm/vocal-assistant-amadeus-prototype
- Owner: MaloLM
- License: apache-2.0
- Created: 2023-10-15T11:12:09.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-12-04T15:35:19.000Z (over 1 year ago)
- Last Synced: 2025-05-05T04:14:33.999Z (about 2 months ago)
- Topics: electron, elevenlabs, fastify, nodejs, openai-api, websocket
- Language: JavaScript
- Homepage:
- Size: 179 KB
- Stars: 8
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
# Amadeus Prototype
### Product introduction
Amadeus is a cutting-edge personal assistant prototype that taps into recent advances in generative AI. Designed around a novel human-machine interface (HMI), it lets users hold vocal interactions with a sophisticated AI, specifically a Large Language Model. Amadeus comprises two core components: an Electron client and a Node server. This prototype serves as a proof of concept, with ambitions of broader commercial deployment.
## DEMO VIDEO
[Demo video on YouTube](https://www.youtube.com/watch?v=B64jnq85leU)
## Technical Description
For anyone intrigued by the Amadeus prototype and keen on exploring its functionality, here's a breakdown of how the project works:
### 1. How the Prototype Works

- **Composition:** Amadeus is structured around two primary elements and two external dependencies.
- **Primary Elements:**
  - An Electron client, developed in JavaScript.
  - A Node server application, developed in TypeScript.
- **External Dependencies:**
  - GPT from OpenAI.
  - ElevenLabs.

The diagram above provides a high-level overview of the application's workflow:
1. **User Voice Input**: The user verbally communicates with the Electron client, which captures the spoken content. In version 1.0.0, the user presses the space bar to talk.
2. **Speech-to-Text**: Utilizing OpenAI's transcription capabilities, the Electron client transcribes the user's voice input into text.
3. **Communication with Node Server**: The transcribed text is relayed to the Node server, which interacts with GPT to generate a response.
4. **Receiving the Response**: The Node server sends the GPT-generated sentences back to the Electron client via a WebSocket connection.
5. **Text-to-Speech**: Upon receiving the text response, the Electron client collaborates with the ElevenLabs API to convert the text into spoken words.
6. **Playback**: The Electron client then sequentially plays back the generated voice messages to the user.
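Steps 3–5 imply that the server forwards GPT output to the client sentence by sentence, so text-to-speech can start before the full reply is ready. A minimal sketch of such sentence splitting is shown below; the function name and the exact boundary rule are illustrative assumptions, not code from the repository.

```javascript
// Hypothetical helper: split a streamed GPT response buffer into
// complete sentences, so each one can be sent over the WebSocket
// (and on to TTS) as soon as it is ready. Returns the complete
// sentences plus the unfinished remainder to keep buffering.
function extractSentences(buffer) {
  const sentences = [];
  let rest = buffer;
  // Assumed rule: a sentence ends at '.', '!' or '?' followed by
  // whitespace or the end of the input.
  const boundary = /^[^.!?]*[.!?]+(?:\s+|$)/;
  let match;
  while ((match = boundary.exec(rest)) !== null) {
    sentences.push(match[0].trim());
    rest = rest.slice(match[0].length);
  }
  return { sentences, rest };
}
```

For example, `extractSentences("Hello there. How are you? I am")` yields the two complete sentences and keeps `"I am"` buffered until more tokens arrive.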
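Step 6 requires playing clips strictly one at a time and in arrival order, even though synthesis finishes at unpredictable times. A minimal sketch of such a playback queue follows; the class name and shape are assumptions for illustration, not the repository's actual client code.

```javascript
// Hypothetical sketch of the client-side playback queue: audio clips
// arrive as ElevenLabs finishes synthesizing them, but must be played
// back sequentially, in order.
class PlaybackQueue {
  constructor(play) {
    this.play = play;     // async function that plays one clip to completion
    this.queue = [];
    this.playing = false;
  }

  enqueue(clip) {
    this.queue.push(clip);
    if (!this.playing) this.drain();
  }

  async drain() {
    this.playing = true;
    while (this.queue.length > 0) {
      // Wait for the current clip to finish before starting the next.
      await this.play(this.queue.shift());
    }
    this.playing = false;
  }
}
```

Clips enqueued while one is playing simply wait their turn; `drain()` picks them up without starting overlapping playback.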
## Contributors (GitHub)
- [@MaloLM](https://github.com/MaloLM/)
- [@Kévin Chea](https://github.com/Kevin-Chea/)

## CONTRIBUTE
We welcome contributions from everyone, irrespective of their origin. Here's how you can help:
1. **Understand our Code of Conduct**: Before contributing, please read our [Code of Conduct](./CODE_OF_CONDUCT.md). We expect all our contributors to adhere to it.
2. **Start with your First Issues**: If you're new to the project, we recommend starting with the issues described in [the issues guide](./ISSUES_GUIDE.md).
3. **Read the Getting Started guide**: Detailed instructions on setting up the dev environment and contributing can be found in our [Getting Started Guide](./GETTING_STARTED.md).
Remember, every contribution, whether it's code, documentation, translations, or even reporting bugs, is valuable to us. Let's make this project better together!
## CONNECT WITH PROJECT OWNER