Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/fast1-labs/ai-voice-assistant

AI Assistant that helps with you anytime anywhere
https://github.com/fast1-labs/ai-voice-assistant

ai axios openai react-native speech-recognition speech-to-text text-to-speech

Last synced: 1 day ago
JSON representation

AI Assistant that helps with you anytime anywhere

Host: GitHub
URL: https://github.com/fast1-labs/ai-voice-assistant
Owner: Fast1-Labs
Created: 2024-11-24T10:37:46.000Z (about 2 months ago)
Default Branch: main
Last Pushed: 2024-12-01T13:19:33.000Z (about 1 month ago)
Last Synced: 2024-12-18T12:42:12.723Z (23 days ago)
Topics: ai, axios, openai, react-native, speech-recognition, speech-to-text, text-to-speech
Language: TypeScript
Homepage:
Size: 1 MB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# AI-Powered Assistant

Your personal voice-enabled AI assistant, designed to assist you with any query using advanced GPT-4 language understanding and Whisper API for seamless voice recognition. It provides text-based responses and can also reply using a text-to-speech feature for an immersive and interactive experience. Currently in closed testing, the assistant is optimized for accessibility and convenience.

## Features

- **Voice Recognition**: Powered by Whisper API, the assistant can accurately process your spoken inputs.
- **Text and Speech Responses**: Answers are delivered in text format and optionally via text-to-speech for a conversational experience.
- **AI-Powered Assistance**: Built with GPT-4, it provides intelligent responses to a wide range of queries.
- **Seamless Interaction**: Natural and fluent interaction makes it a handy assistant for day-to-day tasks, learning, or leisure.
- **Accessibility Optimized**: Suitable for users who prefer audio assistance or hands-free operation.

---

## Technical Details

### GPT-4 for Intelligent Responses
- **Model**: GPT-4 by OpenAI
- **Capabilities**: Advanced natural language understanding for complex queries and conversational depth.

### Whisper API for Voice Recognition
- **Model**: OpenAI Whisper
- **Functionality**: Converts user speech to text for processing by the AI assistant.

### Text-to-Speech (TTS)
- **Feature**: Converts AI-generated responses into spoken words for an auditory experience.
- **Use Case**: Ideal for hands-free tasks or accessibility.

---

## Features Overview

- **Voice Input**: Interact with the assistant using your voice.
- **Multi-Modal Replies**: Receive text and audio responses.
- **Wide Knowledge Base**: Ask about any topic; the assistant provides comprehensive answers.
- **User-Friendly**: Designed for intuitive and fluid user interaction.
- **Hands-Free Mode**: Perfect for multitasking or users with accessibility needs.

---

## Screenshots

![Screenshot 1](https://github.com/user-attachments/assets/ffd1c94b-864b-4e92-a295-044396106ecb)
![Screenshot 2](https://github.com/user-attachments/assets/70e12e5a-acb6-44d1-aba9-b844f3766451)
![Screenshot 3](https://github.com/user-attachments/assets/760bab10-4b53-469d-9d9d-d0a14f10715b)
![Screenshot 4](https://github.com/user-attachments/assets/1e14fe03-1897-4511-b1c8-ba3764ae6890)
![Screenshot 5](https://github.com/user-attachments/assets/36f1e2b2-d531-4de9-8bdb-e4e32314d3f0)

---

## Future Plans

- **Custom Voice Options**: Allow users to select their preferred voice for TTS responses.
- **Offline Mode**: Enable limited functionality without an internet connection.
- **Expanded Language Support**: Support for multiple languages for global accessibility.

---