Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/fast1-labs/ai-voice-assistant
AI assistant that helps you anytime, anywhere
- Host: GitHub
- URL: https://github.com/fast1-labs/ai-voice-assistant
- Owner: Fast1-Labs
- Created: 2024-11-24T10:37:46.000Z (about 2 months ago)
- Default Branch: main
- Last Pushed: 2024-12-01T13:19:33.000Z (about 1 month ago)
- Last Synced: 2024-12-18T12:42:12.723Z (23 days ago)
- Topics: ai, axios, openai, react-native, speech-recognition, speech-to-text, text-to-speech
- Language: TypeScript
- Homepage:
- Size: 1 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# AI-Powered Assistant
A personal voice-enabled AI assistant that answers your queries using GPT-4 for language understanding and the Whisper API for voice recognition. It responds in text and can also read replies aloud with text-to-speech. The assistant is currently in closed testing and is optimized for accessibility and convenience.
## Features
- **Voice Recognition**: Powered by the Whisper API, the assistant transcribes your spoken input into text.
- **Text and Speech Responses**: Answers are delivered in text format and optionally via text-to-speech for a conversational experience.
- **AI-Powered Assistance**: Built with GPT-4, it provides intelligent responses to a wide range of queries.
- **Seamless Interaction**: Natural and fluent interaction makes it a handy assistant for day-to-day tasks, learning, or leisure.
- **Accessibility Optimized**: Suitable for users who prefer audio assistance or hands-free operation.

---
## Technical Details
### GPT-4 for Intelligent Responses
- **Model**: GPT-4 by OpenAI
- **Capabilities**: Advanced natural language understanding for complex queries and conversational depth.

### Whisper API for Voice Recognition
- **Model**: OpenAI Whisper
- **Functionality**: Converts user speech to text for processing by the AI assistant.

### Text-to-Speech (TTS)
- **Feature**: Converts AI-generated responses into spoken words for an auditory experience.
- **Use Case**: Ideal for hands-free tasks or accessibility.

---
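The three stages described under Technical Details (speech to text, GPT-4 reply, text to speech) can be sketched as a set of request builders. This is a minimal illustration, not the project's actual code: the function names (`buildTranscriptionFields`, `buildChatRequest`, `buildSpeechRequest`, `extractReply`) are hypothetical, and it assumes the app talks to OpenAI's public REST endpoints with the `whisper-1`, `gpt-4`, and `tts-1` models. The README does not say which TTS engine is used, so the speech step in particular is an assumption.

```typescript
// Sketch only: endpoint paths and model names follow OpenAI's public REST API.
// How the app actually wires these calls together is an assumption.

type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

interface JsonRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

const OPENAI_BASE_URL = "https://api.openai.com/v1";

// Step 1: speech to text. These are the text fields of the multipart form sent to
// `${OPENAI_BASE_URL}/audio/transcriptions`; the caller attaches the recording
// as the "file" field.
function buildTranscriptionFields(language?: string): Record<string, string> {
  const fields: Record<string, string> = { model: "whisper-1" };
  if (language) {
    fields["language"] = language; // optional ISO-639-1 hint, e.g. "en"
  }
  return fields;
}

// Step 2: send the transcript, plus any earlier turns, to GPT-4.
function buildChatRequest(
  apiKey: string,
  history: ChatMessage[],
  transcript: string
): JsonRequest {
  const messages: ChatMessage[] = [...history, { role: "user", content: transcript }];
  return {
    url: `${OPENAI_BASE_URL}/chat/completions`,
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ model: "gpt-4", messages }),
  };
}

// Pull the assistant's text out of a chat completion response.
// Returns an empty string when the response carries no choices.
function extractReply(response: {
  choices: { message: { content: string | null } }[];
}): string {
  return response.choices[0]?.message.content ?? "";
}

// Step 3: request spoken audio for the reply (assumed TTS backend).
function buildSpeechRequest(
  apiKey: string,
  replyText: string,
  voice: string = "alloy"
): JsonRequest {
  return {
    url: `${OPENAI_BASE_URL}/audio/speech`,
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ model: "tts-1", input: replyText, voice }),
  };
}

// Usage: build the GPT-4 request for a transcribed question.
const exampleChat = buildChatRequest("sk-placeholder", [], "What can you help me with?");
console.log(exampleChat.url);
```

Each builder returns a plain object, so it can be handed to `axios` or `fetch` in the React Native app and unit-tested without network access. Keeping `voice` as a parameter also leaves room for the custom voice option listed under Future Plans.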
## Features Overview
- **Voice Input**: Interact with the assistant using your voice.
- **Multi-Modal Replies**: Receive text and audio responses.
- **Wide Knowledge Base**: Ask about any topic; the assistant provides comprehensive answers.
- **User-Friendly**: Designed for intuitive and fluid user interaction.
- **Hands-Free Mode**: Perfect for multitasking or users with accessibility needs.

---
## Screenshots
![Screenshot 1](https://github.com/user-attachments/assets/ffd1c94b-864b-4e92-a295-044396106ecb)
![Screenshot 2](https://github.com/user-attachments/assets/70e12e5a-acb6-44d1-aba9-b844f3766451)
![Screenshot 3](https://github.com/user-attachments/assets/760bab10-4b53-469d-9d9d-d0a14f10715b)
![Screenshot 4](https://github.com/user-attachments/assets/1e14fe03-1897-4511-b1c8-ba3764ae6890)
![Screenshot 5](https://github.com/user-attachments/assets/36f1e2b2-d531-4de9-8bdb-e4e32314d3f0)

---
## Future Plans
- **Custom Voice Options**: Allow users to select their preferred voice for TTS responses.
- **Offline Mode**: Enable limited functionality without an internet connection.
- **Expanded Language Support**: Support for multiple languages for global accessibility.

---