https://github.com/fast1-labs/ai-voice-assistant
AI Assistant that helps with you anytime anywhere
https://github.com/fast1-labs/ai-voice-assistant
ai axios openai react-native speech-recognition speech-to-text text-to-speech
Last synced: over 1 year ago
JSON representation
AI Assistant that helps with you anytime anywhere
- Host: GitHub
- URL: https://github.com/fast1-labs/ai-voice-assistant
- Owner: Fast1-Labs
- Created: 2024-11-24T10:37:46.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-12-01T13:19:33.000Z (over 1 year ago)
- Last Synced: 2025-01-08T23:06:17.540Z (over 1 year ago)
- Topics: ai, axios, openai, react-native, speech-recognition, speech-to-text, text-to-speech
- Language: TypeScript
- Homepage:
- Size: 1 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# AI-Powered Assistant
Your personal voice-enabled AI assistant, designed to assist you with any query using advanced GPT-4 language understanding and Whisper API for seamless voice recognition. It provides text-based responses and can also reply using a text-to-speech feature for an immersive and interactive experience. Currently in closed testing, the assistant is optimized for accessibility and convenience.
## Features
- **Voice Recognition**: Powered by Whisper API, the assistant can accurately process your spoken inputs.
- **Text and Speech Responses**: Answers are delivered in text format and optionally via text-to-speech for a conversational experience.
- **AI-Powered Assistance**: Built with GPT-4, it provides intelligent responses to a wide range of queries.
- **Seamless Interaction**: Natural and fluent interaction makes it a handy assistant for day-to-day tasks, learning, or leisure.
- **Accessibility Optimized**: Suitable for users who prefer audio assistance or hands-free operation.
---
## Technical Details
### GPT-4 for Intelligent Responses
- **Model**: GPT-4 by OpenAI
- **Capabilities**: Advanced natural language understanding for complex queries and conversational depth.
### Whisper API for Voice Recognition
- **Model**: OpenAI Whisper
- **Functionality**: Converts user speech to text for processing by the AI assistant.
### Text-to-Speech (TTS)
- **Feature**: Converts AI-generated responses into spoken words for an auditory experience.
- **Use Case**: Ideal for hands-free tasks or accessibility.
---
## Features Overview
- **Voice Input**: Interact with the assistant using your voice.
- **Multi-Modal Replies**: Receive text and audio responses.
- **Wide Knowledge Base**: Ask about any topic; the assistant provides comprehensive answers.
- **User-Friendly**: Designed for intuitive and fluid user interaction.
- **Hands-Free Mode**: Perfect for multitasking or users with accessibility needs.
---
## Screenshots





---
## Future Plans
- **Custom Voice Options**: Allow users to select their preferred voice for TTS responses.
- **Offline Mode**: Enable limited functionality without an internet connection.
- **Expanded Language Support**: Support for multiple languages for global accessibility.
---