An open API service indexing awesome lists of open source software.

https://github.com/mitesh-kumavat/sightmate

SightMate: AI-Powered Companion for the Visually Impaired.
https://github.com/mitesh-kumavat/sightmate

ai fastapi groq-api nextjs python react sqlite tts-api typescript uvicorn webscraping

Last synced: 2 days ago
JSON representation

SightMate: AI-Powered Companion for the Visually Impaired.

Awesome Lists containing this project

README

          

# πŸš€ **SightMate – Your AI-Powered Companion for the Visually Impaired**
Visually impaired individuals face significant challenges navigating public spaces, accessing information, and staying informed. SightMate solves this by combining vision, audio, and language models into an accessible AI-first platform.

---

## 🎯 Objective

SightMate is an AI-driven, voice-first assistant designed to empower blind and visually impaired individuals. It helps users with:

- Real-time road guidance
- Document & currency reading
- Personalized voice interactions
- Artistic scene understanding
- Daily news summaries

**Approach:**
Blind people deserve a modern, reliable, voice-first experience that goes beyond basic OCR or TTS. SightMate combines powerful LLMs with real-time computer vision and speech processing to truly assist in daily life.

---

## πŸ› οΈ Tech Stack

### πŸ“¦ Core Stack:
- **Frontend:** Next.js, TailwindCSS, ShadCN, Framer Motion
- **Backend:** FastAPI (Python)
- **Database:** SQLite
- **Hosting:** Vercel (Frontend) + Render (Backend)
- **Groq:** Used for ultra-fast inference with:
- `LLaVA` for image-based understanding
- `Mixtral` for LLM-based Q&A
- `TTS` for expressive voice generation

---

## ✨ Key Features

### πŸ›£οΈ Real-Time Scene Monitoring & Road Guidance
- Live camera stream interpreted using LLaVA
- Alerts user with audio feedback about obstacles or road conditions

### πŸ“° Daily News Summarizer
- Fetches real-time news
- Summarizes with Mixtral LLM
- Reads out top headlines in seconds

### πŸ“„ Document & Handwriting Reader
- Users show documents to the camera
- Extracted, summarized, and read out loud

### πŸ’° Indian Currency Recognition
- Detects INR denominations
- Adds up total and reads out count

### 🎨 Artistic Scene Description
- AI describes camera view in poetic or creative style
- Designed to create joyful interaction with the environment

---

## πŸ“½οΈ Demo & Deliverables

- πŸŽ₯ **Demo Video:** *[YouTube](https://youtu.be/tH8MsqGeQG0)*

---

## πŸ§ͺ How to Run the Project

### Requirements
- Python β‰₯ 3.9
- Node.js β‰₯ 18.x
- Groq API Key

## 🧱 Environment Setup

- Copy the `.env.sample` file to `.env` and add your Groq API key.
- Make sure your Groq API key has enough credits to run `playai-tts` model and `meta-llama/llama-4-scout-17b-16e-instruct`
model.

### Clone the repository
```bash
git clone https://github.com/mitesh-kumavat/sightmate
cd sightmate
```

### Backend Setup
- open a terminal and run the following commands:
```bash
pip install -r requirements.txt

# Start FastAPI backend
uvicorn app.main:app --reload
```

### Frontend Setup
- open a new terminal and run the following commands:
```bash
cd frontend
npm install
npm run dev
```

---

## 🌱 Future Scope

- πŸ‘“ Smart Glasses Integration (Raspberry Pi or ESP32 Cam)
- 🧭 Indoor Navigation with Beacons
- πŸ“± Android App with voice activation
- πŸ§‘β€πŸ€β€πŸ§‘ SOS Caretaker Dashboard
- 🌐 Multi-language translation for regional adoption
- πŸ” Object Finder (e.g. β€œFind my keys”)

---

## πŸ“Ž Resources & Acknowledgements

- [Groq API](https://console.groq.com/)
- FastAPI, Uvicorn, SQLModel

---

## 🏁 Final Words

> SightMate isn’t just a project β€” it’s a mission.
> A mission to make the world more inclusive, one intelligent voice at a time.

---