https://github.com/khakans/kens-robot

Robot berbasis AI LLM OpenAI yang dapat melihat menggunakan kamera, mendeteksi wajah/contour, menggerakkan roda, dan berinteraksi dengan percakapan suara secara realtime.
https://github.com/khakans/kens-robot

ai llm raspberry-pi robot stt tts

Last synced: about 2 months ago
JSON representation

Robot berbasis AI LLM OpenAI yang dapat melihat menggunakan kamera, mendeteksi wajah/contour, menggerakkan roda, dan berinteraksi dengan percakapan suara secara realtime.

Host: GitHub
URL: https://github.com/khakans/kens-robot
Owner: khakans
Created: 2025-09-24T14:32:24.000Z (9 months ago)
Default Branch: master
Last Pushed: 2025-09-25T09:16:17.000Z (9 months ago)
Last Synced: 2025-09-25T11:27:15.057Z (9 months ago)
Topics: ai, llm, raspberry-pi, robot, stt, tts
Language: Python
Homepage:
Size: 12.7 KB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# AI Voice & Actuator Robot - Raspberry Pi 4

> Robot berbasis AI LLM OpenAI yang dapat melihat menggunakan kamera, mendeteksi wajah/contour, menggerakkan roda, dan berinteraksi dengan percakapan suara secara realtime.

---

## Fitur

- **Computer Vision**
- Deteksi wajah menggunakan OpenCV
- Deteksi kontur/obstacle untuk navigasi
- **Robotik**
- Kontrol 2 roda: maju, mundur, belok kiri/kanan
- Interface untuk aktuator motor
- **Voice Interaction**
- STT (Speech-to-Text) realtime
- TTS (Text-to-Speech) untuk membalas percakapan
- Robot dapat berbicara tanpa keyboard
- **AI**
- Integrasi OpenAI LLM untuk percakapan
- Memahami instruksi dan membalas secara natural

---

## Hardware

- Raspberry Pi 4 (4GB)
- Kamera Module / USB Webcam
- Speaker + Microphone
- Motor driver untuk 2 roda

---

## Struktur Direktori

kk-robot/
│
├── src/
│ ├── main.py # Entry point program robot
│ │
│ ├── vision/
│ │ ├── camera.py # Class Camera wrapper OpenCV
│ │ ├── detector.py # Face/Contour detection
│ │ └── room_detector.py # Opsional: deteksi ruangan/obstacle
│ │
│ ├── audio/
│ │ ├── stt_vosk.py # Realtime Speech-to-Text
│ │ └── tts.py # Text-to-Speech
│ │
│ ├── control/
│ │ └── drive.py # DifferentialDrive: kontrol motor 2 roda
│ │
│ └── llm/
│ └── llm_client.py # Wrapper OpenAI LLM API
│
├── requirements.txt # Library Python
├── README.md # Dokumentasi project
└── .gitignore

---

## Instalasi

1. **Update Raspberry Pi & Install Dependencies**

Download vosk-model-small-en-us-0.15 model from https://alphacephei.com/vosk/models

sudo apt update && sudo apt upgrade -y
sudo apt install python3-pip python3-opencv ffmpeg libatlas-base-dev -y

python3 -m venv venv
source venv/bin/activate
pip install --upgrade pip

pip install -r requirements.txt

## Running

1. **with python**
python src/main.py

2. **with script**
chmod +x start.sh
./start.sh

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/khakans/kens-robot

Awesome Lists containing this project

README