https://github.com/tensorboy/hawkeye
Prompt Free, Soul Driven AI Assistant
ai-assistant ai-for-everyone prompt-free soul-driven
- Host: GitHub
- URL: https://github.com/tensorboy/hawkeye
- Owner: tensorboy
- License: mit
- Created: 2026-01-20T21:41:42.000Z (30 days ago)
- Default Branch: main
- Last Pushed: 2026-02-02T01:14:17.000Z (18 days ago)
- Last Synced: 2026-02-02T11:10:44.545Z (17 days ago)
- Topics: ai-assistant, ai-for-everyone, prompt-free, soul-driven
- Language: TypeScript
- Homepage: https://www.hawkiyi.com
- Size: 49.5 MB
- Stars: 28
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README

# Hawkeye
### The First Proactive AI Assistant for Desktop
**AI that enhances your story. Watch keenly. Act thoughtfully. 10x your productivity.**
⌘ + ⇧ + H to observe your screen instantly
[Website](https://hawkiyi.com) · [Documentation](https://hawkiyi.com/docs) · [Report Bug](https://github.com/tensorboy/hawkeye/issues) · [Request Feature](https://github.com/tensorboy/hawkeye/issues)



---
## What is Hawkeye?
> **Traditional AI waits for your commands. Hawkeye watches and helps proactively.**
Hawkeye is an **AI-powered desktop assistant** that observes your work environment (screen, clipboard, and files) and proactively offers suggestions. No prompts needed.
The AI behind Hawkeye is designed to **enhance your own story**: it turns your screen time into meaningful personal growth by automatically mapping your goals, habits, and progress into a living **Life Tree**.
| Feature | Copilot / Cursor / Claude Code | **Hawkeye** |
|---------|-------------------------------|-------------|
| **Mode** | Reactive (you ask) | **Proactive** (it watches) |
| **Scope** | Code only | Everything: coding, browsing, writing |
| **Privacy** | Cloud-based | **Local-first**, your data stays local |
| **Control** | AI executes | **You decide** what to execute |
## Key Features
### Zero-Prompt Intelligence
- Automatically understands your context
- No need to explain what you're doing
- Suggests actions before you ask
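### Privacy-First Architecture
- All perception runs **100% locally**
- Raw screen, clipboard, and file data never leave your device; only minimal context text is sent if you choose a cloud AI provider
- Works fully offline with local LLMs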
### Smart Task Tracking
- Identifies your main task goal
- Generates actionable next steps
- Learns from your workflow
### Multi-Platform Sync
- Desktop ↔ Browser seamless sync
- VS Code extension integration
- Cross-app workflow automation
### Life Tree: AI Enhances Your Story
- Automatically maps your activities into life stages, goals, and tasks
- Proposes micro-experiments to optimize your habits and workflows
- Graduated experiment phases: task → goal → automation
- Your AI companion that turns screen time into personal growth
## Quick Start
### Download
| Platform | Download |
|----------|----------|
| macOS | [Apple Silicon (.dmg)](https://github.com/tensorboy/hawkeye/releases/latest) · [Intel (.dmg)](https://github.com/tensorboy/hawkeye/releases/latest) |
| Windows | [Installer (.exe)](https://github.com/tensorboy/hawkeye/releases/latest) |
| Linux | [Debian/Ubuntu (.deb)](https://github.com/tensorboy/hawkeye/releases/latest) · [AppImage](https://github.com/tensorboy/hawkeye/releases/latest) |
⚠️ **macOS: "App is damaged" fix**
```bash
# Recursively clear extended attributes (including the quarantine flag)
xattr -cr /Applications/Hawkeye.app
```
### Setup in 60 Seconds
```bash
# 1. Clone
git clone https://github.com/tensorboy/hawkeye.git && cd hawkeye
# 2. Install
pnpm install
# 3. Run
pnpm dev
```
### Configure AI Provider
**Option 1: Google Gemini (recommended, free tier)**
1. Get a free API key at [aistudio.google.com/apikey](https://aistudio.google.com/apikey)
2. Enter your key in Settings → Gemini API Key
3. Model defaults to `gemini-2.0-flash` (1M context window)
**Option 2: OpenAI-Compatible API**
Works with OpenAI, DeepSeek, Groq, Together AI, or any OpenAI-compatible endpoint.
Set your base URL, API key, and model name in Settings.
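As a rough illustration of how those three settings fit together, the sketch below builds a request for the standard OpenAI-style `/chat/completions` endpoint. The function name `buildChatRequest` and the `ChatRequest` shape are hypothetical and are not part of Hawkeye's API; the host in the usage note is a placeholder.

```typescript
// Hypothetical helper: shows how base URL, API key, and model name combine
// into an OpenAI-compatible chat request. Not Hawkeye's actual code.
interface ChatRequest {
  url: string;
  init: {
    method: string;
    headers: Record<string, string>;
    body: string;
  };
}

function buildChatRequest(
  baseUrl: string,
  apiKey: string,
  model: string,
  prompt: string,
): ChatRequest {
  return {
    // Strip trailing slashes so ".../v1/" and ".../v1" behave the same.
    url: `${baseUrl.replace(/\/+$/, "")}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({
        model,
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}
```

The result can be passed straight to `fetch(request.url, request.init)`. Switching providers then only means changing the base URL, key, and model name.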
**Option 3: Local LLM with node-llama-cpp (100% Offline)**
Download a GGUF model and set the model path in Settings. Supports Metal GPU acceleration on macOS.
Recommended models:
- **Qwen 2.5 7B**: general purpose (4.7 GB)
- **Llama 3.2 3B**: lightweight (2.0 GB)
- **LLaVA 1.6 7B**: vision support (4.5 GB)
**Option 4: Ollama (Legacy)**
```bash
brew install ollama && ollama pull qwen3:8b
```
Select "Ollama" in Hawkeye settings.
## Architecture
```
┌─────────────────────────────────────────────────────────────────┐
│                         HAWKEYE ENGINE                          │
├─────────────────────────────────────────────────────────────────┤
│                                                                 │
│   ┌─────────────┐    ┌─────────────┐    ┌─────────────┐         │
│   │ PERCEPTION  │───▶│  REASONING  │───▶│  EXECUTION  │         │
│   │   Engine    │    │   Engine    │    │   Engine    │         │
│   └──────┬──────┘    └──────┬──────┘    └──────┬──────┘         │
│          │                  │                  │                │
│   • Screen OCR       • Claude/Ollama    • Shell Commands        │
│   • Clipboard        • Task Analysis    • File Operations       │
│   • File Watch       • Intent Detect    • App Control           │
│   • Window Track     • Suggestions      • Browser Auto          │
│                                                                 │
├─────────────────────────────────────────────────────────────────┤
│                           INTERFACES                            │
├───────────────┬───────────────┬───────────────┬─────────────────┤
│ Desktop       │ VS Code       │ Chrome        │ Core            │
│ (Electron)    │ Extension     │ Extension     │ (npm pkg)       │
└───────────────┴───────────────┴───────────────┴─────────────────┘
```
### Future: Multi-Modal HCI Pipeline
Hawkeye is evolving into a full multi-modal human-computer interaction system that combines **audio understanding**, **visual perception**, and **gesture control**.
```
HAWKEYE MULTI-MODAL HCI PIPELINE

INPUT LAYER
  Camera ───▶ MediaPipe Holistic
    • Face: 468 landmarks
    • Pose: 33 keypoints
    • Hands: 21 × 2 keypoints
  Microphone ───▶ Silero VAD ───▶ Audio Buffer
        │
        ▼
VISUAL PROCESSING (camera branch)
  Face Tracker
    ├─ Multi-face detection
    ├─ Face ID assignment
    └─ Lip movement analysis
  Gesture Recognizer
    ├─ Hand pose classification
    ├─ Dynamic gesture detect
    └─ Custom gesture mapping

AUDIO PROCESSING (microphone branch)
  DiariZen / Pyannote
    ├─ Speaker diarization
    ├─ "Who is speaking?"
    └─ Speaker embeddings
  Whisper (smart-whisper)
    ├─ Speech-to-text
    ├─ Language detection
    └─ Timestamp alignment
        │
        ▼
FUSION & MATCHING LAYER
  Audio-Visual Matching
    ├─ Lip-sync correlation (whose lips match the audio?)
    ├─ Face-voice association (learn speaker identity)
    └─ Active speaker detection (LoCoNet / AS-Net)
  Context Aggregation
    ├─ Combine: transcription + speaker ID + face ID + gesture
    └─ Generate unified interaction events
        │
        ▼
ACTION EXECUTION
  Gesture → Command Mapping
    ├─ Thumbs Up  → Confirm action
    ├─ Open Palm  → Pause / Stop
    ├─ Point Up   → Scroll up
    ├─ Point Down → Scroll down
    ├─ Victory    → Screenshot
    ├─ Pinch      → Zoom in/out
    └─ Swipe      → Switch window / tab
  Voice Command + Gesture = Enhanced Control
    └─ "Open browser" + Point → Open browser at pointed location
        │
        ▼
OUTPUT
  Attributed Transcription
    "Alice: Let's review the code changes"
    "Bob: I'll share my screen [pointing at screen]"
  System Control
    Mouse movement, clicks, keyboard shortcuts, app switching
  Life Tree Update
    Activity tracking, goal inference, habit analysis
```
**Key Technologies:**
| Component | Technology | Status |
|-----------|------------|--------|
| Voice Activity Detection | Silero VAD | Planned |
| Speech-to-Text | Whisper (smart-whisper) | Implemented |
| Speaker Diarization | DiariZen / Pyannote | Research |
| Active Speaker Detection | LoCoNet (CVPR 2024) | Research |
| Body Tracking | MediaPipe Holistic | Planned |
| Gesture Recognition | MediaPipe Gesture | Planned |
| Face-Voice Matching | Custom Fusion | Research |
## Project Structure
```
hawkeye/
├── packages/
│   ├── core/                 # Core engine (local processing)
│   │   ├── perception/       # Screen, clipboard, file monitoring
│   │   ├── ai/               # AI providers (Claude, Ollama, etc.)
│   │   ├── execution/        # Action execution system
│   │   └── storage/          # Local database (SQLite)
│   │
│   ├── desktop/              # Electron desktop app
│   ├── vscode-extension/     # VS Code extension
│   └── chrome-extension/     # Chrome browser extension
│
├── docs/                     # Documentation
└── website/                  # Marketing site
```
## Privacy & Security
| Aspect | How We Protect You |
|--------|-------------------|
| **Screenshots** | ✅ Analyzed locally, never uploaded |
| **Clipboard** | ✅ Processed on-device only |
| **Files** | ✅ Monitored locally, paths never sent |
| **AI Calls** | ✅ Only minimal context text sent (or use local LLM) |
| **Dangerous Ops** | ✅ Always requires your confirmation |
> All data is stored in `~/.hawkeye/`, so you own your data.
## Usage Examples
### As a Library
```typescript
import { HawkeyeEngine } from '@hawkeye/core';

const engine = new HawkeyeEngine({
  provider: 'ollama',
  model: 'qwen3:8b'
});

// Get AI-powered suggestions based on current context
const suggestions = await engine.observe();

// Execute the first suggestion (with user confirmation), if there is one
if (suggestions.length > 0) {
  await engine.execute(suggestions[0].id);
}
```
### File Watcher
```typescript
import { FileWatcher } from '@hawkeye/core';

// Watch two folders for newly created or moved files
const watcher = new FileWatcher({
  paths: ['~/Downloads', '~/Documents'],
  events: ['create', 'move']
});

watcher.on('change', (event) => {
  console.log(`${event.type}: ${event.path}`);
});
```
## Advanced Features
### Exponential Backoff Retry
AI provider calls are retried with exponential backoff so that transient failures are handled gracefully. Random jitter spreads the retries out, which avoids thundering-herd effects when many calls fail at once.
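A minimal sketch of this retry pattern follows, using the common "full jitter" variant. The names `backoffDelayMs` and `retryWithBackoff` and the option fields are assumptions made for illustration; Hawkeye's actual retry code may differ.

```typescript
// Hypothetical helper names; not taken from the Hawkeye codebase.
interface BackoffOptions {
  baseMs: number;      // delay ceiling for the first retry
  maxMs: number;       // upper bound on any single delay
  maxAttempts: number; // total tries, including the first call
}

// Full jitter: a random delay in [0, min(maxMs, baseMs * 2^attempt)).
function backoffDelayMs(
  attempt: number,
  opts: BackoffOptions,
  random: () => number = Math.random,
): number {
  const ceiling = Math.min(opts.maxMs, opts.baseMs * Math.pow(2, attempt));
  return Math.floor(random() * ceiling);
}

async function retryWithBackoff<T>(
  call: () => Promise<T>,
  opts: BackoffOptions,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T> {
  let lastError: unknown;
  for (let attempt = 0; attempt < opts.maxAttempts; attempt++) {
    try {
      return await call();
    } catch (err) {
      lastError = err;
      // Do not sleep after the final failed attempt.
      if (attempt < opts.maxAttempts - 1) {
        await sleep(backoffDelayMs(attempt, opts));
      }
    }
  }
  throw lastError;
}
```

Because each caller draws its own random delay, clients that failed at the same moment are unlikely to retry at the same moment. The `random` and `sleep` parameters are injectable only to make the helper easy to test.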
### SQLite FTS5 Full-Text Search
Context history (window titles, clipboard, OCR text) is indexed with SQLite FTS5 for fast full-text search across all recorded observations.
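To make the indexing idea concrete, here is a sketch of what an FTS5 table and query for such observations could look like. The table name `observations`, its columns, and the helper `toFtsQuery` are assumptions for illustration, not Hawkeye's real schema. The SQL strings are meant to be passed to whichever SQLite driver is in use.

```typescript
// Assumed schema; table and column names are illustrative only.
const CREATE_INDEX_SQL = `
  CREATE VIRTUAL TABLE IF NOT EXISTS observations
  USING fts5(window_title, clipboard, ocr_text);
`;

// Relevance-ranked lookup; the single parameter is an FTS5 query string.
const SEARCH_SQL = `
  SELECT rowid, window_title
  FROM observations
  WHERE observations MATCH ?
  ORDER BY rank
  LIMIT 20;
`;

// Turn free-form user input into a safe FTS5 query. Each word becomes a
// quoted string (so FTS5 operators in the input are treated as plain text),
// and the last word becomes a prefix match for search-as-you-type.
function toFtsQuery(input: string): string {
  const words = input.trim().split(/\s+/).filter((w) => w.length > 0);
  return words
    .map((w, i) => {
      const quoted = `"${w.replace(/"/g, '""')}"`;
      return i === words.length - 1 ? `${quoted}*` : quoted;
    })
    .join(" ");
}
```

An empty result from `toFtsQuery` means there is nothing to search for, so the caller should skip the query rather than pass an empty string to `MATCH`.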
### Adaptive Refresh Rate
The observation interval adjusts dynamically to user activity: fast polling when active and slow polling when idle, which saves CPU and battery.
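One simple way to implement this is to derive the polling interval from the time since the last user input. The thresholds and the name `nextIntervalMs` below are made-up values for illustration; the intervals Hawkeye actually uses are not stated here.

```typescript
// Illustrative constants (assumptions, not Hawkeye's real settings).
const ACTIVE_INTERVAL_MS = 2000;     // poll every 2 s while the user is active
const IDLE_INTERVAL_MS = 30000;      // poll every 30 s once fully idle
const IDLE_AFTER_MS = 60000;         // start slowing down after 1 min idle
const FULLY_IDLE_AFTER_MS = 300000;  // slowest rate after 5 min idle

// Map "milliseconds since last input" to the next observation interval.
function nextIntervalMs(idleMs: number): number {
  if (idleMs <= IDLE_AFTER_MS) return ACTIVE_INTERVAL_MS;
  if (idleMs >= FULLY_IDLE_AFTER_MS) return IDLE_INTERVAL_MS;
  // Linear ramp between the two thresholds avoids an abrupt jump.
  const t = (idleMs - IDLE_AFTER_MS) / (FULLY_IDLE_AFTER_MS - IDLE_AFTER_MS);
  return Math.round(
    ACTIVE_INTERVAL_MS + t * (IDLE_INTERVAL_MS - ACTIVE_INTERVAL_MS),
  );
}
```

A scheduler would call `nextIntervalMs` after each observation and pass the result to `setTimeout`, so any new input immediately brings the rate back up on the next tick.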
### Priority Task Queue
A priority-based task queue with deduplication ensures that AI requests and plan executions are processed efficiently without duplicate work.
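The sketch below shows one way such a queue could behave: tasks carry a deduplication key, and a second task with the same key is rejected while the first is still pending. The class name `DedupPriorityQueue` and the `QueuedTask` fields are hypothetical. A sorted array is used for brevity; a binary heap would scale better for large queues.

```typescript
// Hypothetical types; not Hawkeye's actual queue implementation.
interface QueuedTask {
  key: string;      // identity used for deduplication
  priority: number; // higher values run first
  run: () => void;
}

class DedupPriorityQueue {
  private tasks: QueuedTask[] = [];
  private keys = new Set<string>();

  // Returns false when a task with the same key is already pending.
  enqueue(task: QueuedTask): boolean {
    if (this.keys.has(task.key)) return false;
    this.keys.add(task.key);
    // Insert before the first task with strictly lower priority,
    // which keeps FIFO order among tasks of equal priority.
    const index = this.tasks.findIndex((t) => t.priority < task.priority);
    if (index === -1) this.tasks.push(task);
    else this.tasks.splice(index, 0, task);
    return true;
  }

  // Removes and returns the highest-priority pending task, if any.
  dequeue(): QueuedTask | undefined {
    const task = this.tasks.shift();
    if (task) this.keys.delete(task.key);
    return task;
  }

  get size(): number {
    return this.tasks.length;
  }
}
```

A key could be, for example, a hash of the prompt plus the active window title, so that the same screen state does not trigger the same AI request twice.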
### MCP Server Tools
Hawkeye exposes 15+ tools via MCP (Model Context Protocol) for screen perception, window management, file organization, and automation.
### Safety Guardrails
An agent monitor enforces cost limits, blocks dangerous operations (e.g. `rm -rf /`), requires confirmation for risky actions, and supports a sandbox mode.
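As a toy illustration of the block / confirm / allow decision, the sketch below classifies a shell command by its program name and arguments. The function `classifyCommand`, the `Verdict` type, and the program list are assumptions for this example only. It does not parse shell quoting, pipes, or subshells, so it should not be read as a real security boundary.

```typescript
type Verdict = "block" | "confirm" | "allow";

// Illustrative list of programs that always need user confirmation.
const CONFIRM_PROGRAMS = new Set(["mv", "chmod", "chown", "dd", "kill"]);

function isRecursiveFlag(arg: string): boolean {
  if (arg === "--recursive") return true;
  // Short flag clusters such as -r, -R, -rf, -fr
  return /^-[a-zA-Z]+$/.test(arg) && /[rR]/.test(arg);
}

function classifyCommand(command: string): Verdict {
  const tokens = command.trim().split(/\s+/).filter((t) => t.length > 0);
  if (tokens.length === 0) return "allow";
  const [program, ...args] = tokens;

  // sudo never lowers the risk: classify the inner command, then
  // require at least a confirmation.
  if (program === "sudo") {
    const inner = classifyCommand(args.join(" "));
    return inner === "block" ? "block" : "confirm";
  }

  if (program === "rm") {
    const recursive = args.some(isRecursiveFlag);
    const hitsRoot = args.some((a) => a === "/" || a === "/*");
    // Recursive deletion of the filesystem root is refused outright;
    // every other deletion needs explicit approval.
    return recursive && hitsRoot ? "block" : "confirm";
  }

  return CONFIRM_PROGRAMS.has(program) ? "confirm" : "allow";
}
```

A production guardrail would more likely combine an allow-list with sandboxing than rely on pattern checks like these.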
### Menu Bar Panel
A macOS-style popover panel accessible from the system tray provides quick actions, recent activity feed, and real-time module status indicators.
### Provider Unified Protocol
All AI providers declare their capabilities (chat, vision, streaming, function calling), enabling intelligent routing and health monitoring across providers.
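Capability-based routing can be sketched as a filter over declared capabilities and health status. The `ProviderInfo` shape and the `pickProvider` function below are hypothetical and only show the idea; the real protocol may track more state, such as latency or cost.

```typescript
type Capability = "chat" | "vision" | "streaming" | "functionCalling";

// Hypothetical provider descriptor; field names are assumptions.
interface ProviderInfo {
  name: string;
  capabilities: ReadonlySet<Capability>;
  healthy: boolean; // result of the most recent health check
}

// Returns the first healthy provider that supports every required
// capability. Array order acts as the preference order.
function pickProvider(
  providers: ProviderInfo[],
  required: Capability[],
): ProviderInfo | undefined {
  return providers.find(
    (p) => p.healthy && required.every((c) => p.capabilities.has(c)),
  );
}
```

For example, a request that includes a screenshot would pass `["chat", "vision"]`, so a text-only local model is skipped even when it is listed first.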
## Roadmap
- [x] Core perception engine
- [x] Desktop app (Electron)
- [x] VS Code extension
- [x] Chrome extension
- [x] Local LLM support (Ollama, node-llama-cpp)
- [x] Multi-provider AI (Gemini, OpenAI-compatible, LlamaCpp)
- [x] Provider unified protocol with capability routing
- [x] Streaming and health check support
- [x] SQLite FTS5 full-text search
- [x] Exponential backoff retry strategy
- [x] Adaptive refresh rate
- [x] Priority task queue
- [x] MCP Server with 15+ tools
- [x] Safety guardrails and agent monitoring
- [x] Menu bar panel (macOS-style popover)
- [x] Life Tree: AI maps your life journey and enhances your story
- [ ] Desktop ↔ Extension real-time sync
- [ ] Plugin system
- [ ] Custom workflow builder
- [ ] Mobile companion app
## Contributing
Contributions are what make the open source community amazing! Any contributions you make are **greatly appreciated**.
1. Fork the Project
2. Create your Feature Branch (`git checkout -b feature/AmazingFeature`)
3. Commit your Changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the Branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request
See [CONTRIBUTING.md](CONTRIBUTING.md) for detailed guidelines.
## License
Distributed under the MIT License. See [LICENSE](LICENSE) for more information.
---
**[Website](https://hawkiyi.com)** · **[Docs](https://hawkiyi.com/docs)** · **[Twitter](https://twitter.com/hawkeyeai)** · **[Discord](https://discord.gg/hawkeye)**
Built with ❤️ by the Hawkeye Team
**If Hawkeye helps you, please consider giving it a ⭐**
