{"id":50266676,"url":"https://github.com/dortanes/atlas","last_synced_at":"2026-05-27T14:12:13.116Z","repository":{"id":344623737,"uuid":"1179088375","full_name":"dortanes/atlas","owner":"dortanes","description":"An AI-powered computer-use agent built with Electron. Automate desktop tasks by letting AI see and interact with your OS.","archived":false,"fork":false,"pushed_at":"2026-03-16T01:58:03.000Z","size":3174,"stargazers_count":2,"open_issues_count":2,"forks_count":1,"subscribers_count":0,"default_branch":"main","last_synced_at":"2026-03-16T04:24:28.534Z","etag":null,"topics":["ai-powered","computer-use","computer-use-agent","desktop-agent","electron","gemini","openai"],"latest_commit_sha":null,"homepage":"","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dortanes.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-03-11T17:16:14.000Z","updated_at":"2026-03-16T01:54:58.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/dortanes/atlas","commit_stats":null,"previous_names":["dortanes/atlas"],"tags_count":5,"template":false,"template_full_name":null,"purl":"pkg:github/dortanes/atlas","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dortanes%2Fatlas","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dortanes%2Fatlas/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dortanes%2Fatlas/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dortanes%2Fatlas/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dortanes","download_url":"https://codeload.github.com/dortanes/atlas/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dortanes%2Fatlas/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33569282,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-05-27T02:00:06.184Z","response_time":53,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai-powered","computer-use","computer-use-agent","desktop-agent","electron","gemini","openai"],"created_at":"2026-05-27T14:12:08.909Z","updated_at":"2026-05-27T14:12:13.111Z","avatar_url":"https://github.com/dortanes.png","language":"TypeScript","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\n  \u003cimg src=\"build/icon.png\" width=\"128\" height=\"128\" alt=\"Atlas\" /\u003e\n\u003c/p\u003e\n\n\u003ch1 align=\"center\"\u003eAtlas\u003c/h1\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003cb\u003eAI agent that lives on your desktop.\u003c/b\u003e\u003cbr/\u003e\n  It sees your screen, understands what you need, and gets things done — hands-free.\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003ca href=\"https://github.com/dortanes/atlas/releases\"\u003e\u003cimg src=\"https://img.shields.io/badge/download-v0.2.3-7c3aed?style=for-the-badge\u0026logo=windows\u0026logoColor=white\" alt=\"Download\" /\u003e\u003c/a\u003e\u0026nbsp;\n  \u003ca href=\"#-getting-started\"\u003e\u003cimg src=\"https://img.shields.io/badge/get%20started-→-0ea5e9?style=for-the-badge\" alt=\"Get Started\" /\u003e\u003c/a\u003e\u0026nbsp;\n  \u003ca href=\"LICENSE\"\u003e\u003cimg src=\"https://img.shields.io/badge/license-Apache%202.0-gray?style=for-the-badge\" alt=\"License\" /\u003e\u003c/a\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://img.shields.io/badge/Windows-supported-brightgreen?style=flat-square\u0026logo=windows\u0026logoColor=white\" alt=\"Windows\" /\u003e\n  \u003cimg src=\"https://img.shields.io/badge/macOS%20%26%20Linux-coming%20soon-yellow?style=flat-square\" alt=\"macOS \u0026 Linux\" /\u003e\n  \u003cimg src=\"https://img.shields.io/badge/privacy--first-local%20only-blue?style=flat-square\" alt=\"Privacy\" /\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\u003cimg src=\"docs/preview.png\" width=\"300\" alt=\"Atlas Demo\" /\u003e\u003c/p\u003e\n\n---\n\n\u003e **⚠️ Atlas is in active development (v0.2.3).**\n\u003e \n\u003e - 🤖 **LLM support:** Gemini (including native [Computer Use API](https://ai.google.dev/gemini-api/docs/computer-use)) and OpenAI. More providers on the way.\n\u003e - 🖥 **Screen control:** Gemini 3.x models use native [Computer Use API](https://ai.google.dev/gemini-api/docs/computer-use) for precise actions. Older models use vision-based coordinate prediction.\n\u003e - 💻 **Platform:** Windows only for now. macOS \u0026 Linux support is planned.\n\u003e - 🐛 **Found a bug?** We'd love to hear about it — [open an issue](https://github.com/dortanes/atlas/issues).\n\n---\n\n## What is Atlas?\n\nAtlas is an **AI-powered desktop agent** that works alongside you as a transparent overlay. Press `Ctrl+Space`, tell it what to do — and it figures out the rest: navigating apps, clicking buttons, typing text, searching the web, finding files, running commands.\n\nThink of it as a **copilot for your entire OS**.\n\n- 🖥 **Sees your screen** — captures what's on your display and understands the context\n- 🧠 **Thinks before it acts** — plans multi-step tasks and shows progress in real time\n- 🖱 **Controls your computer** — mouse, keyboard, and terminal — all automated\n- 🎯 **Shows what it's doing** — you can see the agent's cursor moving on screen\n- 🔍 **Searches the web** — finds answers and brings them back, no tab-switching needed\n- 📂 **Finds your files** — searches local files and folders by name, right from chat\n- 🗣 **Speaks to you** — real-time voice responses with streaming TTS\n- 🎙 **Listens to you** — local speech-to-text with wake word activation, no cloud required\n- 🔊 **Sound feedback** — distinct sounds for every state: activation, processing, task complete, warnings\n- 🛡 **Asks before doing anything risky** — built-in safety system with permission prompts\n\n---\n\n## ✨ Key Features\n\n### 🔮 The Orb\nA glowing AI indicator that shows you exactly what Atlas is doing — idle, thinking, acting, or waiting for your input. Always visible, never in the way.\n\n### 🏝 Islands\nContext-aware floating panels that appear when relevant:\n- **Action Island** — shows the current task and progress\n- **Response Island** — streams Atlas's thoughts and replies word by word\n- **Permission Island** — asks for confirmation before risky operations\n- **Microtask Island** — your task queue with real-time step progress (queue new tasks while the agent is busy)\n- **Search Island** — web search results and local file search results\n- **Listening Island** — live transcript display during voice input\n- **Warning Island** — dismissable warnings for errors and quota issues\n\n### 🎯 Agent Cursor\nWhen Atlas controls your desktop, you can see its cursor moving on screen — clicking, typing, and scrolling — so you always know what's happening.\n\n### 🖥 Computer Use\nWith compatible Gemini 3.x models, Atlas uses the native **[Computer Use API](https://ai.google.dev/gemini-api/docs/computer-use)** for precise screen control — clicking, typing, scrolling, navigating, and searching — all without opening extra apps. Multi-monitor setups are supported.\n\n### 🧩 Smart Task Planning\nBefore executing complex commands, Atlas breaks them into high-level steps (2–5) and displays them in the Task Queue. You see planned steps before execution begins and watch progress as each step completes.\n\n### 🎭 Personas\nCreate multiple AI agents with unique personalities, knowledge, and voices. Each persona has its own memory and prompt settings — switch between them from the tray menu.\n\n### 🧠 Memory\nAtlas remembers your preferences and context across sessions. It learns facts about you from conversations and uses them to give better responses over time. Browse conversation history and view, edit, or delete learned facts in Settings.\n\n### 🎙 Voice Input\nLocal offline speech-to-text via Vosk — just say the wake word (the active persona's name) and Atlas starts listening. No cloud API required.\n\n### ✍️ Editable Prompts\nFull control over the AI's behavior — modify system, action, and safety prompts directly from the Settings UI. Reset to defaults anytime.\n\n### ⚙️ Customizable Layout\nChoose where Atlas appears on screen (left, right, or center) and configure your preferred activation hotkey — all from Settings.\n\n### 🔧 Debug Logging\nEnable per-request session logs to trace the full pipeline: intent classification → LLM calls → actions → response streaming — with precise timing for every stage.\n\n---\n\n## 🚀 Getting Started\n\n### Download \u0026 Install\n\n1. Go to [**Releases**](https://github.com/dortanes/atlas/releases) and download the latest installer for Windows\n2. Run the installer — Atlas will appear in your system tray\n3. Get a **Gemini API key**: go to [Google AI Studio](https://aistudio.google.com/apikey) → sign in → **Create API Key** → copy it\n4. Click the **Atlas tray icon** → **Settings** → **Intelligence** tab → paste your API key\n5. Set the recommended models in the **Intelligence** tab:\n\n   | Setting | Free tier | Paid tier |\n   |---------|-----------|-----------|\n   | **Text model** | `gemini-3.1-flash-lite-preview` | `gemini-3.1-flash-lite-preview` |\n   | **Vision model** | `gemini-3.1-flash-lite-preview` | `gemini-3-flash-preview` |\n\n   \u003e Vision model handles screen control \u0026 Computer Use. Paid tier model is more accurate but requires a billing-enabled API key.\n\n6. *(Optional)* For voice output:\n   - **Alice** (free, no API key): **Voice** tab → select **Alice** → done!\n   - **ElevenLabs** (premium voices): get an [ElevenLabs](https://elevenlabs.io/) API key → **Voice** tab → paste key + voice ID\n7. Press `Ctrl+Space` and start giving Atlas tasks 🎉\n\n### Build from Source\n\n\u003e For contributors and developers who want to run Atlas from source.\n\n```bash\ngit clone https://github.com/dortanes/atlas.git\ncd atlas\nyarn install\nyarn dev\n```\n\n\u003e **Requires:** [Node.js](https://nodejs.org/) ≥ 20 · [Yarn](https://yarnpkg.com/) ≥ 1.22\n\n---\n\n## 🗺 Roadmap\n\n| Status | Feature |\n|:------:|---------|\n| ✅ | Transparent glassmorphism overlay with Orb + Island UI |\n| ✅ | LLM integration (Gemini + OpenAI) with multi-provider architecture |\n| ✅ | Screen vision + desktop automation (robotjs) |\n| ✅ | Native Gemini [Computer Use API](https://ai.google.dev/gemini-api/docs/computer-use) |\n| ✅ | Smart task planning with step-by-step progress |\n| ✅ | Agent cursor animations (click, type, scroll overlays) |\n| ✅ | Streaming TTS (ElevenLabs + Alice) |\n| ✅ | Persona system with isolated memory \u0026 custom voices |\n| ✅ | Web search + local file search |\n| ✅ | Settings UI with prompt editor + debug logging |\n| ✅ | Intent classification (direct / action / chat) |\n| ✅ | Context caching (Gemini prompt caching for token optimization) |\n| ✅ | Voice input (wake word + local STT via Vosk) |\n| 🔜 | Action whitelist/blacklist \u0026 audit log |\n| 🔜 | Onboarding flow |\n| 🔜 | Auto-update |\n\n---\n\n## ⭐ Support the Project\n\nIf you find Atlas useful, please consider giving the repository a **star** ⭐ — it helps others discover the project and motivates further development!\n\n[![Star on GitHub](https://img.shields.io/github/stars/dortanes/atlas?style=social)](https://github.com/dortanes/atlas)\n\n## 🤝 Contributing\n\nContributions are welcome! Feel free to open an issue or submit a pull request.\n\n## 📜 License\n\n[Apache License 2.0](LICENSE) — use it, modify it, build on it.\n\n---\n\n\u003cp align=\"center\"\u003e\n  Vibecoded with ❤️ by \u003ca href=\"https://github.com/dortanes\"\u003edortanes\u003c/a\u003e\n\u003c/p\u003e","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdortanes%2Fatlas","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdortanes%2Fatlas","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdortanes%2Fatlas/lists"}