An open API service indexing awesome lists of open source software.

https://github.com/omnimind-ai/openomnibot

Your on-device AI assistant, capable of operating terminals and performing a wide range of tasks in the Android world || 你的端侧 AI 助手,她可以操作终端,也可以完成 Android 世界的广泛任务
https://github.com/omnimind-ai/openomnibot

agent ai alpine android automation bot claw codex gui linux on-device skills

Last synced: 3 days ago
JSON representation

Your on-device AI assistant, capable of operating terminals and performing a wide range of tasks in the Android world || 你的端侧 AI 助手,她可以操作终端,也可以完成 Android 世界的广泛任务

Awesome Lists containing this project

README

          



OpenOmniBot


English |
简体中文


Your On-Device AI Assistant


GitHub Repo stars
GitHub Release


OmniMind
Linux Do Community

WeChat Group


|
Demo
|
Quick Start
|
Release
|
Issues
|

> Unlike traditional mobile AI chat apps, OpenOmniBot runs directly on your device and can operate your Android phone like a human, including apps, gestures, and system settings.

OpenOmniBot is an on-device AI agent built with native Android Kotlin and Flutter. Instead of stopping at chat, it focuses on the full loop of **understand -> decide -> execute -> reflect**.

Core Capabilities

- **Extensible tool ecosystem**: Skills, Alpine environment, browser access, MCP, and Android system-level tools.
- **Phone task automation**: Uses vision models to understand and operate mobile interfaces.
- **System-level actions**: Supports scheduled tasks, alarms, calendar creation/query/update, and audio playback control.
- **Memory system**: Short-term and long-term memory with embedding support.
- **Productivity tools**: Read and write files, browse the workspace, use the browser, and access the terminal.

Quick Start


Example

### Configure the app

Open the settings page from the left sidebar:


Configure AI capabilities
Configure AI providers

Then open the scenario model settings:


Configure AI models

Note: `Memory embedding` requires an embedding model. For the best overall experience, the other scenarios should use multimodal or vision-capable models whenever possible.


Alpine environment

The app usually initializes the Alpine environment automatically on startup, and you can also manage that environment from the same settings area.

Use Cases

### Skills

You can ask OmniBot to install a skill by simply sending it the repository link. Recommended collection: https://github.com/OpenMinis/MinisSkills

Enable or disable skills from the skill repository:


Skill store
Skill example

### VLM tasks


VLM task

Before starting a task, open the chat page and grant all required permissions from the top-right corner.

### Local model inference


Local inference

Supports both MNN and llama backends.

### Scheduled tasks


Scheduled task
Timing

Scheduled tasks can execute work such as VLM tasks and subagent flows. Alarms are reminder-only. A subagent can be assigned a complete task and behaves like a full agent.

### Browser


Browser

### Workspace


Workspace

Development Guide

### Requirements

- Flutter SDK `3.9.2+`
- JDK `11+`

### Get the code

```bash
git clone https://github.com/omnimind-ai/OpenOmniBot.git
cd OpenOmniBot

git submodule update --init third_party/omniinfer
git -C third_party/omniinfer submodule update --init framework/mnn
git -C third_party/omniinfer submodule update --init framework/llama.cpp

cd ui
flutter pub get
```

If Flutter reports `Could not read script '.../ui/.android/include_flutter.groovy'`, run:

```bash
flutter clean
flutter pub get
```

### Build and install

```bash
cd ..
./gradlew :app:installDevelopDebug
```

Architecture Overview

```text
OpenOmniBot/
├── app/ # Android host app: entry point, agent orchestration, system abilities, MCP, services
├── ui/ # Flutter UI: chat, settings, tasks, memory, and web chat bundle
├── baselib/ # Shared core libraries: database, storage, networking, model config, OCR, permissions
├── assists/ # Automation engine: task scheduling, state machine, visual detection, execution control
├── accessibility/ # Accessibility and screen perception: accessibility service, screenshots, projection
├── omniintelligence/ # AI abstractions: model protocol, task status, request/response models
├── uikit/ # Native overlay UI: floating ball, overlay panels, half-screen surfaces
├── third_party/omniinfer/ # Local inference runtime and Android integration modules
└── ReTerminal/core/ # Embedded terminal experience modules
```

Community

Thanks to the community (including [LINUX](linux.do))developers supporting OpenOmniBot.

Special thanks to these open-source projects:

- https://github.com/RohitKushvaha01/ReTerminal
- https://github.com/OpenMinis



WeChat Group

WeChat Group