https://github.com/mjason/long

Single-binary LLM agent runtime built on Elixir/OTP: chat UI, 4-tier memory, Anthropic-compatible Skills, scheduled tasks, multi-provider LLM routing, and platform bots.
https://github.com/mjason/long

ai-agent anthropic-skills ash-framework chat elixir liveview llm phoenix

Last synced: 30 days ago
JSON representation

Single-binary LLM agent runtime built on Elixir/OTP: chat UI, 4-tier memory, Anthropic-compatible Skills, scheduled tasks, multi-provider LLM routing, and platform bots.

Host: GitHub
URL: https://github.com/mjason/long
Owner: mjason
License: mit
Created: 2026-05-16T03:54:05.000Z (about 2 months ago)
Default Branch: main
Last Pushed: 2026-06-09T04:40:48.000Z (about 1 month ago)
Last Synced: 2026-06-09T05:23:59.155Z (about 1 month ago)
Topics: ai-agent, anthropic-skills, ash-framework, chat, elixir, liveview, llm, phoenix
Language: Elixir
Homepage: https://github.com/mjason/long
Size: 983 KB
Stars: 24
Watchers: 0
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Long

[![Elixir](https://img.shields.io/badge/elixir-1.15%2B-purple.svg)](https://elixir-lang.org)
[![Phoenix](https://img.shields.io/badge/phoenix-1.8-orange.svg)](https://phoenixframework.org)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
[![ghcr.io](https://img.shields.io/badge/ghcr.io-mjason%2Flong-2496ED?logo=docker&logoColor=white)](https://github.com/mjason/long/pkgs/container/long)

**English** · [简体中文](README.zh-CN.md)

> A single-process, multi-user LLM agent runtime on Elixir/OTP — for a household or a small team. Phoenix for the UI, Ash for the data layer, Oban for scheduled tasks, ReqLLM for provider abstraction.

Long *started* as a port of the Python [GenericAgent](https://github.com/lsdefine/GenericAgent) to Elixir, borrowing its core shape — *one session → ReAct loop → tools + memory + skills*. The design has since diverged substantially: on the BEAM it gets real concurrency, fault tolerance, and long-lived push messaging natively (one supervised GenServer per session rather than a bolted-on Python process model), and the agent's capability layer has been rebuilt on **mature, standard technology rather than a bespoke tool protocol** — most notably **GraphQL as the agent's primary skill** (see below).

It's **web-first**: you don't run a CLI to talk to it. Open the browser, and chat, configuration, memory, channels, and scheduled tasks are all just pages.

## Design philosophy

Long is built to install and run **like a personal CLI tool, not like server infrastructure.** Everything below follows from that one decision.

- **One self-contained binary.** `mix release` bundles the Erlang VM (ERTS) and every BEAM dependency into a single tarball. The target machine needs neither Erlang nor Elixir installed — just untar and run. No Docker image to build, no base image to track.
- **Installs into one directory, owns nothing else.** `curl | bash` drops everything under `~/.long/` — the binary, the VM, the SQLite database, the config, the agent workspace, the skills. Uninstall is `rm -rf ~/.long`. Upgrades wipe only `bin/ lib/ releases/ erts-*` and **preserve your `env`, `long.db`, and `agent/` data**.
- **No external services to provision.** Storage is SQLite (a file) plus the filesystem (skills, workspace). No Postgres, no Redis, no message broker. The whole runtime is *one OS process* — the BEAM — internally supervising sessions, bots, the scheduler, and even the headless-browser subprocess.
- **Userspace, no root.** Install and autostart run entirely as your user. `~/.long/service install` wires up launchd (macOS) or a systemd **user** unit (Linux) so the agent survives reboot — you never write a unit file or `sudo` anything.
- **No language runtime to provision.** `code_run` executes TypeScript/JavaScript in a sandboxed [Deno](https://deno.land/) binary that the app downloads and manages itself on first use — no Python, Node, or `uv` to install (`bash` is available for shell commands). The optional headless browser, Obscura, is fetched the same way.
- **LAN-first, not internet-hardened.** Binds `0.0.0.0`, `check_origin` off, no forced SSL — so a freshly installed node is reachable from any device on your home network by IP. Internet exposure is opt-in (`LONG_CHECK_ORIGIN`, a reverse proxy, etc.), never the default that locks you out on first run. In the same spirit, `code_run`'s `bash` mode runs with the server's full host access (only the Deno engine — the default — is sandboxed per-member); that's fine for a trusted household, not for untrusted members.
- **Open-source-friendly delivery.** The installer pulls release tarballs straight from GitHub Releases over plain `curl` — no `gh` CLI, no GitHub account, no auth. Anyone can install with one line.

The result: getting Long onto a Mac mini or a Linux box in the corner is `curl | bash` + paste an API key, and it behaves like an appliance from there.

## GraphQL as the agent's primary skill

Most agent frameworks invent a bespoke tool protocol — one narrow tool per capability (`schedule_task`, `remember_fact`, `update_checkpoint`, …), each with a hand-maintained schema the model has to be taught.

Long takes a different route: **the agent's main capability is a single `graphql` tool** over the entire Ash data layer — sessions, messages, both memory tiers, working checkpoints, scheduled tasks, secrets, LLM/search configs — for **read *and* write** through one uniform interface.

- **The model already speaks it.** GraphQL is in every model's pretraining; there's no custom DSL to explain.
- **It's self-describing.** The schema is introspectable (`{ __schema { queryType { fields { name } } } }`), so the agent discovers its own capabilities at runtime instead of us maintaining a wall of tool descriptions.
- **One tool replaces ~10.** Adding a new Ash resource automatically grants the agent CRUD over it — no new tool to write, register, or document.

This is the core bet: lean on mature, introspectable, widely-understood technology (GraphQL) as the agent's capability surface, and let the model's existing fluency do the rest. File-based **Skills** (`SKILL.md` + scripts, below) remain for packaged, code-carrying capabilities; GraphQL is how the agent reads and writes its own world.

## Web UI

Everything is a page — there's no separate CLI you have to learn to operate the agent day to day.

| Page | What it is |
|---|---|
| `/chat` | the agent. Streaming replies, live tool-call display, a memory side-rail, AI-named sessions. |
| `/manage` | everything else. LLMs, memory, skills, **groups & members**, sessions, search providers, channels, scheduled tasks, **phrases (i18n)** — each a LiveView. |

## Features

- **GraphQL capability layer** — one introspectable `graphql` tool gives the agent read/write over its whole data world (see above).
- **Groups & members (multi-tenant)** — a deployment hosts one or more **groups** (a family, a small team). Each **member** links their own WeChat / Telegram by sending `/bind `, gets an isolated per-member code workspace + personal skills, and can message other members in the group ("notify …"). One owner, many members. - **Sandboxed code execution** — `code_run` runs TypeScript/JavaScript on a managed [Deno](https://deno.land/) binary, sandboxed (read/write) to the caller's per-member workspace; `bash` is there for shell. No Python — Deno auto-downloads on first use. - **Per-chat language (i18n)** — every bot string resolves through a fallback chain (channel → member → group → platform-detected → system default) and is overridable at `/manage/phrases`; set a system-wide default language in one click. - **Web-first LiveView UI** — `/chat` (streaming output + tool-call display + live memory side-rail + AI-generated session titles) and `/manage` for everything else. - **Four-tier memory:** - L1 `WorkingCheckpoint` (one `key_info` row per session) - L2 `GlobalMemory` / `SessionMemory` (fact / preference / goal / decision, with importance + recency decay) - L3 **Anthropic-compatible Skills** (`SKILL.md` + scripts/references/assets), **scoped per-member or shared group-wide** (promote a personal skill to global, or view a skill's full `SKILL.md`, at `/manage/skills`); the filesystem is the source of truth, a watcher drives an ETS index - L4 `SessionArchive` (session archival + LLM summary) - **Multi-provider LLM routing** — ReqLLM speaks 20+ providers natively (openai / anthropic / google / groq / deepseek / openrouter / mistral / ollama / xai / bedrock / …); wire protocol is configurable, one alias is the default. - **Unified admin at `/manage`** — LLM configs, memory editing, skill browsing, **groups & members, channels, phrases (i18n)**, session management, search providers, scheduled tasks, secrets — all LiveView, no ash_admin dependency. - **Scheduled tasks** — Oban-driven; the LLM schedules its own via GraphQL `createScheduledTask`, or you create them by hand at `/manage/scheduled`. - **Multi-account channels** — host several WeChat accounts and/or Telegram bots at once, each bound to a different member; inbound auto-tags the sender, outbound routes back through the exact account the chat arrived on. Onboarded at `/manage/credentials`. - **Web search aggregation** — Tavily / Brave API + SERP scrapers, RRF multi-source merge, providers configured at `/manage/search`. - **Real headless browsing** — the Obscura CLI (a Rust Chromium fork) powers the `web_scan` / `web_execute_js` tools. - **Error observability** — ErrorTracker dashboard, a `:logger` crash backstop, transparent exponential-backoff retry on LLM calls. - **In-conversation control** — `/clear` wipes a session, `/status` asks what the agent is doing, `/btw ` interjects mid-run, `/bind ` links this chat to a member.

## Architecture ``` ┌─────────────────────────────────────────────────────────────────┐ │ Phoenix LiveView ─ /chat ─ /manage ─ navigation hub │ ├─────────────────────────────────────────────────────────────────┤ │ Long.Agent.Server (GenServer per session) ──► ReAct loop │ │ │ │ │ │ ├── persist messages → DB ├── tools (file/web/memory/…) │ │ └── PubSub stream → LiveView └── ReqLLM streaming │ ├─────────────────────────────────────────────────────────────────┤ │ Memory L1 WorkingCheckpoint L2 Global + Session │ │ L3 Skill.Store (FS + watcher + ETS) │ │ L4 SessionArchive │ ├─────────────────────────────────────────────────────────────────┤ │ Long.Agent.Bots ─ WeChat │ Telegram │ │ Oban ─ ScheduledTask runner ErrorTracker ─ /errors │ ├─────────────────────────────────────────────────────────────────┤ │ Storage: SQLite (Ash) + filesystem (skills, workspace) │ └─────────────────────────────────────────────────────────────────┘ ``` Core stack: | Dependency | Role | |---|---| | Elixir 1.15+, Phoenix 1.8, Ash 3, AshSqlite | App / data layer | | Oban + AshOban + Oban Web | Background job scheduling | | ReqLLM | Unified multi-provider LLM access | | Jido + Jido.AI | Tool system (Zoi schema + auto JSONSchema) | | Mishka Chelekom | 70+ Tailwind LiveView components | | Obscura | Rust headless-browser CLI | | ErrorTracker | In-app exception aggregation | ## Quick start ### One-line install (macOS / Linux) Prebuilt releases cover `macos-arm64`, `linux-x64`, and `linux-arm64`: ```bash curl -fsSL https://raw.githubusercontent.com/mjason/long/main/install.sh | bash ``` The script: - pulls the latest tarball from GitHub Releases and extracts it to `~/.long/`, - on first run generates `~/.long/env` (with an auto-generated `SECRET_KEY_BASE`, `DATABASE_PATH`, …), - writes the `~/.long/run` launcher and the `~/.long/service` autostart controller. ```bash $EDITOR ~/.long/env # usually nothing to change ~/.long/run # start; open http://localhost:4000 ``` Make it start on boot (no root, no unit-file editing): ```bash ~/.long/service install # enable autostart (launchd / systemd-user) ~/.long/service status # is it registered + running? ~/.long/service logs # tail run.log ~/.long/service uninstall # disable autostart ``` Installer environment variables: | Variable | Default | Meaning | |---|---|---| | `LONG_INSTALL_DIR` | `~/.long` | install target | | `LONG_VERSION` | latest | pin a version, e.g. `v0.2.9` | ### Docker A self-contained image bakes in the `mix release` **plus** Deno and Obscura — nothing is downloaded on first run. The repo ships a ready `docker-compose.yml`: ```yaml services: long: image: ghcr.io/mjason/long:latest # or build locally: build: . ports: ["4000:4000"] environment: SECRET_KEY_BASE: ${SECRET_KEY_BASE} # generate once: openssl rand -base64 48 PHX_HOST: ${PHX_HOST:-localhost} # LONG_CHECK_ORIGIN: "true" # set when exposing to the internet volumes: ["long_data:/data"] # SQLite DB + skills + workspace + memory restart: unless-stopped volumes: long_data: ``` From a clone of the repo: ```bash export SECRET_KEY_BASE=$(openssl rand -base64 48) docker compose up -d # build + run # → http://localhost:4000 ``` Data (SQLite DB, skills, agent workspace, memory) persists in the `long_data` volume across restarts and upgrades. Prefer a bare image? `docker build -t long .`. ### Run from source Requirements: - Elixir 1.15+ / Erlang 26+ - SQLite 3 - (Deno and the optional Obscura browser are auto-downloaded at runtime — nothing to pre-install.) ```bash git clone https://github.com/mjason/long.git cd long mix setup # deps.get + ecto.create + migrate + seeds + asset build mix phx.server ``` Open and enter `/chat` or `/manage/llms` from the navigation hub. ### Configure your first LLM Open `/manage/llms` → **New LLM**: | Field | Example | |---|---| | Alias | `claude_main` | | Provider | `anthropic` | | Wire protocol | `anthropic_messages` | | Model | `claude-sonnet-4` | | API base | `https://api.anthropic.com` (or a proxy) | | API key | `sk-ant-…`, or leave blank to use `api_key_env_var` | | Set as default | ✓ | Save, go back to `/chat`, and new sessions bind to this alias automatically. The same flow works for OpenAI / Google / Groq / DeepSeek / any ReqLLM-supported provider. ### Install your first Skill (optional) ```bash mkdir -p priv/agent/skills/hello-world/scripts cat > priv/agent/skills/hello-world/SKILL.md <<'MD' --- name: hello-world description: Demo skill — takes a `name` arg, returns a greeting string. tags: [demo] --- # hello-world Run `scripts/hello.py ""`. MD cat > priv/agent/skills/hello-world/scripts/hello.py <<'PY' import json, sys name = (json.loads(sys.argv[1]) if len(sys.argv) > 1 else {}).get("name", "world") print(json.dumps({"greeting": f"hello, {name}"}, ensure_ascii=False)) PY mix long.skill reindex # or restart the server; the watcher also picks it up ``` Next conversation, the LLM sees `hello-world` under `# Available skills` and can `skill_read` then `code_run` it. The format is fully compatible with [Anthropic Agent Skills](https://code.claude.com/docs/en/skills), so you can `git clone https://github.com/anthropics/skills priv/agent/skills/` to grab the official repo wholesale. ## Channels & members A deployment serves one or more **groups**; each group has **members** (people), and each member links their own chat accounts. The owner sets channels up once at **`/manage/credentials`** — no env vars, no restart: - **WeChat** — click *扫码登录* (scan to log in) for an inline QR and host an account via Tencent's iLink bot API (no desktop hook). **Multiple accounts** are supported — host one per member/role; the worker hot-reloads on connect. - **Telegram** — paste a [@BotFather](https://t.me/BotFather) token; the bot starts long-polling immediately. Replies render as Telegram HTML, with typing indicators and inbound/outbound media. **Multiple bots** are supported, one per member. Members then link themselves: each member has a `/bind ` (shown at `/manage/groups`); they send it to the shared bot and that chat account is tied to them. Once bound, members can address each other — *"notify my spouse / 通知老张 …"* — and the agent delivers to every channel that member has linked, in **their** language (see i18n above). Outbound always routes back through the exact account a chat arrived on. ## CLI tools | Command | Purpose | |---|---| | `mix phx.server` | start the web server (default port 4000) | | `mix long.skill list / reindex / remove NAME` | skill index management | | `mix long.wechat.login` | WeChat QR login + buf persistence | | `iex -S mix` | REPL: `Long.Agent.list_sessions()`, etc. | | `mix test` | test suite | | `mix precommit` | `compile --warnings-as-errors + format + test` | ## Configuration **Configure from the web, not from files.** Almost everything — LLMs, search providers, channels, scheduled tasks, memory, secrets — lives in the DB and is edited at `/manage`. There's no config file to redeploy for day-to-day changes. The only file-level config is a handful of filesystem roots, under `:long, Long.Agent` in `config/config.exs` (the installed release reads these from `~/.long/env` instead): ```elixir config :long, Long.Agent, memory_root: "priv/agent/memory", # legacy GenericAgent-compatible path skill_root: "priv/agent/skills", # L3 skill dir (SKILL.md lives here) workspace_root: "priv/agent/workspace" # root for code_run / file_* tools ``` Everything else is a page in `/manage` (or, if you prefer, an IEx call like `Long.Agent.register_llm/1`). ## Development ```bash mix test # unit + LiveView tests mix test test/long/jido # run one group mix format mix precommit # compile --warnings-as-errors + format + test mix usage_rules.docs Ash.Resource # look up dependency docs ``` `CLAUDE.md` carries project-level AI-agent guidance (usage rules + skill entry points), auto-loaded if you use Claude Code / Cursor / similar IDE agents. ## Status **Beta — small multi-user.** Runs as a daily AI assistant for a household / small workgroup (a few members, one box on the LAN). - Multi-tenant by **group + member** — per-member code workspaces, personal/shared skills, per-channel binding — but **no hard auth/RBAC yet**: members are trusted, and `bash` runs unsandboxed (only the Deno engine is per-member sandboxed). Fine for a family/team; not for untrusted members or internet exposure. - The schema changes occasionally; no backward-compat promise. - Some paths (mixin LLM, full WeChat / Telegram chains) have light test coverage. - Deployment docs are single-node only. Issues and PRs welcome, but there's no committed release cadence. ## Roadmap Near-term: - [x] Multi-user via groups + members (per-member workspace, personal/shared skills, channel binding) - [ ] Group-level RBAC — member roles enforced at the data layer (for workgroups / semi-trusted members) - [ ] Consolidate the dual ReAct loop + tool families (retire the legacy `Long.Agent.Loop` / `Long.Agent.Tools`) - [ ] Pin + SHA-verify the Deno / Obscura binary downloads (supply chain) Longer-term: - [ ] PubSub reconnect replay — buffer recent turn events so a reconnecting client doesn't miss mid-stream output - [ ] Timezone-aware scheduling (currently UTC-only) - [ ] Observability page — tool error rates, LLM retries, Deno / Obscura install status Backlog (nice-to-have, not roadmap): - Framework-driven `/manage` forms (Mishka `<.text_field>` / AshPhoenix.Form) instead of hand-rolled ``s ## Acknowledgements - The Python [GenericAgent](https://github.com/lsdefine/GenericAgent) was the original starting point. - [Ash Framework](https://ash-hq.org) lets data + domain logic be described together. - [ReqLLM](https://hexdocs.pm/req_llm/) makes onboarding a new LLM provider a one-line alias. - [Mishka Chelekom](https://mishka.tools/chelekom/) provides the full LiveView component library. - [Anthropic Agent Skills](https://github.com/anthropics/skills) defines the SKILL.md format. - [Obscura](https://github.com/h4ckf0r0day/obscura) forked Chromium into a clean CLI. ## License

MIT — see [LICENSE](LICENSE).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mjason/long

Awesome Lists containing this project

README