awesome-agent-evolution
A curated list of AI Agent evolution, memory systems, multi-agent architectures, and self-improvement projects. | evomap.ai
https://github.com/EvoMap/awesome-agent-evolution
Last synced: about 7 hours ago
JSON representation
-
Agent Coding and Software Engineering
- **Claude Code** - Terminal-native agentic coding tool from Anthropic. Understands your codebase and executes tasks through natural language. by [@anthropics](https://github.com/anthropics) (132,937 stars)
- **Codex** - Lightweight coding agent from OpenAI written in Rust. Runs locally as CLI, IDE extension, or desktop app. by [@OpenAI](https://github.com/OpenAI) (91,654 stars)
- **Pi** - Self-extensible coding agent and agent harness. Bundles an interactive coding CLI, an agent runtime with tool calling and state, and a unified multi-provider LLM API. by [@earendil-works](https://github.com/earendil-works) (63,438 stars)
- **Cline** - Autonomous coding agent available as an IDE extension, CLI, or SDK. Plans and executes multi-step edits with human-in-the-loop approval. by [@cline](https://github.com/cline) (63,419 stars)
- **agent-skills** - Production-grade engineering skills and best practices for AI coding agents. by [@addyosmani](https://github.com/addyosmani) (61,714 stars)
- **goose** - Open-source extensible AI coding agent that goes beyond code suggestions. by [@aaif-goose](https://github.com/aaif-goose) (49,654 stars)
- **Aider** - AI pair programming in your terminal. Edit code with LLMs across 100+ languages with deep Git integration. by [@Aider-AI](https://github.com/Aider-AI) (46,359 stars)
- **Taste-Skill** - High-Agency frontend skill that helps AI generate less generic, more tasteful outputs. by [@Leonxlnx](https://github.com/Leonxlnx) (45,625 stars)
- **Qwen Code** - Open-source AI coding agent that lives in your terminal, optimized for Qwen-Coder models. by [@QwenLM](https://github.com/QwenLM) (25,297 stars)
- **SWE-Agent** - Automatically fix GitHub issues and handle cybersecurity challenges. State-of-the-art on SWE-bench. by [@SWE-agent](https://github.com/SWE-agent) (19,542 stars)
- **Devika** - The first open-source implementation of an Agentic Software Engineer. An open-source alternative to Devin. by [@stitionai](https://github.com/stitionai) (19,513 stars)
- **context-mode** - Context window optimization tool for AI coding agents with sandboxed tool output and 98% token reduction. by [@mksglu](https://github.com/mksglu) (17,634 stars)
- **Plandex** - Open-source AI coding agent designed for large projects and complex real-world tasks with persistent context. by [@plandex-ai](https://github.com/plandex-ai) (15,457 stars)
- **Trae Agent** - LLM-based agent by ByteDance for general-purpose software engineering tasks. by [@bytedance](https://github.com/bytedance) (11,672 stars)
- **Open SWE** - Open-source asynchronous coding agent by LangChain for software engineering tasks. by [@langchain-ai](https://github.com/langchain-ai) (9,989 stars)
- **Mini-SWE-Agent** - The 100-line AI agent that solves GitHub issues. Radically simple but scores >74% on SWE-bench verified. by [@SWE-agent](https://github.com/SWE-agent) (5,237 stars)
- **Reflexion** - Language agents with verbal reinforcement learning. Agents that learn from mistakes through self-reflection. by [@noahshinn](https://github.com/noahshinn) (3,183 stars)
-
Agent Development Platforms
- **dify** - Production-ready platform for building agentic AI workflows with visual orchestration. by [@langgenius](https://github.com/langgenius) (145,594 stars)
- **LangChain** - Full-stack agent engineering platform with composable chains, tools, and memory integration. by [@langchain-ai](https://github.com/langchain-ai) (139,540 stars)
- **OpenHands** - Open platform for AI software developers as generalist agents. Autonomous coding, debugging, and deployment. by [@OpenHands](https://github.com/OpenHands) (77,481 stars)
- **CowAgent** - Super AI assistant based on LLMs with autonomous thinking, task planning, skill creation, and long-term memory. by [@zhayujie](https://github.com/zhayujie) (45,371 stars)
- **agno** - Production-ready agent framework that turns agents into deployable services with multi-framework support. by [@agno-agi](https://github.com/agno-agi) (40,741 stars)
- **langgraph** - Build resilient language agents as stateful graphs with persistence and streaming. by [@langchain-ai](https://github.com/langchain-ai) (35,007 stars)
- **AgenticSeek** - Fully local autonomous agent with browsing, coding, and multi-agent capabilities. No API keys required. by [@Fosowl](https://github.com/Fosowl) (26,532 stars)
- **haystack** - Open-source AI orchestration framework for building context-engineered production applications. by [@deepset-ai](https://github.com/deepset-ai) (25,589 stars)
- **mastra** - TypeScript framework for building AI-powered applications with agent workflows and RAG. by [@mastra-ai](https://github.com/mastra-ai) (25,162 stars)
- **Coze Studio** - AI agent development platform with visual tools for creating, debugging, and deploying agents. by [@coze-dev](https://github.com/coze-dev) (21,000 stars)
- **Google ADK** - Open-source Python toolkit by Google for building, evaluating, and deploying sophisticated AI agents. by [@google](https://github.com/google) (20,149 stars)
- **CoPaw** - Co Personal Agent Workstation built on AgentScope. Desktop agent platform with multi-agent collaboration and tool integration. by [@agentscope-ai](https://github.com/agentscope-ai) (18,676 stars)
- **Parlant** - The conversational control layer for customer-facing AI agents. A context-engineering framework for controlling interactions. by [@emcie-co](https://github.com/emcie-co) (18,118 stars)
- **OpenFang** - Open-source Agent Operating System for deploying and managing AI agents. by [@RightNow-AI](https://github.com/RightNow-AI) (17,849 stars)
- **PydanticAI** - Type-safe AI agent framework built on Pydantic with structured outputs and dependency injection. by [@pydantic](https://github.com/pydantic) (17,805 stars)
- **agents** - Framework for building realtime voice AI agents with speech-to-speech pipelines. by [@livekit](https://github.com/livekit) (11,012 stars)
- **ten-framework** - Open-source framework for building conversational voice AI agents. by [@TEN-framework](https://github.com/TEN-framework) (10,680 stars)
- **Agent-Squad** - Flexible framework for managing multiple AI agents and handling complex conversations. by [@2FastLabs](https://github.com/2FastLabs) (7,661 stars)
- **PySpur** - Visual playground for agentic workflows with rapid iteration on multi-agent pipelines. by [@PySpur-Dev](https://github.com/PySpur-Dev) (5,737 stars)
- **MS-Agent** - Lightweight framework by ModelScope to empower agentic execution of complex tasks with memory and deep research. by [@modelscope](https://github.com/modelscope) (4,307 stars)
-
Agent Evolution and Self-Improvement
- **Eliza** - Autonomous agents for everyone. A framework for creating and deploying AI agents that evolve over time. by [@elizaOS](https://github.com/elizaOS) (18,598 stars)
- **Agent Zero** - General-purpose AI agent framework that learns and evolves through interaction. by [@agent0ai](https://github.com/agent0ai) (18,108 stars)
- **SuperAGI** - A dev-first open source autonomous AI agent framework. Build, manage and run self-improving autonomous agents. by [@TransformerOptimus](https://github.com/TransformerOptimus) (17,573 stars)
- **evolver** - The GEP-powered self-evolution engine for AI agents. Genome Evolution Protocol enables agents to evolve autonomously via mutation and selection. by [@EvoMap](https://github.com/EvoMap) (8,756 stars)
- **OpenEvolve** - Open-source evolutionary coding agent inspired by AlphaEvolve. Evolves code solutions through LLM-driven mutation and selection. by [@algorithmicsuperintelligence](https://github.com/algorithmicsuperintelligence) (6,564 stars)
- **Agents (aiwaves)** - An open-source framework for data-centric, self-evolving autonomous language agents. by [@aiwaves-cn](https://github.com/aiwaves-cn) (5,932 stars)
- **EvoAgentX** - Automated framework for evolving agentic workflows. Optimizes agent prompts, tools, and pipelines via evolutionary algorithms. by [@EvoAgentX](https://github.com/EvoAgentX) (3,077 stars)
- **HyperAgents** - Self-referential self-improving agents by Meta. DGM-Hyperagents add an optimization layer so agents edit their own improvement process. by [@facebookresearch](https://github.com/facebookresearch) (2,583 stars)
- **SIA** - Self-improving AI framework that autonomously optimizes the performance of any AI system through iterative evaluation and refinement. by [@hexo-ai](https://github.com/hexo-ai) (1,753 stars)
- **Agent0** - Self-evolving agent framework from UNC/Salesforce/Stanford. Improves without human-curated datasets via curriculum and executor agent competition. by [@aiming-lab](https://github.com/aiming-lab) (1,220 stars)
- **Ouroboros** - Self-creating AI agent that writes its own code and evolves autonomously. Completed 30+ evolution cycles in first 24 hours with zero human intervention. by [@razzant](https://github.com/razzant) (642 stars)
- **A-Evolve** - The PyTorch for Agentic AI. Open-source infrastructure that evolves any agent across any domain with zero human intervention. #1 on MCP-Atlas (79.4%). by [@A-EVO-Lab](https://github.com/A-EVO-Lab) (611 stars)
- **SEAgent** - Self-Evolving Computer Use Agent with Autonomous Learning from Experience. by [@SunzeY](https://github.com/SunzeY) (251 stars)
-
Agent Safety and Guardrails
- **NeMo Guardrails** - NVIDIA's toolkit for adding programmable guardrails to LLM conversational systems. Policy-based safety controls. by [@NVIDIA-NeMo](https://github.com/NVIDIA-NeMo) (6,454 stars)
- **AgentDoG** - Diagnostic guardrail framework for AI agent safety and security. Detects and intercepts unsafe agent behavior at runtime. by [@AI45Lab](https://github.com/AI45Lab) (626 stars)
-
Agent-to-Agent Protocols
- **Google A2A** - Google's open Agent-to-Agent protocol. Enables agent discovery, secure collaboration, and long-running tasks while preserving agent opacity. by [@a2aproject](https://github.com/a2aproject) (24,321 stars)
- **mcp-use** - The fullstack MCP framework to develop MCP Apps for ChatGPT/Claude and MCP Servers for AI Agents. by [@mcp-use](https://github.com/mcp-use) (10,114 stars)
- **openagent** - Enterprise AI platform with MCP and A2A protocol management, knowledge base, and admin interface. by [@the-open-agent](https://github.com/the-open-agent) (5,266 stars)
- **ViteMCP** - A TypeScript framework for building MCP servers. by [@punkpeye](https://github.com/punkpeye) (3,198 stars)
- **arcade-mcp** - MCP server framework and tool-development library for building custom agent capabilities and authenticated tool calls. by [@ArcadeAI](https://github.com/ArcadeAI) (925 stars)
- **A2A x402** - A2A protocol extension adding x402 on-chain payments, letting agents monetize services over Agent-to-Agent calls. by [@google-agentic-commerce](https://github.com/google-agentic-commerce) (526 stars)
- **GEP MCP Server** - MCP Server for Genome Evolution Protocol. Exposes evolution tools to Claude Desktop, Cursor, and any MCP client. by [@EvoMap](https://github.com/EvoMap) (4 stars)
-
Benchmarks and Evaluation
-
Embodied AI and Robotics
- SWE-bench - Can agents resolve real-world GitHub issues?
- AgentBench - Multi-dimensional evaluation of LLMs as agents.
- WebArena - Realistic web environment for autonomous agents.
- OSWorld - Open-ended tasks in real computer environments.
- GAIA - General AI assistant capabilities benchmark.
- EvoClaw - Evaluating agents on continuous software evolution.
- LoCoMo - Long-context memory benchmark for agent memory systems.
-
-
Community and Knowledge
-
Embodied AI and Robotics
- **Awesome-Self-Evolving-Agents** - A comprehensive survey of self-evolving AI agents. Covers single-agent optimization, multi-agent optimization, and domain-specific approaches. by [@EvoAgentX](https://github.com/EvoAgentX) (2,245 stars)
-
-
Embodied AI
- **Open-AutoGLM** - An Open Phone Agent Model and Framework. Unlocking the AI Phone for Everyone. by [@zai-org](https://github.com/zai-org) (25,550 stars)
- **LeRobot** - Open-source robotics framework by Hugging Face. Models, datasets, and tools for real-world robotics in PyTorch. (25,053 stars)
- **Nanobrowser** - Chrome extension for AI-powered web automation. Run multi-agent workflows using your own AI keys. by [@nanobrowser](https://github.com/nanobrowser) (13,307 stars)
- **XcodeBuildMCP** - A MCP server and CLI for agent use when working on iOS and macOS projects. by [@getsentry](https://github.com/getsentry) (5,917 stars)
- **Mobile MCP** - Model Context Protocol Server for Mobile Automation and Scraping (iOS, Android, Emulators and Real Devices). by [@mobile-next](https://github.com/mobile-next) (5,219 stars)
- **agent-device** - CLI that lets AI agents drive real iOS and Android devices — taps, text input, screenshots, and app control for mobile automation. by [@callstack](https://github.com/callstack) (2,807 stars)
- **ROS-LLM** - Framework for embodied intelligence in ROS. Natural language interactions with LLMs for robot control. by [@Auromix](https://github.com/Auromix) (802 stars)
- **RAI** - Vendor-agnostic agentic framework for Physical AI and robotics. Connects LLM agents to ROS 2 tools for perception, reasoning, and control. by [@RobotecAI](https://github.com/RobotecAI) (529 stars)
-
Footnotes
-
Embodied AI and Robotics
- EvoMap
- Awesome Agent Swarm - agent orchestration, swarm intelligence, and collaborative agent systems.
-  (58,775 stars)
- **Letta** - Platform for building stateful agents with advanced self-editing memory. Formerly MemGPT. by [@letta-ai](https://github.com/letta-ai) (23,376 stars)
- **agentmemory** - Persistent, benchmark-tuned memory for coding agents (Claude Code, Cursor, Copilot CLI, Codex, and any MCP client). Remembers context across sessions so you stop re-explaining. by [@rohitg00](https://github.com/rohitg00) (23,222 stars)
- **Cognee** - Knowledge engine for AI agent memory. Build and query knowledge graphs from unstructured data in 6 lines of code. by [@topoteretes](https://github.com/topoteretes) (17,870 stars)
- **Memvid** - Single-file memory layer for AI Agents in Rust. +35% SOTA on LoCoMo with ultra-low latency (0.025ms P50). by [@memvid](https://github.com/memvid) (15,662 stars)
- **memU** - Memory system for 24/7 proactive agents. Persistent memory across sessions and platforms. by [@NevaMind-AI](https://github.com/NevaMind-AI) (13,882 stars)
- **EverMemOS** - Long-term memory for 24/7 AI agents across LLMs and platforms. by [@EverMind-AI](https://github.com/EverMind-AI) (7,629 stars)
- **ChatLab** - Rediscover your social memories with local, AI-powered analysis. 本地化的聊天记录分析工具,通过 AI Agent 回顾你的社交记忆。. by [@ChatLab](https://github.com/ChatLab) (6,716 stars)
- **TencentDB Agent Memory** - Fully local long-term memory for AI agents via a four-tier progressive storage architecture, from Tencent Cloud. by [@TencentCloud](https://github.com/TencentCloud) (5,859 stars)
- **holaOS** - Agent environment for long-horizon work, continuity, and self-evolution. by [@holaboss-ai](https://github.com/holaboss-ai) (5,513 stars)
- **honcho** - Memory library for building stateful agents with user context management. by [@plastic-labs](https://github.com/plastic-labs) (5,231 stars)
- **memgraph** - High-performance open-source in-memory graph database for GraphRAG, AI memory, agentic AI, and real-time graph analytics. Cypher-compatible, built in C++. by [@memgraph](https://github.com/memgraph) (4,164 stars)
- **Acontext** - Open-source skill memory layer for AI agents. Automatically captures learnings from agent runs and stores them as reusable skill files. by [@memodb-io](https://github.com/memodb-io) (3,538 stars)
- **MemMachine** - Universal memory layer for AI agents. Episodic (graph-based), profile (SQL), and working memory with scalable storage and retrieval. by [@MemMachine](https://github.com/MemMachine) (3,119 stars)
- **ReMe** - Memory management kit for agents. File-based and vector-based memory systems. SOTA on LoCoMo and HaluMem benchmarks. by [@agentscope-ai](https://github.com/agentscope-ai) (3,094 stars)
- **datachain** - Operational data context layer for AI agents providing typed and versioned datasets over multimodal content. by [@datachain-ai](https://github.com/datachain-ai) (2,784 stars)
- **nocturne_memory** - Lightweight, rollbackable Long-Term Memory Server for MCP Agents with graph-like structured memory. by [@Dataojitori](https://github.com/Dataojitori) (1,210 stars)
- **Mem9** - Unlimited persistent memory layer for AI agents. Cloud-synced memory across sessions and tools. by [@mem9-ai](https://github.com/mem9-ai) (1,143 stars)
- **Awesome-AI-Memory** - Curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and system design. by [@IAAR-Shanghai](https://github.com/IAAR-Shanghai) (994 stars)
- **MemSkill** - Learning and evolving memory skills for self-evolving agents. Meta-memory that determines what to extract, remember, and forget. by [@ViktorAxelsen](https://github.com/ViktorAxelsen) (522 stars)
- **Awesome-Agent-Memory** - Curated systems, benchmarks, and papers on memory for LLMs/MLLMs -- long-term context, retrieval, and reasoning. by [@TeleAI-UAGI](https://github.com/TeleAI-UAGI) (476 stars)
- **TeleMem** - High-performance drop-in Mem0 replacement. 19% higher accuracy, 43% fewer tokens, and 2.1x speedup via narrative dynamic extraction. by [@TeleAI-UAGI](https://github.com/TeleAI-UAGI) (466 stars)
-
Prompt and Behaviour Optimization
- **Promptfoo** - Open-source LLM evaluation and red-teaming framework. Test prompts, agents, and RAGs with 90+ model providers and 67+ security plugins. by [@promptfoo](https://github.com/promptfoo) (22,304 stars)
- **TextGrad** - Automatic differentiation via text. Backpropagation through LLM-provided textual gradients, published in Nature. by [@zou-group](https://github.com/zou-group) (3,610 stars)
Programming Languages
Categories
Key Research Papers
42
Memory Systems
22
Agent Development Platforms
20
Agent Coding and Software Engineering
17
Agent Evolution and Self-Improvement
13
Embodied AI
8
Agent-to-Agent Protocols
7
Benchmarks and Evaluation
7
Footnotes
3
Prompt and Behaviour Optimization
2
Agent Safety and Guardrails
2
Community and Knowledge
1
Sub Categories
Keywords
ai
30
llm
28
agents
16
agent
15
ai-agents
14
mcp
12
rag
12
python
12
openai
12
chatgpt
8
generative-ai
7
memory
5
gemini
5
agentic-ai
5
gpt-4
5
openclaw
5
chatbot
5
genai
5
artificial-intelligence
4
nlp
4
multimodal
4
skills
4
javascript
4
cli
4
pydantic
4
open-source
4
langchain
4
typescript
4
agentic
3
agent-framework
3
claude-code
3
model-context-protocol
3
codex
3
a2a
3
ai-agent
3
knowledge-base
3
llm-agent
3
autonomous-agents
3
language-model
3
workflow
3
nextjs
3
gpt
3
llmops
3
framework
3
anthropic
3
developer-tools
3
enterprise
2
semantic-search
2
retrieval-augmented-generation
2
langgraph
2