Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Awesome-Papers-Autonomous-Agent
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
https://github.com/lafmdp/Awesome-Papers-Autonomous-Agent
Last synced: 2 days ago
JSON representation
-
Surveys
-
LLM-based agent
-
Algorithm design
-
Multi-agent (e.g., society, coperation)
- CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving
- Building Cooperative Embodied Agents Modularly with Large Language Models
- OKR-Agent: An Object and Key Results Driven Agent System with Hierarchical Self-Collaboration and Self-Evaluation
- MetaGPT: Meta Programming for Multi-Agent Collaborative Framework
- AutoAgents: A Framework for Automatic Agent Generation
- Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization
- AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors
- Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View
- REX: Rapid Exploration and eXploitation for AI agents
- Emergence of Social Norms in Large Language Model-based Agent Societies
-
Benchmark & Dataset
- ACL'24
- ICLR'23
- SmartPlay : A Benchmark for LLMs as Intelligent Agents
- AgentBench: Evaluating LLMs as Agents
- Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
- SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
- SocioDojo: Building Lifelong Analytical Agents with Real-world Text and Time Series
- WebArena: A Realistic Web Environment for Building Autonomous Agents
- LLM-Deliberation: Evaluating LLMs with Interactive Multi-Agent Negotiation Game
- Evaluating Large Language Models at Evaluating Instruction Following
- CivRealm: A Learning and Reasoning Odyssey for Decision-Making Agents
-
Task-specific designing
- Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4
- NeurIPS'23
- NeurIPS'23
- Rethinking the Buyer’s Inspection Paradox in Information Markets with Language Agents
- A Language-Agent Approach to Formal Theorem-Proving
- Agent Instructs Large Language Models to be General Zero-Shot Reasoners
- Ghost in the Minecraft: Hierarchical Agents for Minecraft via Large Language Models with Text-based Knowledge and Memory
- PaperQA: Retrieval-Augmented Generative Agent for Scientific Research
- Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
-
Multimodal
- ICML'23
- Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds
- Multimodal Web Navigation with Instruction-Finetuned Foundation Models
- You Only Look at Screens: Multimodal Chain-of-Action Agents
- Learning Embodied Vision-Language Programming From Instruction, Exploration, and Environmental Feedback
- An Embodied Generalist Agent in 3D World
- JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
-
Train LLM for generalization & adaptation
-
Experimental analysis
- Identifying the Risks of LM Agents with an LM-Emulated Sandbox
- Evaluating Multi-Agent Coordination Abilities in Large Language Models
- Large Language Models as Gaming Agents
- Benchmarking Large Language Models as AI Research Agents
- Adaptive Environmental Modeling for Task-Oriented Language Agents
- CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
-
Applications
-
Combined with RL
-
Others
-
-
Update history
-
RL-based agent
-
LLM as a tool
- STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models
- NeurIPS'23
- ICLR'23
- ICML'23
- ICML'23
- ICML'23
- Leveraging Large Language Models for Optimised Coordination in Textual Multi-Agent Reinforcement Learning
- Text2Reward: Dense Reward Generation with Language Models for Reinforcement Learning
-
Instruction following
- NeurIPS'23
- NeurIPS'23
- Compositional Instruction Following with Language Models and Reinforcement Learning
- RT-1: Robotics Transformer for Real-World Control at Scale - 1-robotics-transformer-for-real.html)]
- RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control - 2-new-model-translates-vision-and-language-into-action/)]
- Open X-Embodiment: Robotic Learning Datasets and RT-X Models - up-learning-across-many-different-robot-types/)]
- LEO: An Embodied Generalist Agent in 3D World - generalist.github.io/)]
-
Build agent based on World model
-
Language as knowledge
-
Generalization across tasks
-
Continual learning
-
Combine RL and LLM
-
Transformer-based policy
- NeurIPS'23 - agent.github.io/)]
-
Trajectory to language
-
Trajectory predication
-
Others
-
Sub Categories
Benchmark & Dataset
11
Multi-agent (e.g., society, coperation)
10
Task-specific designing
9
LLM as a tool
8
Multimodal
7
Instruction following
7
Others
6
Experimental analysis
6
Algorithm design
6
Trajectory to language
4
Train LLM for generalization & adaptation
4
Combine RL and LLM
4
Continual learning
3
Combined with RL
3
Language as knowledge
3
Build agent based on World model
3
Applications
2
Generalization across tasks
2
Transformer-based policy
1
Trajectory predication
1