https://github.com/zorazrw/agent-workflow-memory
AWM: Agent Workflow Memory
https://github.com/zorazrw/agent-workflow-memory
agent generalization web-navigation
Last synced: 4 days ago
JSON representation
AWM: Agent Workflow Memory
- Host: GitHub
- URL: https://github.com/zorazrw/agent-workflow-memory
- Owner: zorazrw
- License: apache-2.0
- Created: 2024-08-28T22:02:57.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2025-01-31T16:06:52.000Z (2 months ago)
- Last Synced: 2025-04-04T02:42:58.659Z (12 days ago)
- Topics: agent, generalization, web-navigation
- Language: Python
- Homepage:
- Size: 1.28 MB
- Stars: 253
- Watchers: 4
- Forks: 25
- Open Issues: 10
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- acu - Code
README
## Quickstart :boom:
To run AWM on WebArena under `webarena/`:
```bash
cd webarena
python pipeline.py --website "shopping" # choose one from ['shopping', 'shopping_admin', 'reddit', 'gitlab', 'map']
```To run AWM on Mind2Web under `mind2web/`:
```bash
cd mind2web
python pipeline.py --setup "offline" # or "online"
```
Check `webarena/` and `mind2web/` folders for more detailed instructions about environment and data setups.## What is Agent Workflow Memory? ðŸ§
Agent Workflow Memory (AWM) proposes to induce, integrate, and utilize workflows via an agent memory.
A workflow is usually a common sub-routine in solving tasks, with example-specific contexts being abstracted out.AWM can operate in both offline and online settings:
- *offline* (left): when additional (e.g., training) examples are available, agents induce workflows from ground-truth annotated examples
- *online* (right): without any auxiliary data, agents induce workflows from past experiences on the fly.## How does AWM work? 📈
### On WebArena
We achieve the state-of-the-art result -- 35.6% success rate.Check the code in `./webarena/` directory.
### On Mind2Web
We also get the best scores among text-based agents. Particularly, AWM offline effectively generalizes across a wide range of tasks, websites, and domains.
Check the code in `./mind2web/` directory.
## Citation 📜
```bibtex
@inproceedings{awm2024wang,
title = {Agent Workflow Memory},
author = {Wang, Zhiruo anf Mao, Jiayuan, and Fried, Daniel and Neubig, Graham},
journal={arXiv preprint arXiv:2409.07429},
year = {2024},
}
```