An open API service indexing awesome lists of open source software.

https://github.com/steel-dev/awesome-web-agents

๐Ÿ”ฅ A list of tools, frameworks, and resources for building AI web agents
https://github.com/steel-dev/awesome-web-agents

List: awesome-web-agents

ai ai-agents browser-automation llms llms-benchmarking

Last synced: 5 months ago
JSON representation

๐Ÿ”ฅ A list of tools, frameworks, and resources for building AI web agents

Awesome Lists containing this project

README

          

# Awesome Web Agents










A curated list of tools, frameworks, and resources for building AI agents that can browse and interact with the web.

About Steel



Steel is an [open-source](https://github.com/steel-dev/steel-browser) browser API built specifically for AI agents. We make it easy to build AI applications that can effectively interact with the web.

โœจ Get started for free [here](https://app.steel.dev).

Contents

- [Awesome Web Agents](#awesome-web-agents)
- [Autonomous Web Agents](#autonomous-web-agents)
- [Computer-use Agents](#computer-use-agents)
- [AI Web Automation Tools](#ai-web-automation-tools)
- [Dev Tools](#dev-tools)
- [AI Web Scrapers/Crawlers](#ai-web-scraperscrawlers)
- [Web Search \& Query Tools](#web-search--query-tools)
- [Benchmarks \& Research](#benchmarks--research)
- [Tutorials \& Guides](#tutorials--guides)
- [Interested in implementing Steel?](#interested-in-implementing-steel)
- [Join the Community](#join-the-community)
- [Contributing](#contributing)
- [Contributors](#contributors)

## Autonomous Web Agents

AI agents that autonomously navigate and interact with the web through a user-friendly interface. (a.k.a Browser Agents)

- [Surf.new](https://surf.new) - An open-source playground for chatting with different web agents. ![GitHub Repo stars](https://img.shields.io/github/stars/steel-dev/surf.new?style=social)
- [OpenAI Operator](https://openai.com/index/introducing-operator/) - OpenAI's AI agents that can browser the web for you.
- [Browser-Use](https://www.browser-use.com) - SOTA agent and framework that makes the web LLM-friendly. ![GitHub Repo stars](https://img.shields.io/github/stars/Browser-Use/browser-use?style=social)
- [Skyvern-AI](https://www.skyvern.com/) - Framework to automate browser-based workflows. ![GitHub Repo stars](https://img.shields.io/github/stars/Skyvern-AI/skyvern?style=social)
- [Proxy by Convergence](https://convergence.ai) - Proxy is your AI-powered digital assistant that explores the web and executes tasks through simple conversation.
- [Google Project Mariner](https://deepmind.google/technologies/project-mariner/) - A research prototype exploring the future of human-agent interaction, starting with your browser.
- [Runner H](https://www.hcompany.ai/) - Runner H is a state-of-the-art AI agent that will allow anyone to automate complex, cumbersome, multi-step tasks without repetitive and manual input.
- [WebVoyager (Agent)](https://github.com/MinorJerry/WebVoyager) - Vision-enabled web agent. ![GitHub Repo stars](https://img.shields.io/github/stars/MinorJerry/WebVoyager?style=social)
- [AgentGPT](https://github.com/reworkd/AgentGPT) - Deploy autonomous AI agents in your browser. ![GitHub Repo stars](https://img.shields.io/github/stars/reworkd/AgentGPT?style=social)
- [Agent-E](https://github.com/EmergenceAI/Agent-E) - Agent & framework with HTML DOM distillation. ![GitHub Repo stars](https://img.shields.io/github/stars/EmergenceAI/Agent-E?style=social)
- [Kura](https://www.trykura.com/) - Web Agents for the Enterprise.
- [Manus](https://manus.im/) - A general AI agent that can execute long running tasks across tools like browsers, terminals, and text editors.
- [doBrowser](https://www.dobrowser.io) - An AI-powered Chrome extension that understands natural language and takes actions in your browser on your behalf.
- [WebSurfer (Autogen)](https://microsoft.github.io/autogen/stable/reference/python/autogen_ext.agents.web_surfer.html#autogen_ext.agents.web_surfer.MultimodalWebSurfer) - MultimodalWebSurfer is a multimodal agent that can search the web and visit web pages. ![GitHub Repo stars](https://img.shields.io/github/stars/microsoft/autogen?style=social)
- [Magentic-One](https://www.microsoft.com/en-us/research/articles/magentic-one-a-generalist-multi-agent-system-for-solving-complex-tasks/) - A generalist multi-agent system for solving complex tasks including surfing the web via Autogen's MultimodalWebSurfer.
- [Harpa.ai](https://harpa.ai/) - An AI-powered Chrome extension & browser agent that understands natural language and takes actions on your behalf.
- [Yutori](https://yutori.com/) - A multi-agent system that executes browser-based tasks in parallel given a natural language prompt.
- [Automina](https://automina.app/) - AI browser automation tool with natural language control.
- [rtrvr.ai](https://www.rtrvr.ai/) - AI Web Agent Chrome Extension that autonomously does tasks, scrapes to Sheets, and calls API's โ€“ all with just prompts and your own browser!
- [Nanobrowser](https://nanobrowser.ai) - An open-source & local-first AI web agent Chrome extension with flexible LLM options and multi-agent system. ![GitHub Repo stars](https://img.shields.io/github/stars/nanobrowser/nanobrowser?style=social)
- [Browserable](https://browserable.ai) - An open-source & self-hostable browser automation library for AI agents. ![GitHub Repo stars](https://img.shields.io/github/stars/browserable/browserable?style=social)
- [Tongyi WebAgent](https://github.com/Alibaba-NLP/WebAgent) - WebAgent for Information Seeking bulit by Tongyi Lab, Alibaba Group. ![GitHub Repo stars](https://img.shields.io/github/stars/Alibaba-NLP/WebAgent?style=social)

### Computer-use Agents

- [Anthropic Computer Use](https://www.anthropic.com/news/3-5-models-and-computer-use) - Computer use agent that can control your browser.
- [Self-Operating Computer Framework](https://github.com/OthersideAI/self-operating-computer) - A framework to enable multimodal models to operate a computer. ![GitHub Repo stars](https://img.shields.io/github/stars/OthersideAI/self-operating-computer?style=social)
- [Highlight](https://highlightai.com/) - Highlight AI lets models understand your desktop activity. Get stuff done faster.
- [OpenInterpreter](https://github.com/openinterpreter/open-interpreter) - An open-source CLI based agent that can write & execute code as well as control your browser. ![GitHub Repo stars](https://img.shields.io/github/stars/openinterpreter/open-interpreter?style=social)
- [UI-TARS](https://github.com/bytedance/UI-TARS?tab=readme-ov-file) - A GUI agent model designed to interact seamlessly with GUIs using human-like perception, reasoning, and action capabilities. ![GitHub Repo stars](https://img.shields.io/github/stars/bytedance/UI-TARS?style=social)

## AI Web Automation Tools

Tools, frameworks and libraries that translate natural language instructions into web interactions.

- [Asteroid.ai](https://asteroid.ai/) - Hosted Browser Agents for SMEs to automate complex workflows. ![GitHub Repo stars](https://img.shields.io/github/stars/ishan0102/vimGPT?style=social)
- [PulsarRPA](https://github.com/platonai/pulsarRPA) - AI-powered browser automation for data extraction. ![GitHub Repo stars](https://img.shields.io/github/stars/platonai/pulsarRPA?style=social)
- [VimGPT](https://github.com/ishan0102/vimGPT) - Experimental project using GPT-4 Vision to browse the web via the Vimium extension. ![GitHub Repo stars](https://img.shields.io/github/stars/ishan0102/vimGPT?style=social)
- [Cekura.io](https://www.cekura.io/) - An AI browser agent that helps companies maintain up-to-date documentation.
- [Dex by Dexterity](https://getdexterity.com/) - An AI coworker embedding into and controlling your browser.
- [Autobrowser](https://autobrowser.ai/) - A free, experimental Chrome extension that leverages Claude Computer Use to automate tasks in your browser.
- [Bytebot](https://bytebot.ai) - Bytebot provides AI-powered scraping automations that evolve with your target sites.
- [Runcopycat](https://www.runcopycat.com/) - A no-code browser automation platform that turns screen recordings into reusable automated workflows.
- [Bardeen.ai](https://bardeen.ai) - A Chrome extension that enables AI-powered browser automations, allowing users to automate tasks and workflows directly within the browser.
- [Starizon.ai](https://starizon.ai/) - Browser assistant for web task automation.
- [BrowserGPT](https://browsegpt.ai/) - Browser extension for page summaries and Q&A.
- [Browse.ai](https://www.browse.ai/) - Chrome extension webscraping that can leverage AI for structured data extraction.
- [Strawberry Browser](https://www.strawberrybrowser.com/) - A personal assistant that sits in your browser, automates repetitive web actions, learns your workflows.
- [Deta.surf](https://deta.surf/) - An integrated platform that combines a browser, file manager, and AI assistant with browser-level context.
- [Comet by Perplexity](https://www.perplexity.ai/comet) - An AI-powered browser by Perplexity. Not much more details out yet.
- [Dia Browser](https://www.diabrowser.com/) - Dia Browser is envisioned as an entirely new web browser built with AI at the center by The Browser Company (Arc).
- [Reworkd](https://reworkd.ai) - No-code web data extraction solution using agentic AI.
- [Ottogrid](https://ottogrid.ai/) - Spreadsheet based web agents to automate manual research.

### Dev Tools

- [Steel.dev](https://steel.dev) - Open-source headless browser API built specifically for AI agents and apps. ![GitHub Repo stars](https://img.shields.io/github/stars/steel-dev/steel-browser?style=social)
- [Omniparser](https://microsoft.github.io/OmniParser/) - Tool for parsing GUIs for vision based agents. ![GitHub Repo stars](https://img.shields.io/github/stars/microsoft/OmniParser?style=social)
- [LaVague](https://www.lavague.ai/) - Framework for natural language web automation. ![GitHub Repo stars](https://img.shields.io/github/stars/lavague-ai/LaVague?style=social)
- [Langchain Playwright toolkit](https://python.langchain.com/docs/integrations/tools/playwright/#use-within-an-agent) - Toolkit integration with AI agents.
- [Browserbase](https://browserbase.com) - A headless browser API for AI workflows.
- [Stagehand](https://www.stagehand.dev/) - AI web browsing framework. ![GitHub Repo stars](https://img.shields.io/github/stars/browserbase/stagehand?style=social)
- [Tarsier](https://github.com/reworkd/tarsier) - Vision utilities library for web interaction agents. ![GitHub Repo stars](https://img.shields.io/github/stars/reworkd/tarsier?style=social)
- [AutoGPT](https://github.com/Significant-Gravitas/AutoGPT) - Experimental agent for task completion and web browsing. ![GitHub Repo stars](https://img.shields.io/github/stars/Significant-Gravitas/AutoGPT?style=social)
- [Bytebot](https://github.com/bytebot-ai/bytebot) - Containerized computer use agent framework with a virtual desktop environment. ![GitHub Repo stars](https://img.shields.io/github/stars/bytebot-ai/bytebot?style=social)

## AI Web Scrapers/Crawlers

Web crawlers & scrapers that leverage AI to navigate websites and extract content.

- [FireCrawl](https://www.firecrawl.dev/) - APIs for turning websites into LLM-friendly markdown. ![GitHub Repo stars](https://img.shields.io/github/stars/mendableai/firecrawl?style=social)
- [Crawl4AI](https://crawl4ai.com) - Open-source LLM Friendly Web Crawler & Scraper. ![GitHub Repo stars](https://img.shields.io/github/stars/unclecode/crawl4ai?style=social)
- [ScrapeGraphAI](https://scrapegraphai.com/) - Python scraper based on AI. ![GitHub Repo stars](https://img.shields.io/github/stars/ScrapeGraphAI/Scrapegraph-ai?style=social)
- [WebAgent (OpenAgents)](https://github.com/xlang-ai/OpenAgents) - The web-browsing agent module of the OpenAgents platform (HKU). Enables autonomous navigation of websites via natural language, as part of a larger multi-modal agent framework. ![GitHub Repo stars](https://img.shields.io/github/stars/xlang-ai/OpenAgents?style=social)
- [Expand.ai](https://www.expand.ai/) - Turns any website into a type-safe API you can rely on.
- [LLM Scraper](https://github.com/mishushakov/llm-scraper) - Uses LLMs for intelligent scraping and content understanding. ![GitHub Repo stars](https://img.shields.io/github/stars/mishushakov/llm-scraper?style=social)
- [SpiderCreator](https://github.com/carlosplanchon/spidercreator) - Create complex Playwright spiders with natural language prompts. ![GitHub Repo stars](https://img.shields.io/github/stars/carlosplanchon/spidercreator?style=social)

## Web Search & Query Tools

Utilities that help agents search the web or query web data via natural language.

- [AgentQL](https://www.agentql.com/) - A query language and toolkit that makes the web AI-ready. ![GitHub Repo stars](https://img.shields.io/github/stars/tinyfish-io/agentql?style=social)
- [SerpAPI](https://serpapi.com/) - Search API that provides Google Search results for your agents.
- [Serper.dev](https://serper.dev/) - Performant and cost effective search API that provides Google Search results for your agents.
- [Jina.ai](https://jina.ai/) - Neural search platform for web data.
- [Exa.ai](https://exa.ai) - Semantic Search Engine for AI.

## Benchmarks & Research

Datasets, benchmarks, and notable research efforts for evaluating and advancing web-capable AI agents.

- [Web Agent Leaderboard](https://leaderboard.steel.dev) - Web agent leaderboard compiling different AI agent products and how they perform on the widely used WebVoyager benchmarks. ![GitHub Repo stars](https://img.shields.io/github/stars/steel-dev/leaderboard?style=social)
- [Web Games by Convergence](https://webgames.convergence.ai/) - a collection of challenges designed for testing general-purpose web-browsing AI agents. ![GitHub Repo stars](https://img.shields.io/github/stars/convergence-ai/webgames?style=social)
- [Bananalyzer](https://github.com/reworkd/bananalyzer) - An open-source evaluation framework for web-based AI agents. ![GitHub Repo stars](https://img.shields.io/github/stars/reworkd/bananalyzer?style=social)
- [Mind2Web](https://osu-nlp-group.github.io/Mind2Web) - A large-scale dataset for generalist web agents. ![GitHub Repo stars](https://img.shields.io/github/stars/OSU-NLP-Group/Mind2Web?style=social)
- [World of Bits: An Open-Domain Platform for Web-Based Agents](https://proceedings.mlr.press/v70/shi17a/shi17a.pdf) - OpenAI's research paper that introduces World or Bits: a platform where agents complete tasks on the internet by performing low-level keyboard and mouse actions.
- [MiniWoB++](https://miniwob.farama.org) - A classic suite of 104 mini web browser tasks in a synthetic environment. It's is an extension of the OpenAI MiniWoB benchmark. ![GitHub Repo stars](https://img.shields.io/github/stars/Farama-Foundation/miniwob-plusplus?style=social)
- [WebArena](https://webarena.dev) - A realistic, self-hostable web environment for autonomous agents. Includes official leaderboard tracking agent performance. ![GitHub Repo stars](https://img.shields.io/github/stars/web-arena-x/webarena?style=social)
- [WebCanvas](https://github.com/iMeanAI/WebCanvas) - An online evaluation framework for dynamic web environments. Tests agents on live websites. ![GitHub Repo stars](https://img.shields.io/github/stars/iMeanAI/WebCanvas?style=social)
- [WebGPT](https://openai.com/research/webgpt) - OpenAI's browser-assisted question-answering research project.
- [WebShop](https://webshop-pnlp.github.io) - A simulated e-commerce shopping environment with 1.18M real Amazon products. ![GitHub Repo stars](https://img.shields.io/github/stars/princeton-nlp/WebShop?style=social)
- [WebVoyager (Benchmark)](https://github.com/MinorJerry/WebVoyager) - Vision-enabled web agent using GPT-4V for real-world website interaction. ![GitHub Repo stars](https://img.shields.io/github/stars/MinorJerry/WebVoyager?style=social)
- [WorkArena](https://github.com/ServiceNow/WorkArena) - A suite of 33 browser-based tasks for enterprise "knowledge worker" scenarios. ![GitHub Repo stars](https://img.shields.io/github/stars/ServiceNow/WorkArena?style=social)
- [BrowserGym by ServiceNow](https://github.com/ServiceNow/BrowserGym) - A gym environment for web task automation. ![GitHub Repo stars](https://img.shields.io/github/stars/ServiceNow/BrowserGym?style=social)

## Tutorials & Guides

Resources for learning how to build, deploy, or utilize AI web agents.

- [LangGraph WebVoyager Tutorial](https://langchain-ai.github.io/langgraph/tutorials/web-navigation/web_voyager/) - Tutorial demonstrating how to build a web navigation agent using LangGraph Agents, Vision Models, and Web Voyager.
- [Build an AI Browser Agent](https://dzone.com/articles/build-ai-browser-agent-llms-playwright-browser-use) - Step-by-step guide to create an AI that browses the web using Playwright and the Browser-Use library.
- [Install & Run Browser-Use Locally](https://aleksandarhaber.com/install-and-run-browser-use-ai-agents-locally-using-ollama/) - Instructions on installing the open-source Browser-Use agent with a local LLM.
- [Build a Browser Agent with DeepSeek](https://nodeshift.com/blog/build-a-browser-use-agent-with-deepseek-a-step-by-step-guide) - Walks through deploying a Browser-Use web UI agent powered by the DeepSeek model on a cloud VM.

## Interested in implementing Steel?
Feel free to reach out at [team@steel.dev](mailto:team@steel.dev?subject=Hello%20from%20github!) or on [Discord](https://discord.gg/steel-dev).

Steel is an [open-source](https://github.com/steel-dev/steel-browser) browser API built specifically for AI agents. Get started for free [here](https://app.steel.dev).

## Join the Community

- Follow [@steeldotdev](https://x.com/steeldotdev) on X.
- Join the [Discord community](https://discord.gg/steel-dev).
- Feel free to reach out to us at [team@steel.dev](mailto:team@steel.dev?subject=Hello%20from%20github!)

## Contributing

[Contributions of any kind welcome, just follow the guidelines](contributing.md)!

### Contributors

[Thanks goes to these contributors](https://github.com/steel-dev/awesome-web-agents/graphs/contributors)!