An open API service indexing awesome lists of open source software.

awesome-web-agents

🔥 A list of tools, frameworks, and resources for building AI web agents
https://github.com/steel-dev/awesome-web-agents

Last synced: 4 days ago
JSON representation

  • Autonomous Web Agents

      • Runner H - Runner H is a state-of-the-art AI agent that will allow anyone to automate complex, cumbersome, multi-step tasks without repetitive and manual input.
      • Surf.new - An open-source playground for chatting with different web agents. ![GitHub Repo stars](https://img.shields.io/github/stars/steel-dev/surf.new?style=social)
      • OpenAI Operator - OpenAI's AI agents that can browser the web for you.
      • Skyvern-AI - Framework to automate browser-based workflows. ![GitHub Repo stars](https://img.shields.io/github/stars/Skyvern-AI/skyvern?style=social)
      • Proxy by Convergence - Proxy is your AI-powered digital assistant that explores the web and executes tasks through simple conversation.
      • Google Project Mariner - A research prototype exploring the future of human-agent interaction, starting with your browser.
      • AgentGPT - Deploy autonomous AI agents in your browser. ![GitHub Repo stars](https://img.shields.io/github/stars/reworkd/AgentGPT?style=social)
      • doBrowser - An AI-powered Chrome extension that understands natural language and takes actions in your browser on your behalf.
      • Surf.new - An open-source playground for chatting with different web agents. ![GitHub Repo stars](https://img.shields.io/github/stars/steel-dev/surf.new?style=social)
      • OpenAI Operator - OpenAI's AI agents that can browser the web for you.
      • Skyvern-AI - Framework to automate browser-based workflows. ![GitHub Repo stars](https://img.shields.io/github/stars/Skyvern-AI/skyvern?style=social)
      • Proxy by Convergence - Proxy is your AI-powered digital assistant that explores the web and executes tasks through simple conversation.
      • Google Project Mariner - A research prototype exploring the future of human-agent interaction, starting with your browser.
      • Runner H - Runner H is a state-of-the-art AI agent that will allow anyone to automate complex, cumbersome, multi-step tasks without repetitive and manual input.
      • AgentGPT - Deploy autonomous AI agents in your browser. ![GitHub Repo stars](https://img.shields.io/github/stars/reworkd/AgentGPT?style=social)
      • Agent-E - Agent & framework with HTML DOM distillation. ![GitHub Repo stars](https://img.shields.io/github/stars/EmergenceAI/Agent-E?style=social)
      • Kura - Web Agents for the Enterprise.
      • Manus - A general AI agent that can execute long running tasks across tools like browsers, terminals, and text editors.
      • Yutori - A multi-agent system that executes browser-based tasks in parallel given a natural language prompt.
      • Automina - AI browser automation tool with natural language control.
      • rtrvr.ai - AI Web Agent Chrome Extension that autonomously does tasks, scrapes to Sheets, and calls API's – all with just prompts and your own browser!
      • Agent-E - Agent & framework with HTML DOM distillation. ![GitHub Repo stars](https://img.shields.io/github/stars/EmergenceAI/Agent-E?style=social)
      • Kura - Web Agents for the Enterprise.
      • Manus - A general AI agent that can execute long running tasks across tools like browsers, terminals, and text editors.
      • doBrowser - An AI-powered Chrome extension that understands natural language and takes actions in your browser on your behalf.
      • WebSurfer (Autogen) - MultimodalWebSurfer is a multimodal agent that can search the web and visit web pages. ![GitHub Repo stars](https://img.shields.io/github/stars/microsoft/autogen?style=social)
      • Magentic-One - A generalist multi-agent system for solving complex tasks including surfing the web via Autogen's MultimodalWebSurfer.
      • Harpa.ai - An AI-powered Chrome extension & browser agent that understands natural language and takes actions on your behalf.
      • Magentic-One - A generalist multi-agent system for solving complex tasks including surfing the web via Autogen's MultimodalWebSurfer.
      • Yutori - A multi-agent system that executes browser-based tasks in parallel given a natural language prompt.
      • Automina - AI browser automation tool with natural language control.
      • rtrvr.ai - AI Web Agent Chrome Extension that autonomously does tasks, scrapes to Sheets, and calls API's – all with just prompts and your own browser!
      • Nanobrowser - An open-source & local-first AI web agent Chrome extension with flexible LLM options and multi-agent system. ![GitHub Repo stars](https://img.shields.io/github/stars/nanobrowser/nanobrowser?style=social)
    • Computer-use Agents

      • Anthropic Computer Use - Computer use agent that can control your browser.
      • Self-Operating Computer Framework - A framework to enable multimodal models to operate a computer. ![GitHub Repo stars](https://img.shields.io/github/stars/OthersideAI/self-operating-computer?style=social)
      • Highlight - Highlight AI lets models understand your desktop activity. Get stuff done faster.
      • OpenInterpreter - An open-source CLI based agent that can write & execute code as well as control your browser. ![GitHub Repo stars](https://img.shields.io/github/stars/openinterpreter/open-interpreter?style=social)
      • UI-TARS - A GUI agent model designed to interact seamlessly with GUIs using human-like perception, reasoning, and action capabilities. ![GitHub Repo stars](https://img.shields.io/github/stars/bytedance/UI-TARS?style=social)
      • Anthropic Computer Use - Computer use agent that can control your browser.
      • Self-Operating Computer Framework - A framework to enable multimodal models to operate a computer. ![GitHub Repo stars](https://img.shields.io/github/stars/OthersideAI/self-operating-computer?style=social)
      • Highlight - Highlight AI lets models understand your desktop activity. Get stuff done faster.
      • OpenInterpreter - An open-source CLI based agent that can write & execute code as well as control your browser. ![GitHub Repo stars](https://img.shields.io/github/stars/openinterpreter/open-interpreter?style=social)
      • UI-TARS - A GUI agent model designed to interact seamlessly with GUIs using human-like perception, reasoning, and action capabilities. ![GitHub Repo stars](https://img.shields.io/github/stars/bytedance/UI-TARS?style=social)
    • Opera Agentic Feature - Opera announces a new agentic feature for its browser, showcasing innovative web agent integration.
    • Opera Agentic Feature - Opera announces a new agentic feature for its browser, showcasing innovative web agent integration.
  • Benchmarks & Research

    • Dev Tools

      • WebVoyager (Benchmark) - Vision-enabled web agent using GPT-4V for real-world website interaction. ![GitHub Repo stars](https://img.shields.io/github/stars/MinorJerry/WebVoyager?style=social)
      • Web Agent Leaderboard - Web agent leaderboard compiling different AI agent products and how they perform on the widely used WebVoyager benchmarks. ![GitHub Repo stars](https://img.shields.io/github/stars/steel-dev/leaderboard?style=social)
      • Web Games by Convergence - a collection of challenges designed for testing general-purpose web-browsing AI agents. ![GitHub Repo stars](https://img.shields.io/github/stars/convergence-ai/webgames?style=social)
      • Bananalyzer - An open-source evaluation framework for web-based AI agents. ![GitHub Repo stars](https://img.shields.io/github/stars/reworkd/bananalyzer?style=social)
      • Mind2Web - A large-scale dataset for generalist web agents. ![GitHub Repo stars](https://img.shields.io/github/stars/OSU-NLP-Group/Mind2Web?style=social)
      • Web Games by Convergence - a collection of challenges designed for testing general-purpose web-browsing AI agents. ![GitHub Repo stars](https://img.shields.io/github/stars/convergence-ai/webgames?style=social)
      • Bananalyzer - An open-source evaluation framework for web-based AI agents. ![GitHub Repo stars](https://img.shields.io/github/stars/reworkd/bananalyzer?style=social)
      • Mind2Web - A large-scale dataset for generalist web agents. ![GitHub Repo stars](https://img.shields.io/github/stars/OSU-NLP-Group/Mind2Web?style=social)
      • World of Bits: An Open-Domain Platform for Web-Based Agents - OpenAI's research paper that introduces World or Bits: a platform where agents complete tasks on the internet by performing low-level keyboard and mouse actions.
      • MiniWoB++ - A classic suite of 104 mini web browser tasks in a synthetic environment. It's is an extension of the OpenAI MiniWoB benchmark. ![GitHub Repo stars](https://img.shields.io/github/stars/Farama-Foundation/miniwob-plusplus?style=social)
      • WebCanvas - An online evaluation framework for dynamic web environments. Tests agents on live websites. ![GitHub Repo stars](https://img.shields.io/github/stars/iMeanAI/WebCanvas?style=social)
      • WebGPT - OpenAI's browser-assisted question-answering research project.
      • World of Bits: An Open-Domain Platform for Web-Based Agents - OpenAI's research paper that introduces World or Bits: a platform where agents complete tasks on the internet by performing low-level keyboard and mouse actions.
      • MiniWoB++ - A classic suite of 104 mini web browser tasks in a synthetic environment. It's is an extension of the OpenAI MiniWoB benchmark. ![GitHub Repo stars](https://img.shields.io/github/stars/Farama-Foundation/miniwob-plusplus?style=social)
      • WebShop - A simulated e-commerce shopping environment with 1.18M real Amazon products. ![GitHub Repo stars](https://img.shields.io/github/stars/princeton-nlp/WebShop?style=social)
      • WorkArena - A suite of 33 browser-based tasks for enterprise "knowledge worker" scenarios. ![GitHub Repo stars](https://img.shields.io/github/stars/ServiceNow/WorkArena?style=social)
      • BrowserGym by ServiceNow - A gym environment for web task automation. ![GitHub Repo stars](https://img.shields.io/github/stars/ServiceNow/BrowserGym?style=social)
      • WebCanvas - An online evaluation framework for dynamic web environments. Tests agents on live websites. ![GitHub Repo stars](https://img.shields.io/github/stars/iMeanAI/WebCanvas?style=social)
      • WebGPT - OpenAI's browser-assisted question-answering research project.
      • WebShop - A simulated e-commerce shopping environment with 1.18M real Amazon products. ![GitHub Repo stars](https://img.shields.io/github/stars/princeton-nlp/WebShop?style=social)
      • WebVoyager (Benchmark) - Vision-enabled web agent using GPT-4V for real-world website interaction. ![GitHub Repo stars](https://img.shields.io/github/stars/MinorJerry/WebVoyager?style=social)
      • WorkArena - A suite of 33 browser-based tasks for enterprise "knowledge worker" scenarios. ![GitHub Repo stars](https://img.shields.io/github/stars/ServiceNow/WorkArena?style=social)
      • BrowserGym by ServiceNow - A gym environment for web task automation. ![GitHub Repo stars](https://img.shields.io/github/stars/ServiceNow/BrowserGym?style=social)
  • AI Web Automation Tools

    • Computer-use Agents

      • Asteroid.ai - Hosted Browser Agents for SMEs to automate complex workflows. ![GitHub Repo stars](https://img.shields.io/github/stars/ishan0102/vimGPT?style=social)
      • PulsarRPA - AI-powered browser automation for data extraction. ![GitHub Repo stars](https://img.shields.io/github/stars/platonai/pulsarRPA?style=social)
      • Cekura.io - An AI browser agent that helps companies maintain up-to-date documentation.
      • Dex by Dexterity - An AI coworker embedding into and controlling your browser.
      • Autobrowser - A free, experimental Chrome extension that leverages Claude Computer Use to automate tasks in your browser.
      • Bytebot - Bytebot provides AI-powered scraping automations that evolve with your target sites.
      • Runcopycat - A no-code browser automation platform that turns screen recordings into reusable automated workflows.
      • PulsarRPA - AI-powered browser automation for data extraction. ![GitHub Repo stars](https://img.shields.io/github/stars/platonai/pulsarRPA?style=social)
      • VimGPT - Experimental project using GPT-4 Vision to browse the web via the Vimium extension. ![GitHub Repo stars](https://img.shields.io/github/stars/ishan0102/vimGPT?style=social)
      • Cekura.io - An AI browser agent that helps companies maintain up-to-date documentation.
      • Dex by Dexterity - An AI coworker embedding into and controlling your browser.
      • Starizon.ai - Browser assistant for web task automation.
      • Autobrowser - A free, experimental Chrome extension that leverages Claude Computer Use to automate tasks in your browser.
      • Runcopycat - A no-code browser automation platform that turns screen recordings into reusable automated workflows.
      • BrowserGPT - Browser extension for page summaries and Q&A.
      • Browse.ai - Chrome extension webscraping that can leverage AI for structured data extraction.
      • Strawberry Browser - A personal assistant that sits in your browser, automates repetitive web actions, learns your workflows.
      • Deta.surf - An integrated platform that combines a browser, file manager, and AI assistant with browser-level context.
      • Comet by Perplexity - An AI-powered browser by Perplexity. Not much more details out yet.
      • Dia Browser - Dia Browser is envisioned as an entirely new web browser built with AI at the center by The Browser Company (Arc).
      • Ottogrid - Spreadsheet based web agents to automate manual research.
      • Starizon.ai - Browser assistant for web task automation.
      • BrowserGPT - Browser extension for page summaries and Q&A.
      • Browse.ai - Chrome extension webscraping that can leverage AI for structured data extraction.
      • Strawberry Browser - A personal assistant that sits in your browser, automates repetitive web actions, learns your workflows.
      • Deta.surf - An integrated platform that combines a browser, file manager, and AI assistant with browser-level context.
      • Comet by Perplexity - An AI-powered browser by Perplexity. Not much more details out yet.
      • Dia Browser - Dia Browser is envisioned as an entirely new web browser built with AI at the center by The Browser Company (Arc).
      • Ottogrid - Spreadsheet based web agents to automate manual research.
    • Dev Tools

      • Langchain Playwright toolkit - Toolkit integration with AI agents.
      • Browserbase - A headless browser API for AI workflows.
      • Stagehand - AI web browsing framework. ![GitHub Repo stars](https://img.shields.io/github/stars/browserbase/stagehand?style=social)
      • AutoGPT - Experimental agent for task completion and web browsing. ![GitHub Repo stars](https://img.shields.io/github/stars/Significant-Gravitas/AutoGPT?style=social)
      • Bytebot - Containerized computer use agent framework with a virtual desktop environment. ![GitHub Repo stars](https://img.shields.io/github/stars/bytebot-ai/bytebot?style=social)
      • Steel.dev - Open-source headless browser API built specifically for AI agents and apps. ![GitHub Repo stars](https://img.shields.io/github/stars/steel-dev/steel-browser?style=social)
      • Omniparser - Tool for parsing GUIs for vision based agents. ![GitHub Repo stars](https://img.shields.io/github/stars/microsoft/OmniParser?style=social)
      • LaVague - Framework for natural language web automation. ![GitHub Repo stars](https://img.shields.io/github/stars/lavague-ai/LaVague?style=social)
      • Tarsier - Vision utilities library for web interaction agents. ![GitHub Repo stars](https://img.shields.io/github/stars/reworkd/tarsier?style=social)
      • Steel.dev - Open-source headless browser API built specifically for AI agents and apps. ![GitHub Repo stars](https://img.shields.io/github/stars/steel-dev/steel-browser?style=social)
      • AutoGPT - Experimental agent for task completion and web browsing. ![GitHub Repo stars](https://img.shields.io/github/stars/Significant-Gravitas/AutoGPT?style=social)
      • Omniparser - Tool for parsing GUIs for vision based agents. ![GitHub Repo stars](https://img.shields.io/github/stars/microsoft/OmniParser?style=social)
      • LaVague - Framework for natural language web automation. ![GitHub Repo stars](https://img.shields.io/github/stars/lavague-ai/LaVague?style=social)
      • Langchain Playwright toolkit - Toolkit integration with AI agents.
      • Bytebot - Containerized computer use agent framework with a virtual desktop environment. ![GitHub Repo stars](https://img.shields.io/github/stars/bytebot-ai/bytebot?style=social)
  • AI Web Scrapers/Crawlers

    • Dev Tools

      • FireCrawl - APIs for turning websites into LLM-friendly markdown. ![GitHub Repo stars](https://img.shields.io/github/stars/mendableai/firecrawl?style=social)
      • Crawl4AI - Open-source LLM Friendly Web Crawler & Scraper. ![GitHub Repo stars](https://img.shields.io/github/stars/unclecode/crawl4ai?style=social)
      • ScrapeGraphAI - Python scraper based on AI. ![GitHub Repo stars](https://img.shields.io/github/stars/ScrapeGraphAI/Scrapegraph-ai?style=social)
      • WebAgent (OpenAgents) - The web-browsing agent module of the OpenAgents platform (HKU). Enables autonomous navigation of websites via natural language, as part of a larger multi-modal agent framework. ![GitHub Repo stars](https://img.shields.io/github/stars/xlang-ai/OpenAgents?style=social)
      • FireCrawl - APIs for turning websites into LLM-friendly markdown. ![GitHub Repo stars](https://img.shields.io/github/stars/mendableai/firecrawl?style=social)
      • Crawl4AI - Open-source LLM Friendly Web Crawler & Scraper. ![GitHub Repo stars](https://img.shields.io/github/stars/unclecode/crawl4ai?style=social)
      • ScrapeGraphAI - Python scraper based on AI. ![GitHub Repo stars](https://img.shields.io/github/stars/ScrapeGraphAI/Scrapegraph-ai?style=social)
      • WebAgent (OpenAgents) - The web-browsing agent module of the OpenAgents platform (HKU). Enables autonomous navigation of websites via natural language, as part of a larger multi-modal agent framework. ![GitHub Repo stars](https://img.shields.io/github/stars/xlang-ai/OpenAgents?style=social)
      • Expand.ai - Turns any website into a type-safe API you can rely on.
      • LLM Scraper - Uses LLMs for intelligent scraping and content understanding. ![GitHub Repo stars](https://img.shields.io/github/stars/mishushakov/llm-scraper?style=social)
      • Expand.ai - Turns any website into a type-safe API you can rely on.
      • LLM Scraper - Uses LLMs for intelligent scraping and content understanding. ![GitHub Repo stars](https://img.shields.io/github/stars/mishushakov/llm-scraper?style=social)
      • SpiderCreator - Create complex Playwright spiders with natural language prompts. ![GitHub Repo stars](https://img.shields.io/github/stars/carlosplanchon/spidercreator?style=social)
    • Dev Tools

      • AgentQL - A query language and toolkit that makes the web AI-ready. ![GitHub Repo stars](https://img.shields.io/github/stars/tinyfish-io/agentql?style=social)
      • SerpAPI - Search API that provides Google Search results for your agents.
      • Serper.dev - Performant and cost effective search API that provides Google Search results for your agents.
      • Jina.ai - Neural search platform for web data.
      • AgentQL - A query language and toolkit that makes the web AI-ready. ![GitHub Repo stars](https://img.shields.io/github/stars/tinyfish-io/agentql?style=social)
      • Serper.dev - Performant and cost effective search API that provides Google Search results for your agents.
  • Tutorials & Guides

  • Interested in implementing Steel?

  • Join the Community