{"id":18261586,"url":"https://github.com/getmaxun/maxun","last_synced_at":"2026-01-23T17:15:26.342Z","repository":{"id":260261178,"uuid":"709027512","full_name":"getmaxun/maxun","owner":"getmaxun","description":"🔥 Open Source No Code Web Data Extraction Platform • Turn Websites To APIs \u0026 Spreadsheets With No-Code Robots In Minutes 🔥","archived":false,"fork":false,"pushed_at":"2025-05-08T13:00:32.000Z","size":4447,"stargazers_count":12521,"open_issues_count":85,"forks_count":973,"subscribers_count":72,"default_branch":"develop","last_synced_at":"2025-05-08T20:55:59.359Z","etag":null,"topics":["agents","api","automation","browser","browser-automation","data-extraction","no-code","no-code-web-scraper","playwright","robotic-process-automation","rpa","scraper","self-hosted","web-agent","web-automation","web-scraper","web-scraping","web-scraping-agent","webscraping","website-to-api"],"latest_commit_sha":null,"homepage":"https://www.maxun.dev","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"agpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/getmaxun.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":".github/CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2023-10-23T21:40:19.000Z","updated_at":"2025-05-08T19:18:43.000Z","dependencies_parsed_at":"2024-11-27T21:19:00.908Z","dependency_job_id":"39365e29-68a9-4a28-843d-e091226d9f6a","html_url":"https://github.com/getmaxun/maxun","commit_stats":{"total_commits":3841,"total_committers":11,"mean_commits":349.1818181818182,"dds":0.09034105701640194,"last_synced_commit":"5058a3b1331b754256c76a9c45c4d1e438333a9b"},"previous_names":["getmaxun/maxun"],"tags_count":15,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/getmaxun%2Fmaxun","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/getmaxun%2Fmaxun/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/getmaxun%2Fmaxun/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/getmaxun%2Fmaxun/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/getmaxun","download_url":"https://codeload.github.com/getmaxun/maxun/tar.gz/refs/heads/develop","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":253157021,"owners_count":21863046,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["agents","api","automation","browser","browser-automation","data-extraction","no-code","no-code-web-scraper","playwright","robotic-process-automation","rpa","scraper","self-hosted","web-agent","web-automation","web-scraper","web-scraping","web-scraping-agent","webscraping","website-to-api"],"created_at":"2024-11-05T11:03:49.671Z","updated_at":"2026-01-23T17:15:26.336Z","avatar_url":"https://github.com/getmaxun.png","language":"TypeScript","readme":"\u003ch2 align=\"center\"\u003e\n    \u003cdiv\u003e\n        \u003ca href=\"https://www.maxun.dev/?ref=ghread\"\u003e\n            \u003cimg src=\"/src/assets/maxunlogo.png\" width=\"70\" /\u003e\n            \u003cbr\u003e\n            Maxun\n        \u003c/a\u003e\n    \u003c/div\u003e\n    Transform the Web into Structured Intelligence\u003cbr\u003e\n\u003c/h2\u003e\n\n\u003cp align=\"center\"\u003e\n✨ Turn any website into clean, contextualized data pipelines for your AI applications ✨\n\n\u003cp align=\"center\"\u003e\n    \u003ca href=\"https://app.maxun.dev/?ref=ghread\"\u003e\u003cb\u003eGo To App\u003c/b\u003e\u003c/a\u003e •\n    \u003ca href=\"https://docs.maxun.dev/?ref=ghread\"\u003e\u003cb\u003eDocumentation\u003c/b\u003e\u003c/a\u003e •\n    \u003ca href=\"https://www.maxun.dev/?ref=ghread\"\u003e\u003cb\u003eWebsite\u003c/b\u003e\u003c/a\u003e •\n    \u003ca href=\"https://discord.gg/5GbPjBUkws\"\u003e\u003cb\u003eDiscord\u003c/b\u003e\u003c/a\u003e •\n    \u003ca href=\"https://www.youtube.com/@MaxunOSS?ref=ghread\"\u003e\u003cb\u003eWatch Tutorials\u003c/b\u003e\u003c/a\u003e\n    \u003cbr /\u003e\n    \u003cbr /\u003e\n\u003ca href=\"https://trendshift.io/repositories/12113\" target=\"_blank\"\u003e\u003cimg src=\"https://trendshift.io/api/badge/repositories/12113\" alt=\"getmaxun%2Fmaxun | Trendshift\" style=\"width: 250px; height: 55px; margin-top: 10px;\" width=\"250\" height=\"55\"/\u003e\u003c/a\u003e\n\u003c/p\u003e\n\n## What is Maxun?\n\nMaxun helps you transform websites into structured APIs, clean markdown for AI workflows, and production-ready data pipelines — all in minutes.\n\n### Ecosystem\n\n1. **[Extract](https://docs.maxun.dev/category/extract)** – Emulate real user behavior and collect structured data from any website.\n   * **[Recorder Mode](https://docs.maxun.dev/robot/extract/robot-actions)** - Record your actions as you browse; Maxun turns them into a reusable extraction robot.\n   * **[AI Mode](https://docs.maxun.dev/robot/extract/llm-extraction)** - Describe what you want in natural language and let LLM-powered extraction do the rest.\n\n2. **[Scrape](https://docs.maxun.dev/robot/scrape/scrape-robots)** – Convert full webpages into clean Markdown or HTML and capture screenshots.\n3. **[Crawl](https://docs.maxun.dev/robot/crawl/crawl-introduction)** - Crawl entire websites and extract content from every relevant page, with full control over scope and discovery.\n4. **[Search](https://docs.maxun.dev/robot/search/search-introduction)** - Run automated web searches to discover or scrape results, with support for time-based filters.\n5. **[SDK](https://docs.maxun.dev/sdk/sdk-overview)** – A complete developer toolkit for scraping, extraction, scheduling, and end-to-end data automation.\n\n## How Does It Work?\n\nMaxun robots are automated tools that help you collect data from websites without writing any code. Think of them as your personal web assistants that can navigate websites, extract information, and organize data just like you would manually - but faster and more efficiently.\n\nThere are four types of robots, each designed for a different job.\n\n### 1. Extract\nExtract emulates real user behavior and captures structured data.\n- \u003ca href=\"/robot/extract/robot-actions\"\u003eRecorder Mode\u003c/a\u003e - Record your actions as you browse; Maxun turns them into a reusable extraction robot.\n### Example: Extract 10 Property Listings from Airbnb\n\n[https://github.com/user-attachments/assets/recorder-mode-demo-video](https://github.com/user-attachments/assets/c6baa75f-b950-482c-8d26-8a8b6c5382c3)\n- \u003ca href=\"/robot/extract/llm-extraction\"\u003eAI Mode\u003c/a\u003e - Describe what you want in natural language and let LLM-powered extraction do the rest.\n### Example: Extract Names, Rating \u0026 Duration of Top 50 Movies from IMDb\n\nhttps://github.com/user-attachments/assets/f714e860-58d6-44ed-bbcd-c9374b629384\n\nLearn more \u003ca href=\"/category/extract\"\u003ehere\u003c/a\u003e.\n\n### 2. Scrape\nScrape converts full webpages into clean Markdown, HTML and can capture screenshots. Ideal for AI workflows, agents, and document processing. \n\nLearn more \u003ca href=\"https://docs.maxun.dev/robot/scrape/scrape-robots\"\u003ehere\u003c/a\u003e.\n\n### 3. Crawl\nCrawl entire websites and extract content from every relevant page, with full control over scope and discovery.\n\nLearn more \u003ca href=\"/robot/crawl/crawl-introduction\"\u003ehere\u003c/a\u003e.\n\n### 4. Search\nRun automated web searches to discover or scrape results, with support for time-based filters.\n\nLearn more \u003ca href=\"https://docs.maxun.dev/robot/search/search-introduction\"\u003ehere\u003c/a\u003e.\n\n## Quick Start\n\n### Getting Started\nThe simplest \u0026 fastest way to get started is to use the hosted version: https://app.maxun.dev. You can self-host if you prefer!\n\n### Installation\nMaxun can run locally with or without Docker\n1. [Setup with Docker Compose](https://docs.maxun.dev/installation/docker)\n2. [Setup without Docker](https://docs.maxun.dev/installation/local)\n3. [Environment Variables](https://docs.maxun.dev/installation/environment_variables)\n4. [SDK](https://github.com/getmaxun/node-sdk)\n\n### Upgrading \u0026 Self Hosting\n1. [Self Host Maxun With Docker \u0026 Portainer](https://docs.maxun.dev/self-host)\n2. [Upgrade Maxun With Docker Compose Setup](https://docs.maxun.dev/installation/upgrade#upgrading-with-docker-compose)\n3. [Upgrade Maxun Without Docker Compose Setup](https://docs.maxun.dev/installation/upgrade#upgrading-with-local-setup)\n\n## Sponsors\n\u003ctable\u003e\n  \u003ctr\u003e\n  \u003ctd width=\"229\"\u003e\n      \u003cbr/\u003e\n      \u003ca href=\"https://www.testmu.ai/?utm_source=maxun\u0026utm_medium=sponsor\" target=\"_blank\"\u003e\n        \u003cimg src=\"https://github.com/user-attachments/assets/6c96005b-85df-43e0-9b63-96aaca676c11\" /\u003e\u003cbr/\u003e\u003cbr/\u003e\n        \u003cb\u003eTestMu AI\u003c/b\u003e\n      \u003c/a\u003e\n      \u003cbr/\u003e\n      \u003csub\u003eThe Native AI-Agentic Cloud Platform to Supercharge Quality Engineering. Test Intelligently and Ship Faster.\n      \u003c/sub\u003e\n    \u003c/td\u003e\n  \u003c/tr\u003e\n\u003c/table\u003e\n\n## Features\n\n- ✨ **Extract Data With No-Code** – Point and click interface\n- ✨ **LLM-Powered Extraction** – Describe what you want; use LLMs to scrape structured data\n- ✨ **Developer SDK** – Programmatic extraction, scheduling, and robot management\n- ✨ **Handle Pagination \u0026 Scrolling** – Automatic navigation\n- ✨ **Run Robots On Schedules** – Set it and forget it\n- ✨ **Turn Websites to APIs** – RESTful endpoints from any site\n- ✨ **Turn Websites to Spreadsheets** – Direct data export to Google Sheets \u0026 Airtable\n- ✨ **Adapt To Website Layout Changes** – Auto-recovery from site updates\n- ✨ **Extract Behind Login** – Handle authentication seamlessly\n- ✨ **Integrations** – Connect with your favorite tools\n- ✨ **MCP Support** – Model Context Protocol integration\n- ✨ **LLM-Ready Data** – Clean Markdown for AI applications\n- ✨ **Self-Hostable** – Full control over your infrastructure\n- ✨ **Open Source** – Transparent and community-driven\n\n## Demos\nMaxun can be used for various use-cases, including lead generation, market research, content aggregation and more.\nView demos here: https://www.maxun.dev/usecases\n\n## Note\nThis project is in early stages of development. Your feedback is very important for us - we're actively working on improvements. \u003c/a\u003e\n\n## License\n\u003cp\u003e\nThis project is licensed under \u003ca href=\"./LICENSE\"\u003eAGPLv3\u003c/a\u003e.\n\u003c/p\u003e\n\n## Support Us\nStar the repository, contribute if you love what we’re building, or [sponsor us](https://github.com/sponsors/amhsirak). \n\n## Contributors\nThank you to the combined efforts of everyone who contributes!\n\n\u003ca href=\"https://github.com/getmaxun/maxun/graphs/contributors\"\u003e\n  \u003cimg src=\"https://contrib.rocks/image?repo=getmaxun/maxun\" /\u003e\n\u003c/a\u003e\n","funding_links":["https://github.com/sponsors/amhsirak"],"categories":["TypeScript","AI \u0026 LLM","⚙️ Backend \u0026 APIs","Repos","置顶","api","Low-Code/No-Code Platforms"],"sub_categories":["Agents \u0026 Orchestration","1、AI应用生态","Virtual Town AI"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgetmaxun%2Fmaxun","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgetmaxun%2Fmaxun","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgetmaxun%2Fmaxun/lists"}