Projects in Awesome Lists tagged with web-extraction
A curated list of projects in awesome lists tagged with web-extraction .
https://github.com/0xMassi/webclaw
Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.
ai ai-agents ai-scraping cli crawler data-extraction html-to-markdown llm markdown mcp mcp-server rust scraper self-hosted tls-fingerprinting web-crawler web-extraction web-scraper web-scraping webscraping
Last synced: 04 Apr 2026
https://github.com/lightfeed/browser-agent
Serverless AI browser agent
ai ai-agents automation aws-lambda browser browser-agent browser-automation crawling playwright scraping serverless serverless-framework web-crawling web-extraction web-scraping
Last synced: 20 Jun 2025
https://github.com/dorukardahan/nole
Free local web search/extraction router for AI agents. Go CLI + MCP, BYOK/free-first routing, keyless DDGS/Scrapling fallback, setup writers and client guides.
ai-agents byok cli go hermes-agent mcp mcp-server openclaw retrieval web-extraction web-search
Last synced: 31 May 2026
https://github.com/bharatpurohit97/webextractor
Extracting links from any website.
python selenium web-extraction
Last synced: 26 Apr 2026