Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-web-scraping

Best scraping tools collection in town. Find everything you need for scraping, crawling, and processing data from the web
https://github.com/lukas-bear/awesome-web-scraping

Last synced: 5 days ago
JSON representation

  • Core Libraries

    • Python

    • JavaScript/Node.js

    • Java

    • Go

      • Colly
      • Fetchbot - A simple and flexible web crawler that follows the robots.txt policies and crawl delays.
      • Goquery - like API for parsing and manipulating HTML documents.
      • Rod - level browser automation framework powered by Chromium DevTools.
      • Playwright-go - headless browser automation.
      • Gocrawl - Polite, slim and concurrent web crawler.
      • Colly
      • Fetchbot - A simple and flexible web crawler that follows the robots.txt policies and crawl delays.
      • Goquery - like API for parsing and manipulating HTML documents.
      • Rod - level browser automation framework powered by Chromium DevTools.
      • Playwright-go - headless browser automation.
      • Gocrawl - Polite, slim and concurrent web crawler.
    • Ruby

    • PHP

      • DiDOM - A blazing-fast and easy-to-use HTML parser.
      • Crawler - A powerful library for rapid web scraping and crawling development.
      • DiDOM - A blazing-fast and easy-to-use HTML parser.
      • Goutte - A lightweight PHP web scraper for effortless data extraction.
      • Crawler - A powerful library for rapid web scraping and crawling development.