Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-web-scraper

A collection of awesome web scaper, crawler.
https://github.com/duyet/awesome-web-scraper

Last synced: 4 days ago
JSON representation

  • Java

    • websphinx - Website-Specific Processors for HTML INformation eXtraction.
  • C/C++

    • HTTrack - Highly extensible, highly scalable Web crawler. Pluggable parsing, protocols, storage and indexing.
  • C#

    • ccrawler - Built in C# 3.5 version. it contains a simple extention of web content categorizer, which can saparate between the web page depending on their content.
  • PHP

    • PHPCrawl - PHPCrawl is a framework for crawling/spidering websites written in PHP.
    • Crawler - A library for Rapid Web Crawler and Scraper Development.
    • Crawler - A library for Rapid Web Crawler and Scraper Development.
    • Crawler - A library for Rapid Web Crawler and Scraper Development.
    • Crawler - A library for Rapid Web Crawler and Scraper Development.
  • Contributing