https://github.com/lorien/awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.
List: awesome-web-scraping

captcha-bypass captcha-recaptcha crawler crawling crawling-framework crawling-python crawling-tool scraping scraping-framework scraping-python scraping-tool spider web-scraping webscraping

# Awesome Web Scraping

Lists of packages, services and manuals related to web scraping.

## Topics

* [Python](https://github.com/lorien/web-scraping/blob/master/python.md) - Python packages
* [PHP](https://github.com/lorien/web-scraping/blob/master/php.md) - PHP packages
* [Ruby](https://github.com/lorien/web-scraping/blob/master/ruby.md) - Ruby packages
* [JavaScript](https://github.com/lorien/web-scraping/blob/master/javascript.md) - JavaScript packages
* [Go](https://github.com/lorien/web-scraping/blob/master/golang.md) - Go packages
* [Web Scraping Manuals](https://github.com/lorien/awesome-web-scraping/blob/master/manuals.md) - list of articles and books teaching web scraping
* [dhamaniasad / HeadlessBrowsers](https://github.com/dhamaniasad/HeadlessBrowsers) - list of (almost) all headless web browsers in existence
* [DNS over HTTPS providers](https://github.com/curl/curl/wiki/DNS-over-HTTPS) - list of DNS over HTTPS providers
* [Awesome Pastebins](https://github.com/lorien/awesome-pastebins) - list of pastebin sites

## Captcha Solving Services

* [https://2captcha.com](https://2captcha.com/?from=3019071)

## Proxy Server Marketplaces

* https://www.blackhatworld.com/forums/proxies-for-sale.112/
* https://forum.antichat.com/forums/147/

## Telegram Discussion Groups

* [@grablab](https://t.me/grablab) - discussion in English
* [@grablab_ru](https://t.me/grablab_ru) - discussion in Russian

## How to Contribute to This List

See the [Contributing](https://github.com/lorien/web-scraping/blob/master/CONTRIBUTING.md) guide.

## Credits

The list was initially based on data from these sources: [awesome-python](https://github.com/vinta/awesome-python), [awesome-php](https://github.com/ziadoz/awesome-php), [awesome-ruby](https://github.com/markets/awesome-ruby), [ruby-nlp](https://github.com/diasks2/ruby-nlp), and [awesome-javascript](https://github.com/sorrycc/awesome-javascript).