https://github.com/devramsean0/wikipedia-crawler

A robots.txt-respecting web crawler that tracks how far the network of domains linked from Wikipedia extends.

# wikipedia-crawler
This is a robots.txt-compliant crawler that tries to find out how deep the network of links and pages stemming from "https://wikipedia.org/" goes.
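
To illustrate the general idea, here is a minimal Python sketch of a breadth-first crawl that checks robots.txt before fetching each page and records the depth at which each domain is first seen. It is not this project's implementation. The `crawl` function, the `LinkExtractor` class, the `example-crawler` user agent, and the injected `fetch(url)` callable are all hypothetical names chosen for the example, and treating any failure to fetch robots.txt as "everything allowed" is a simplifying assumption.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.robotparser import RobotFileParser


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed, fetch, user_agent="example-crawler", max_pages=100):
    """Breadth-first crawl from `seed`.

    `fetch(url)` is a hypothetical callable that returns the response body
    as text or raises an exception. Returns a dict mapping each domain to
    the link depth at which it was first seen.
    """
    robots = {}  # one parsed robots.txt per origin

    def allowed(url):
        parts = urlparse(url)
        origin = f"{parts.scheme}://{parts.netloc}"
        if origin not in robots:
            parser = RobotFileParser()
            try:
                parser.parse(fetch(origin + "/robots.txt").splitlines())
            except Exception:
                # Assumption: an unreachable robots.txt means no restrictions.
                parser.parse([])
            robots[origin] = parser
        return robots[origin].can_fetch(user_agent, url)

    depth_of = {urlparse(seed).netloc: 0}
    seen = {seed}
    queue = deque([(seed, 0)])
    pages = 0
    while queue and pages < max_pages:
        url, depth = queue.popleft()
        if not allowed(url):
            continue  # respect robots.txt: never fetch a disallowed URL
        try:
            html = fetch(url)
        except Exception:
            continue
        pages += 1
        extractor = LinkExtractor()
        extractor.feed(html)
        for href in extractor.links:
            link = urljoin(url, href).split("#")[0]  # resolve, drop fragment
            if urlparse(link).scheme not in ("http", "https") or link in seen:
                continue
            seen.add(link)
            depth_of.setdefault(urlparse(link).netloc, depth + 1)
            queue.append((link, depth + 1))
    return depth_of
```

Passing `fetch` in as a parameter keeps the sketch independent of any particular HTTP client and lets it be exercised against an in-memory dictionary of pages. A real crawl would supply a function that performs the HTTP request and would likely add rate limiting and persistent storage.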