https://github.com/commoncrawl/ccbot-blocking-analysis



# ccbot blocking analysis

This repo contains a prototype that analyzes crawl output to quantify
how much blocking of CCBot is due to bot defenses versus disallow
rules in robots.txt files.
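
To make the distinction concrete, here is a minimal sketch of how a single fetch attempt might be sorted into one of the two buckets. It is not the code in this repo: the record layout (`url`, `status`, `robots_txt`), the function names, and the set of status codes treated as bot defenses are all assumptions chosen for illustration. Only `urllib.robotparser` from the Python standard library is used.

```python
from collections import Counter
from urllib.robotparser import RobotFileParser

# Assumption: these HTTP statuses are treated as signs of a bot defense.
# A real analysis would likely need more signals than the status code.
BOT_DEFENSE_STATUSES = {403, 406, 429, 503}


def robots_disallows(robots_txt: str, url: str, agent: str = "CCBot") -> bool:
    """Return True if the given robots.txt text disallows `agent` from `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(agent, url)


def classify(record: dict) -> str:
    """Label one hypothetical crawl record by the apparent cause of blocking."""
    if robots_disallows(record["robots_txt"], record["url"]):
        return "robots_disallow"
    if record["status"] in BOT_DEFENSE_STATUSES:
        return "bot_defense"
    return "not_blocked"


def summarize(records) -> Counter:
    """Count how many records fall into each category."""
    return Counter(classify(r) for r in records)


# Hypothetical sample records, not real crawl data.
SAMPLE = [
    {"url": "https://a.example/page", "status": 200,
     "robots_txt": "User-agent: CCBot\nDisallow: /\n"},
    {"url": "https://b.example/page", "status": 403,
     "robots_txt": "User-agent: *\nAllow: /\n"},
    {"url": "https://c.example/page", "status": 200, "robots_txt": ""},
]

if __name__ == "__main__":
    for label, count in summarize(SAMPLE).items():
        print(label, count)
```

The robots.txt check runs first in this sketch because a disallow rule is an explicit instruction from the site, while a status code alone is only indirect evidence of a bot defense.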

Please see OPEN-ATHENA-PILOT.md for an example analysis.