Projects in Awesome Lists by cocrawler
A curated list of projects in awesome lists by cocrawler.
https://github.com/cocrawler/cocrawler
CoCrawler is a versatile web crawler built using modern tools and concurrency.
aiohttp aiohttp-client async-python concurrency crawler pluggable-modules python3 screenshot warc
Last synced: 14 Dec 2025
https://github.com/cocrawler/cdx_toolkit
A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine.
cdx cdx-api commoncrawl python warc web-archives web-archiving
Last synced: 14 Dec 2025