Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mazzasaverio/doccrawl

Simple document crawler that harvests PDFs and documents from configured web sources.
https://github.com/mazzasaverio/doccrawl

asyncpg data-engineering docker logfire playwright postgresql pydantic-v2 python3 scrapegraphai

Last synced: 9 days ago
JSON representation

Simple document crawler that harvests PDFs and documents from configured web sources.

Awesome Lists containing this project

README

        

# doccrawl
Simple document crawler that harvests PDFs and documents from configured web sources.