An open API service indexing awesome lists of open source software.

https://github.com/mazzasaverio/pipeline-docs-data-extractor

(Let's build a) Robust pipeline for extracting structured data from various documents
https://github.com/mazzasaverio/pipeline-docs-data-extractor

airflow data-engineer data-engineering etl-pipeline large-language-models pdf-text-extraction unstructured

Last synced: 8 months ago
JSON representation

(Let's build a) Robust pipeline for extracting structured data from various documents

Awesome Lists containing this project