An open API service indexing awesome lists of open source software.

https://github.com/barlou/tools

Reusable Python tools for data engineering pipelines — cloud storage client (AWS S3, OVH), structured logging with cloud flush strategies, and Hive-partitioned Parquet/ORC archiving. Built for Airflow tasks and RL training workloads.
https://github.com/barlou/tools

airflow aws-s3 cloud-storage data-engineering github-actions logging orc ovh parquet python

Last synced: about 1 month ago
JSON representation

Reusable Python tools for data engineering pipelines — cloud storage client (AWS S3, OVH), structured logging with cloud flush strategies, and Hive-partitioned Parquet/ORC archiving. Built for Airflow tasks and RL training workloads.

Awesome Lists containing this project