Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with data-lakestore
A curated list of projects in awesome lists tagged with data-lakestore .
https://github.com/pirate-emperor/bigdata-pipeline
BigData Pipeline is a local testing environment for experimenting with various storage solutions (RDB, HDFS), query engines (Trino), schedulers (Airflow), and ETL/ELT tools (DBT). It supports MySQL, Hadoop, Hive, Kudu, and more.
airflow airflow-dags airflow-docker big-data data-lake data-lakestore data-warehouse dbt dbt-core distributed-computing docker docker-compose hadoop hive hiveql kudu mysql mysql-server trino trino-cli
Last synced: 31 Jan 2025
https://github.com/ethanwebber123/airflow-etl-elt
Airflow-ETL-ELT is a robust data pipeline tool that enables efficient extraction, transformation, loading, and orchestration of data workflows. It offers a scalable and customizable solution for managing complex ETL and ELT processes with ease.
airflow-docker analytics data-lakestore data-warehouse dbt-core docker docker-compose hive hiveql kudu postgresql pyspark python spark
Last synced: 21 Jan 2025