An open API service indexing awesome lists of open source software.

https://github.com/rellyson/data-engineering-tools

This repository holds examples and documentation about the most used tools in the data engineering ecosystem.
https://github.com/rellyson/data-engineering-tools

apache-airflow apache-spark data data-engineering jupyter-notebook python tools

Last synced: 5 months ago
JSON representation

This repository holds examples and documentation about the most used tools in the data engineering ecosystem.

Awesome Lists containing this project

README

          



Exploring Data Engineering tools

[![License](https://img.shields.io/badge/License-MIT-b5dc10)](https://opensource.org/licenses/MIT)
![airflow](https://img.shields.io/badge/Apache%20Airflow-2.8.1-blue?logo=apacheairflow)
![spark](https://img.shields.io/badge/Apache%20Spark-3.5.0-orange?logo=apachespark)

## About

This repository holds examples and documentation
about the most used tools in the data engineering ecosystem.

## Running

This project contains `docker compose` files
which you can use to start some tool stack (ex: Spark using the `docker-compose-spark.yaml` file).