https://github.com/zkan/building-data-pipelines-with-apache-airflow


# Building Data Pipelines with Apache Airflow

## Starting Airflow

```sh
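# Create the local folders for DAGs, logs, and plugins
mkdir -p mnt/dags mnt/logs mnt/plugins
# Record the host user ID in .env (plain echo; -e is not needed and not portable in sh)
echo "AIRFLOW_UID=$(id -u)" > .env
# Start the Airflow services
docker-compose up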
```
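
Before running `docker-compose up`, it can help to confirm that the folders and the `.env` file were actually created. The following is a minimal sketch, not part of the original project: `check_airflow_setup` is a hypothetical helper name, and it assumes you run it from the directory that holds `.env` and `mnt/`.

```sh
# Hypothetical helper: checks the results of the setup commands above.
check_airflow_setup() {
  # Each mounted folder should exist.
  for dir in mnt/dags mnt/logs mnt/plugins; do
    if [ ! -d "$dir" ]; then
      echo "missing directory: $dir" >&2
      return 1
    fi
  done
  # .env should contain a numeric AIRFLOW_UID entry.
  if ! grep -q '^AIRFLOW_UID=[0-9][0-9]*$' .env 2>/dev/null; then
    echo "AIRFLOW_UID is not set in .env" >&2
    return 1
  fi
  echo "setup looks good"
}
```

Calling `check_airflow_setup` prints `setup looks good` when everything is in place, and returns a non-zero status with a message on stderr otherwise.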

## Airflow S3 Connection to MinIO

Since MinIO offers S3-compatible object storage, we can set the connection type to S3. However, we need to set one extra option so that Airflow connects to MinIO instead of AWS S3.

- Connection Name: `minio` or any name you like
- Connection Type: S3
- Login: ``
- Password: ``
- Extra: a JSON object with the following properties:
```json
{
  "host": "http://minio:9000"
}
```
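
The same connection can probably be registered from the command line instead of the web UI. The sketch below wraps `airflow connections add` in a hypothetical helper, `add_minio_connection`. It assumes that the login and password are your MinIO access key and secret key, supplied through the illustrative variables `MINIO_ACCESS_KEY` and `MINIO_SECRET_KEY`, and that it runs where the `airflow` CLI is available (for example, inside one of the Airflow containers). Newer releases of the Amazon provider may expect the `aws` connection type and an `endpoint_url` key in place of `host`, so check the version you run.

```sh
# Hypothetical helper: registers the MinIO connection described above.
# Usage: add_minio_connection [connection-name]   (defaults to "minio")
add_minio_connection() {
  # Assumption: credentials come from these two environment variables.
  if [ -z "${MINIO_ACCESS_KEY:-}" ] || [ -z "${MINIO_SECRET_KEY:-}" ]; then
    echo "set MINIO_ACCESS_KEY and MINIO_SECRET_KEY first" >&2
    return 1
  fi
  # The extra JSON points Airflow at MinIO instead of AWS S3.
  airflow connections add "${1:-minio}" \
    --conn-type s3 \
    --conn-login "$MINIO_ACCESS_KEY" \
    --conn-password "$MINIO_SECRET_KEY" \
    --conn-extra '{"host": "http://minio:9000"}'
}
```

After exporting the two variables, calling `add_minio_connection` creates a connection named `minio`. Pass a different name as the first argument if you prefer another one.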