https://github.com/zkan/building-data-pipelines-with-apache-airflow
Building Data Pipelines with Apache Airflow
https://github.com/zkan/building-data-pipelines-with-apache-airflow
apache-airflow data-pipelines docker
Last synced: 10 months ago
JSON representation
Building Data Pipelines with Apache Airflow
- Host: GitHub
- URL: https://github.com/zkan/building-data-pipelines-with-apache-airflow
- Owner: zkan
- License: cc0-1.0
- Created: 2022-09-23T03:21:25.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-09-25T14:54:53.000Z (over 3 years ago)
- Last Synced: 2024-04-14T13:48:40.140Z (about 2 years ago)
- Topics: apache-airflow, data-pipelines, docker
- Language: Dockerfile
- Homepage:
- Size: 10.6 MB
- Stars: 2
- Watchers: 2
- Forks: 11
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Building Data Pipelines with Apache Airflow
## Starting Airflow
```sh
mkdir -p mnt/dags mnt/logs mnt/plugins
echo -e "AIRFLOW_UID=$(id -u)" > .env
docker-compose up
```
## Airflow S3 Connection to MinIO
Since MinIO offers S3 compatible object storage, we can set the connection type to S3. However, we'll need to set an extra option, so that Airflow connects to MinIO instead of S3.
- Connection Name: `minio` or any name you like
- Connection Type: S3
- Login: ``
- Password: ``
- Extra: a JSON object with the following properties:
```json
{
"host": "http://minio:9000"
}
```