Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/teamclairvoyant/airflow-maintenance-dags
A series of DAGs/Workflows to help maintain the operation of Airflow
https://github.com/teamclairvoyant/airflow-maintenance-dags
airflow airflow-maintenance-dags apache-airflow cleanup dag maintenance workflow
Last synced: 2 days ago
JSON representation
A series of DAGs/Workflows to help maintain the operation of Airflow
- Host: GitHub
- URL: https://github.com/teamclairvoyant/airflow-maintenance-dags
- Owner: teamclairvoyant
- License: apache-2.0
- Created: 2016-12-15T16:04:00.000Z (about 8 years ago)
- Default Branch: master
- Last Pushed: 2024-06-18T05:33:47.000Z (7 months ago)
- Last Synced: 2025-01-16T14:24:54.559Z (9 days ago)
- Topics: airflow, airflow-maintenance-dags, apache-airflow, cleanup, dag, maintenance, workflow
- Language: Python
- Size: 684 KB
- Stars: 1,697
- Watchers: 87
- Forks: 398
- Open Issues: 29
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-apache-airflow - airflow-maintenance-dags - [Clairvoyant](http://clairvoyantsoft.com/) has a repo of Airflow DAGs that operator on Airflow itself, clearing out various bits of the backing metadata store. (Libraries, Hooks, Utilities)
README
# airflow-maintenance-dags
A series of DAGs/Workflows to help maintain the operation of Airflow## DAGs/Workflows
* [backup-configs](backup-configs)
* A maintenance workflow that you can deploy into Airflow to periodically take backups of various Airflow configurations and files.
* [clear-missing-dags](clear-missing-dags)
* A maintenance workflow that you can deploy into Airflow to periodically clean out entries in the DAG table of which there is no longer a corresponding Python File for it. This ensures that the DAG table doesn't have needless items in it and that the Airflow Web Server displays only those available DAGs.
* [db-cleanup](db-cleanup)
* A maintenance workflow that you can deploy into Airflow to periodically clean out the DagRun, TaskInstance, Log, XCom, Job DB and SlaMiss entries to avoid having too much data in your Airflow MetaStore.
* [kill-halted-tasks](kill-halted-tasks)
* A maintenance workflow that you can deploy into Airflow to periodically kill off tasks that are running in the background that don't correspond to a running task in the DB.
* This is useful because when you kill off a DAG Run or Task through the Airflow Web Server, the task still runs in the background on one of the executors until the task is complete.
* [log-cleanup](log-cleanup)
* A maintenance workflow that you can deploy into Airflow to periodically clean out the task logs to avoid those getting too big.
* [delete-broken-dags](delete-broken-dags)
* A maintenance workflow that you can deploy into Airflow to periodically delete DAG files and clean out entries in the ImportError table for DAGs which Airflow cannot parse or import properly. This ensures that the ImportError table is cleaned every day.
* [sla-miss-report](sla-miss-report)
* DAG providing an extensive analysis report of SLA misses broken down on a daily, hourly, and task level