Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/stefen-taime/etl-data-pipeline-rdbms-to-hdfs-using-airflow-apache-sqoop-spark-postgres-and-hive
This project aims to move the data from a Relational database system (RDBMS) to a Hadoop file system (HDFS)
https://github.com/stefen-taime/etl-data-pipeline-rdbms-to-hdfs-using-airflow-apache-sqoop-spark-postgres-and-hive
airflow big-data data docker-compose etl-pipeline hdfs hive infrastructure-as-code rdbms spark sql sqoop
Last synced: about 4 hours ago
JSON representation
This project aims to move the data from a Relational database system (RDBMS) to a Hadoop file system (HDFS)
- Host: GitHub
- URL: https://github.com/stefen-taime/etl-data-pipeline-rdbms-to-hdfs-using-airflow-apache-sqoop-spark-postgres-and-hive
- Owner: Stefen-Taime
- Created: 2022-04-29T13:48:01.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-04-29T14:05:26.000Z (over 2 years ago)
- Last Synced: 2023-05-03T03:21:58.585Z (over 1 year ago)
- Topics: airflow, big-data, data, docker-compose, etl-pipeline, hdfs, hive, infrastructure-as-code, rdbms, spark, sql, sqoop
- Language: Python
- Homepage:
- Size: 17.7 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0