https://github.com/kamanhang/healthcare-data-engineering-apache-spark-project
The primary goal of this project is to learn Apache Spark by using it to transform healthcare data, and to gain experience with essential data engineering tools, including Spark data transformations and AWS services such as S3.
- Host: GitHub
- URL: https://github.com/kamanhang/healthcare-data-engineering-apache-spark-project
- Owner: KamanHang
- Created: 2025-02-12T17:58:45.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-03-08T15:00:49.000Z (3 months ago)
- Last Synced: 2025-03-08T15:23:58.067Z (3 months ago)
- Topics: apache-spark, data-science, dataengineering, etl-pipeline, healthcare, healthcare-data
- Language: Jupyter Notebook
- Homepage:
- Size: 7.36 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# HealthCare Data Engineering Apache Spark Project
## Architecture Diagram
