Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/manuel-lang/data-engineering-nanodegree
Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift, Data Lake with Spark and Data Pipeline with Airflow.
https://github.com/manuel-lang/data-engineering-nanodegree
airflow cassandra data-engineering postgresql redshift spark udacity udacity-data-engineer-nanodegree
Last synced: 2 months ago
JSON representation
Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift, Data Lake with Spark and Data Pipeline with Airflow.
- Host: GitHub
- URL: https://github.com/manuel-lang/data-engineering-nanodegree
- Owner: manuel-lang
- Created: 2020-05-10T13:29:39.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2022-10-20T21:19:06.000Z (over 2 years ago)
- Last Synced: 2023-02-26T19:33:32.118Z (almost 2 years ago)
- Topics: airflow, cassandra, data-engineering, postgresql, redshift, spark, udacity, udacity-data-engineer-nanodegree
- Language: Jupyter Notebook
- Homepage:
- Size: 2.42 MB
- Stars: 41
- Watchers: 1
- Forks: 32
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data Engineering Nanodegree
All projects of Udacity's [Data Engineering Nanodegree](https://www.udacity.com/course/data-engineer-nanodegree--nd027).## Projects
### Data Modeling
Learn to create relational and NoSQL data models to fit the diverse needs of data consumers. Use ETL to build databases in PostgreSQL and Apache Cassandra.[DATA MODELING WITH POSTGRES](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Modeling%20with%20Postgres)
[DATA MODELING WITH APACHE CASSANDRA](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Modeling%20with%20Cassandra)
### Cloud Data Warehouses
Sharpen your data warehousing skills and deepen your understanding of data infrastructure. Create cloud-based data warehouses on Amazon Web Services (AWS).[BUILD A CLOUD DATA WAREHOUSE](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Warehouse)
### Spark and Data Lakes
Understand the big data ecosystem and how to use Spark to work with massive datasets. Store big data in a data lake and query it with Spark.[BUILD A DATA LAKE](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Lake)
### Data Pipelines with Airflow
Schedule, automate, and monitor data pipelines using Apache Airflow. Run data quality checks, track data lineage, and work with data pipelines in production.[DATA PIPELINES WITH AIRFLOW](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Pipeline)
### Capstone Project
Combine what you've learned throughout the program to build your own data engineering portfolio project.[DATA ENGINEERING CAPSTONE](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Capstone%20Project)