https://github.com/manuel-lang/data-engineering-nanodegree

Solutions to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift, Data Lake with Spark, and Data Pipeline with Airflow.
airflow cassandra data-engineering postgresql redshift spark udacity udacity-data-engineer-nanodegree

# Data Engineering Nanodegree
All projects of Udacity's [Data Engineering Nanodegree](https://www.udacity.com/course/data-engineer-nanodegree--nd027).

## Projects

### Data Modeling
Learn to create relational and NoSQL data models to fit the diverse needs of data consumers. Use ETL to build databases in PostgreSQL and Apache Cassandra.

[DATA MODELING WITH POSTGRES](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Modeling%20with%20Postgres)

[DATA MODELING WITH APACHE CASSANDRA](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Modeling%20with%20Cassandra)
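
As a rough illustration of the relational side of this project, the sketch below builds a tiny star schema (one dimension table, one fact table) and loads raw events into it. It is not the repository's code: the table names, columns, and sample events are hypothetical, and Python's built-in `sqlite3` stands in for PostgreSQL so the example runs without a database server. With PostgreSQL, a driver such as `psycopg2` would use `%s` placeholders instead of `?`.

```python
import sqlite3

# Hypothetical star schema: one dimension table (users) and one fact table (plays).
SCHEMA = """
CREATE TABLE IF NOT EXISTS users (
    user_id INTEGER PRIMARY KEY,
    name    TEXT NOT NULL,
    level   TEXT NOT NULL
);
CREATE TABLE IF NOT EXISTS plays (
    play_id    INTEGER PRIMARY KEY,
    user_id    INTEGER NOT NULL REFERENCES users (user_id),
    song_title TEXT NOT NULL,
    played_at  TEXT NOT NULL
);
"""


def load_events(conn, events):
    """Split raw event dicts into dimension and fact rows and insert them."""
    for event in events:
        # Dimension row: keep the first version seen for each user_id.
        conn.execute(
            "INSERT INTO users (user_id, name, level) VALUES (?, ?, ?) "
            "ON CONFLICT (user_id) DO NOTHING",
            (event["user_id"], event["name"], event["level"]),
        )
        # Fact row: one row per play event.
        conn.execute(
            "INSERT INTO plays (user_id, song_title, played_at) VALUES (?, ?, ?)",
            (event["user_id"], event["song_title"], event["played_at"]),
        )
    conn.commit()


def plays_per_user(conn):
    """Answer an analytical question by joining the fact table to its dimension."""
    return conn.execute(
        "SELECT u.name, COUNT(*) FROM plays p "
        "JOIN users u ON p.user_id = u.user_id "
        "GROUP BY u.user_id, u.name ORDER BY u.name"
    ).fetchall()


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
load_events(conn, [
    {"user_id": 1, "name": "Ada", "level": "paid", "song_title": "Song A", "played_at": "2018-11-01T10:00:00"},
    {"user_id": 1, "name": "Ada", "level": "paid", "song_title": "Song B", "played_at": "2018-11-01T10:05:00"},
    {"user_id": 2, "name": "Grace", "level": "free", "song_title": "Song A", "played_at": "2018-11-02T09:30:00"},
])
print(plays_per_user(conn))  # [('Ada', 2), ('Grace', 1)]
```

In Apache Cassandra the approach is usually reversed: tables are designed per query, with the partition key chosen from the query's `WHERE` clause, rather than normalized and joined at read time.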

### Cloud Data Warehouses
Sharpen your data warehousing skills and deepen your understanding of data infrastructure. Create cloud-based data warehouses on Amazon Web Services (AWS).

[BUILD A CLOUD DATA WAREHOUSE](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Warehouse)
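
A common pattern for loading a Redshift warehouse is to bulk-load raw files from S3 into staging tables with `COPY`, then transform them into analytics tables with `INSERT ... SELECT`. The sketch below only renders such a `COPY` statement as a string. The function name, table name, bucket, and IAM role ARN are hypothetical placeholders, not values from the repository.

```python
def build_copy_statement(table, s3_path, iam_role_arn, region="us-west-2"):
    """Render a Redshift COPY statement that loads JSON files from S3.

    The arguments are interpolated directly into SQL, so they should come
    from trusted configuration, never from user input.
    """
    return (
        f"COPY {table}\n"
        f"FROM '{s3_path}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        f"FORMAT AS JSON 'auto'\n"
        f"REGION '{region}';"
    )


# Hypothetical staging table, bucket, and role, for illustration only.
statement = build_copy_statement(
    table="staging_events",
    s3_path="s3://example-bucket/events/",
    iam_role_arn="arn:aws:iam::123456789012:role/example-redshift-role",
)
print(statement)
```

The rendered statement would then be executed against the cluster through a PostgreSQL-compatible driver. Keeping the SQL in a small builder function makes it easy to reuse for several staging tables.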

### Spark and Data Lakes
Understand the big data ecosystem and how to use Spark to work with massive datasets. Store big data in a data lake and query it with Spark.

[BUILD A DATA LAKE](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Lake)
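
One way to store data in a lake so that Spark can query it efficiently is to write Parquet files partitioned by columns that queries filter on. The sketch below assumes a PySpark DataFrame with `year` and `month` columns. The function names and the lake path are hypothetical, and the Spark functions are only defined here, not called, so the block runs without a Spark installation.

```python
def partition_dir(base_path, year, month):
    """Return the Hive-style directory Spark uses for one (year, month) partition."""
    return f"{base_path.rstrip('/')}/year={year}/month={month}"


def write_partitioned(df, base_path):
    """Write a Spark DataFrame as Parquet, partitioned by year and month.

    `df` is assumed to be a pyspark.sql.DataFrame with `year` and `month` columns.
    """
    df.write.mode("overwrite").partitionBy("year", "month").parquet(base_path)


def read_month(spark, base_path, year, month):
    """Read the lake back and keep a single month.

    `spark` is assumed to be a pyspark.sql.SparkSession. Because the filter is
    on partition columns, Spark can skip the directories of other partitions.
    """
    return spark.read.parquet(base_path).where(f"year = {year} AND month = {month}")


# Hypothetical lake location, for illustration only.
print(partition_dir("s3a://example-lake/songplays/", 2018, 11))
# s3a://example-lake/songplays/year=2018/month=11
```

Partitioning by the columns used in typical filters keeps each query from scanning the whole dataset, at the cost of more, smaller files.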

### Data Pipelines with Airflow
Schedule, automate, and monitor data pipelines using Apache Airflow. Run data quality checks, track data lineage, and work with data pipelines in production.

[DATA PIPELINES WITH AIRFLOW](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Data%20Pipeline)
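
A data quality check can be as simple as verifying that a load step actually produced rows. The sketch below shows such a check as a plain Python function. The function and table names are hypothetical, and `sqlite3` stands in for the warehouse connection so the example runs on its own. In Airflow, a function like this could be passed as the `python_callable` of a `PythonOperator`; an exception raised by the callable marks the task as failed.

```python
import sqlite3


def check_has_rows(conn, table):
    """Raise ValueError if `table` is empty; return the row count otherwise."""
    # Table names cannot be bound as query parameters, so pass trusted names only.
    count = conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    if count == 0:
        raise ValueError(f"Data quality check failed: {table} has no rows")
    return count


# Hypothetical tables: one populated by the pipeline, one left empty.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE songplays (id INTEGER PRIMARY KEY)")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY)")
conn.execute("INSERT INTO songplays DEFAULT VALUES")

print(check_has_rows(conn, "songplays"))  # 1

try:
    check_has_rows(conn, "users")
except ValueError as error:
    print(error)  # Data quality check failed: users has no rows
```

Failing loudly, rather than logging a warning, stops downstream tasks from running on incomplete data.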

### Capstone Project
Combine what you've learned throughout the program to build your own data engineering portfolio project.

[DATA ENGINEERING CAPSTONE](https://github.com/manuel-lang/Udacity-Data-Engineering-Nanodegree/tree/master/Capstone%20Project)