Data Engineering on Google Cloud Platform (GCP)
https://github.com/zkan/data-engineering-on-gcp

bigquery data-engineering data-lake data-pipeline data-warehouse gcs google-cloud-platform machine-learning


# Data Engineering on Google Cloud Platform (GCP)

### Table of Contents

* [Prerequisites](#prerequisites)
* [Course Modules](#course-modules)
* [Datasets](#datasets)

## Prerequisites

1. Python
1. SQL
1. Git & GitHub
1. Command-Line Interface (CLI)
1. Docker
1. Google Cloud Platform (GCP)

## Course Modules

1. [Introduction to Data Engineering](01-introduction-to-data-engineering)
1. [Building a Data Warehouse with BigQuery](02-building-a-data-warehouse-with-bigquery)
1. [Setting up a Data Lake using Google Cloud Storage](03-setting-up-data-lake-using-google-cloud-storage)
1. [Building Data Pipelines with Apache Airflow (Cloud Composer)](04-building-data-pipelines-with-apache-airflow-cloud-composer)
1. [Data Visualization with Looker Studio](05-data-visualization-with-looker-studio)
1. [Implementing Machine Learning Models in BigQuery ML](06-implementing-machine-learning-models-in-bigquery-ml)
1. [Real-world Capstone Project](07-real-world-capstone-project)

## Datasets

* [Breakfast at the Frat](https://github.com/zkan/open-data/tree/main/breakfast-at-the-frat)
* [Best Buy](https://github.com/zkan/open-data/tree/main/best-buy-apis)