https://github.com/patilni3/etl-notes
Theoretical notes on Extract, Transform, Load (ETL)
datawarehouse etl etl-pipeline extract load schema transform
- Host: GitHub
- URL: https://github.com/patilni3/etl-notes
- Owner: PatilNi3
- Created: 2024-12-03T09:52:30.000Z
- Default Branch: main
- Last Pushed: 2024-12-03T09:58:13.000Z
- Last Synced: 2025-02-08T23:27:12.263Z
- Topics: datawarehouse, etl, etl-pipeline, extract, load, schema, transform
- Homepage:
- Size: 82 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# ETL
This repository contains theoretical notes on ETL (Extract, Transform, Load), the pipeline pattern used to manage, transform, and store data efficiently. The notes cover ETL concepts, processes, and best practices for data engineering.
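To make the three stages concrete, here is a minimal sketch of an ETL run in Python. It is illustrative only and not taken from this repository: the sample data, the `orders` table, and the function names (`extract`, `transform`, `load`, `run_pipeline`) are all assumptions, and a real pipeline would read from actual source systems and load into a proper data warehouse rather than an in-memory SQLite database.

```python
import csv
import io
import sqlite3

# Hypothetical raw source data; a real pipeline would read a file, API, or database.
RAW_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,
3,carol,7.25
"""


def extract(text):
    """Extract: read raw rows from the CSV source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows without an amount, normalise names, cast types."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # skip incomplete records
        cleaned.append(
            (int(row["order_id"]), row["customer"].strip().lower(), float(amount))
        )
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_pipeline(text, conn):
    """Run extract, transform, and load in order; return (row count, total amount)."""
    load(transform(extract(text)), conn)
    return conn.execute("SELECT COUNT(*), SUM(amount) FROM orders").fetchone()
```

Calling `run_pipeline(RAW_CSV, sqlite3.connect(":memory:"))` loads the two complete orders and skips the one with a missing amount. In an ELT variant, the raw rows would be loaded first and the cleaning step would run inside the target system instead.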
## Table of Contents
1. What is ETL?
2. Difference between ETL and ELT
3. Introduction to OLTP and OLAP
4. Difference between OLTP and OLAP
5. What are the ACID Properties?
6. What is a Data Warehouse? Why is a Data Warehouse Needed?
7. Data Warehouse Architecture
8. What are Data Marts and Their Types?
9. What is an ODS (Operational Data Store)?
10. What is Dimensional Modeling?
11. What are Facts and Dimensions?
12. What are Fact Tables and Dimension Tables?
13. Star, Snowflake, and Galaxy Schema
14. What are Normalization and Denormalization?
15. What are the Different Normal Forms?
16. Slowly Changing Dimension