{"id":13621352,"url":"https://github.com/oleg-agapov/data-engineering-book","last_synced_at":"2026-01-29T04:23:18.549Z","repository":{"id":41188173,"uuid":"313285274","full_name":"oleg-agapov/data-engineering-book","owner":"oleg-agapov","description":"Accumulated knowledge and experience in the field of Data Engineering","archived":false,"fork":false,"pushed_at":"2022-11-22T19:44:52.000Z","size":19022,"stargazers_count":837,"open_issues_count":2,"forks_count":96,"subscribers_count":54,"default_branch":"master","last_synced_at":"2024-08-01T21:47:41.569Z","etag":null,"topics":["data","data-engineering","engineering"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/oleg-agapov.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2020-11-16T11:48:21.000Z","updated_at":"2024-08-01T18:28:58.000Z","dependencies_parsed_at":"2023-01-21T10:02:27.709Z","dependency_job_id":null,"html_url":"https://github.com/oleg-agapov/data-engineering-book","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oleg-agapov%2Fdata-engineering-book","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oleg-agapov%2Fdata-engineering-book/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oleg-agapov%2Fdata-engineering-book/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oleg-agapov%2Fdata-engineering-book/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/oleg-agapov","download_url":"https://codeload.github.com/oleg-agapov/data-engineering-book/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":223654615,"owners_count":17180545,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data","data-engineering","engineering"],"created_at":"2024-08-01T21:01:05.010Z","updated_at":"2026-01-29T04:23:18.233Z","avatar_url":"https://github.com/oleg-agapov.png","language":null,"funding_links":[],"categories":["Others","Uncategorized"],"sub_categories":["Uncategorized"],"readme":"# Data Engineering Book \n\n\u003cimg src=\"https://img.shields.io/badge/Status-Work_in_progress-yellow\"\u003e\n\n\n\u003e _Accumulated knowledge and experience in the field of Data Engineering_\n\n\u003cimg width=\"350\" src=\"cover.png\" alt=\"Data engineering book cover\"\u003e\n\n## Table of content\n\n1. [Introduction to Data Engineering](./book/1-introduction-to-data-engineering/README.md)\n    1. [What is Data Engineering?](./book/1-introduction-to-data-engineering/1.1-what-is-data-engineering.md)\n    2. [Data Engineering Roadmap?](./book/1-introduction-to-data-engineering/1.2-data-engineering-roadmap.md)\n    3. [Useful resources for Data Engineers](./book/1-introduction-to-data-engineering/1.3-useful-resources.md)\n2. [Beginner path](./book/2-beginner-path/README.md)\n    1. [Intro to databases](./book/2-beginner-path/2-1-databases/databases.md)\n    2. [SQL for beginners](./book/2-beginner-path/2-2-sql-for-beginners/sql-1.md)\n3. Big Data path\n4. Data Architect path\n\n## About this book\n\nThe book covers different aspects of Data Engineering, from basic topics like databases, SQL and ETL to advanced like data architecture and Big Data stacks.\n\nBut it is still under development. It has no strict set of topics I want to cover, but it will be pretty close to what I've described in my [Data Engineering Roadmap](/book/1-introduction-to-data-engineering/1.2-data-engineering-roadmap.md).\n\n## How to read this Book\n\nIf you are an absolute novice – start with [introduction to Data Engineering](./book/1-introduction-to-data-engineering/1.1-what-is-data-engineering.md). I will explain who are data engineers, what tasks they perform, which skill are required etc.\n\nIf you already decided to learn data engineering, but don't know where to start – head on to the [Data Engineering roadmap](./book/1-introduction-to-data-engineering/1.2-data-engineering-roadmap.md). There I show three paths you can take, from absolute beginner to advanced levels.\n\nLastly, if you know what exactly you want to learn then head to the [table of content](#table-of-content) and find the most interesting topics for you.\n\n## Updates\n- `2022-11-18`: Added [Advanced SQL](./book/2-beginner-path/2-2-sql-for-beginners/sql-2.md) chapter\n- `2021-05-15`: Re-write of [SQL for beginners](./book/2-beginner-path/2-2-sql-for-beginners/sql-1.md) chapter so it has more content and is much informative\n- `2021-03-02`: Added \"SQL for beginners\"\n- `2021-01-10`: Added [Intro to databases](./book/2-beginner-path/2-1-databases/databases.md)\n- `2020-12-06`: Added section [Useful resources for Data Engineers](./book/1-introduction-to-data-engineering/1.3-useful-resources.md)\n- `2020-11-16`: First section [Intro to Data Engineering](./book/1-introduction-to-data-engineering/README.md) released\n\n## Feedback\n\nIf you have any feedback or other questions, please refer to [this form](https://docs.google.com/forms/d/e/1FAIpQLSeYSxyQcNyXIyQeD1DtR6q2zHO7heGGUQ36PqW--XdRL01Wqg/viewform).\n\n## About author\n\nMy name is **Oleg Agapov** and I'm a BI and Analytics Engineer.\n\nI work with data, analytics, engineering and sometimes magic ✨\n\n**twitter**: [@oleg_agapov_](https://twitter.com/oleg_agapov_)\n\n## License\n\nYou may freely copy and distribute portions of this book as long as you give appropriate credit and indicate if changes were made. You cannot use this book for any commercial purpose.\n\nCopyright ©2020-2022 Oleg Agapov.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Foleg-agapov%2Fdata-engineering-book","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Foleg-agapov%2Fdata-engineering-book","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Foleg-agapov%2Fdata-engineering-book/lists"}