https://github.com/vigneshss-07/data-engineering
This repo contains details related to Data Engineering tech stacks.
- Host: GitHub
- URL: https://github.com/vigneshss-07/data-engineering
- Owner: vigneshSs-07
- Created: 2022-02-09T14:57:21.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2022-12-09T16:55:54.000Z (over 3 years ago)
- Last Synced: 2025-06-28T23:44:25.473Z (about 1 year ago)
- Topics: gcp, hadoop-hdfs, hive, pyspark, scala, spark, sql
- Language: Jupyter Notebook
- Homepage:
- Size: 864 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Data-Engineering
### Data Engineering Essentials Hands-on - SQL, Python and Spark
* Hands-on Data Engineering essentials: SQL, Python, and Spark (PySpark and Spark SQL).
* Topics covered: Data Engineering, Spark, Hive, Python, PySpark, Scala, coding framework, testing, IntelliJ, Maven, Glue, and streaming.
# PySpark documentation
* https://spark.apache.org/docs/latest/api/python/
* https://spark.apache.org/docs/latest/api/python/getting_started/index.html
* https://github.com/apache/spark/tree/master/python/pyspark
### PySpark Tutorial
* https://www.learningjournal.guru/courses/spark/spark-foundation-training/jdbc-data-sources/
* Udemy - Data Engineering Essentials using SQL, Python, and PySpark - Section 49
# GitHub
* https://github.com/itversity/pyspark
### ETL PySpark
* https://github.com/itversity/etl-pyspark
### API Reference
* https://spark.apache.org/docs/latest/api/python/reference/index.html
# GitHub
* https://github.com/itversity
* Udemy course: Data Engineering Essentials Hands-on - SQL, Python and Spark
  - https://github.com/itversity/data-engineering-spark