https://github.com/vigneshss-07/data-engineering

This repo contains details related to Data Engineering tech stacks.

Topics: gcp, hadoop-hdfs, hive, pyspark, scala, spark, sql

# Data-Engineering

### Data Engineering Essentials Hands-on - SQL, Python and Spark

* Data Engineering Essentials Hands-on - SQL, Python, and Spark (PySpark and Spark SQL).
* Data Engineering, Spark, Hive, Python, PySpark, Scala, Coding framework, Testing, IntelliJ, Maven, Glue, Streaming.

# PySpark documentation

* https://spark.apache.org/docs/latest/api/python/
* https://spark.apache.org/docs/latest/api/python/getting_started/index.html
* https://github.com/apache/spark/tree/master/python/pyspark

### PySpark Tutorial

* https://www.learningjournal.guru/courses/spark/spark-foundation-training/jdbc-data-sources/
* Udemy - Data Engineering Essentials using SQL, Python, and PySpark - Section 49

# GitHub

* https://github.com/itversity/pyspark

### ETL PySpark

* https://github.com/itversity/etl-pyspark

### API Reference

* https://spark.apache.org/docs/latest/api/python/reference/index.html

# GitHub

* https://github.com/itversity
* Udemy course Data Engineering Essentials Hands-on - SQL, Python and Spark: https://github.com/itversity/data-engineering-spark