https://github.com/urbanclimatefr/apache-spark
Apache Spark
https://github.com/urbanclimatefr/apache-spark
Last synced: 7 months ago
JSON representation
Apache Spark
- Host: GitHub
- URL: https://github.com/urbanclimatefr/apache-spark
- Owner: urbanclimatefr
- Created: 2024-11-07T17:56:14.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2024-11-07T17:56:57.000Z (11 months ago)
- Last Synced: 2025-01-17T18:56:56.887Z (9 months ago)
- Language: Jupyter Notebook
- Size: 15.6 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Apache Spark
Hi and welcome to the repository of the Learning Spark
You can find all the source codes of the Jupyter notebooks here.## Contents
- Why Spark
- How Spark Works
- Set up your dev environment
with Docker & Jupyter
- Work with DataFrames (JSON & CSV)
- Introduction into SparkSQL
- Coding with RDDs
- Conclusion## Source codes
- 01_JSON_Transformations
- 02_CSV_Schemas
- 03_Working_with_DataFrames
- 04_SparkSQL
- 05_Working_With_RDDs