https://github.com/bjam24/agh-large-scale-data-analysis

This repository contains projects made for the Large Scale Data Analysis course at AGH UST in 2024.

agh apache-spark apache-spark-cluster graphframes rdd spark-streaming sql structured-data


# Large Scale Data Analysis
This project was made for the Large Scale Data Analysis course at AGH UST in 2024/2025. All solutions are my own work, completed outside class hours while solving the assigned tasks (topics).
## Topics
### Project 1 - RDD

### Project 2 - DataFrame

### Project 3 - Apache Spark Cluster

https://github.com/user-attachments/assets/81e51d6d-1cfd-4d5b-bd97-2214580d5b67

### Project 4 - Spark Streaming

### Project 5 - GraphFrames

## Technology stack
- Python
- Apache Spark