https://github.com/bjam24/agh-large-scale-data-analysis
This respository contains projects made for the Large Scale Data Analysis course at the AGH UST in 2024.
https://github.com/bjam24/agh-large-scale-data-analysis
agh apache-spark apache-spark-cluster graphframes rdd spark-streaming sql structured-data
Last synced: 7 months ago
JSON representation
This respository contains projects made for the Large Scale Data Analysis course at the AGH UST in 2024.
- Host: GitHub
- URL: https://github.com/bjam24/agh-large-scale-data-analysis
- Owner: bjam24
- License: other
- Created: 2024-11-11T00:14:34.000Z (about 1 year ago)
- Default Branch: master
- Last Pushed: 2025-02-20T20:27:20.000Z (11 months ago)
- Last Synced: 2025-07-01T10:02:47.329Z (7 months ago)
- Topics: agh, apache-spark, apache-spark-cluster, graphframes, rdd, spark-streaming, sql, structured-data
- Language: HTML
- Homepage:
- Size: 7.18 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Large Scale Data Analysis
This project was made for the Large Scale Data Analysis course at the AGH UST in 2024/2025. All solutions are results of my work after hours, when I was solving given tasks (topics).
## Topics
### Project 1 - RDD
### Project 3 - Apache Spark Cluster
https://github.com/user-attachments/assets/81e51d6d-1cfd-4d5b-bd97-2214580d5b67
### Project 4 - Spark Streaming
## Technology stack
- Python
- Apache Spark