https://github.com/motykatomasz/supercomputing-for-big-data
Repository for the project in Supercomputing for Big Data course at TU Delft. Apache Spark and Apache Kafka were used to perform both batched and stream processing on GDelt data.
https://github.com/motykatomasz/supercomputing-for-big-data
apache-kafka apache-spark big-data scala
Last synced: 6 months ago
JSON representation
Repository for the project in Supercomputing for Big Data course at TU Delft. Apache Spark and Apache Kafka were used to perform both batched and stream processing on GDelt data.
- Host: GitHub
- URL: https://github.com/motykatomasz/supercomputing-for-big-data
- Owner: motykatomasz
- Created: 2019-09-12T11:27:22.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2019-10-13T20:56:21.000Z (almost 6 years ago)
- Last Synced: 2025-02-08T03:47:11.729Z (8 months ago)
- Topics: apache-kafka, apache-spark, big-data, scala
- Language: Scala
- Homepage:
- Size: 3.04 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Supercomputing-For-Big-Data
Repository for the project in Supercomputing for Big Data course at TU Delft