https://github.com/radeity/2021-data-intensive-computing-pj
Final project for 2021 fall Theory and practice of data-intensive computing
https://github.com/radeity/2021-data-intensive-computing-pj
Last synced: 2 days ago
JSON representation
Final project for 2021 fall Theory and practice of data-intensive computing
- Host: GitHub
- URL: https://github.com/radeity/2021-data-intensive-computing-pj
- Owner: Radeity
- Created: 2021-12-30T07:00:08.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2022-06-09T03:28:51.000Z (about 4 years ago)
- Last Synced: 2025-03-05T04:18:43.261Z (over 1 year ago)
- Language: Python
- Size: 2.35 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# 2021-Data-Intensive-Computing-PJ
Final project for 2021 fall Theory and practice of data-intensive computing
Because of the big size of `eve.txt`, we don't provide it in `data`. It can be found in the following paper:
[A public data set of spatiotemporal match events in soccer competitions](https://www.nature.com/articles/s41597-019-0247-7)
**Make sure you have the Spark 3.x and Python 3.6+ environment.**
run `data/utils.py` for socket data sending
run `metrics.py` for streaming data processing
run `ui.py` for user query tasks