https://github.com/jgperrin/net.jgp.labs.spark.football
Having fun with soccer stats and Spark
https://github.com/jgperrin/net.jgp.labs.spark.football
java java8 soccer spark sparkjava worldcup
Last synced: about 19 hours ago
JSON representation
Having fun with soccer stats and Spark
- Host: GitHub
- URL: https://github.com/jgperrin/net.jgp.labs.spark.football
- Owner: jgperrin
- License: apache-2.0
- Created: 2018-08-16T21:54:45.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2021-06-13T17:44:44.000Z (about 4 years ago)
- Last Synced: 2025-02-22T19:13:08.596Z (4 months ago)
- Topics: java, java8, soccer, spark, sparkjava, worldcup
- Language: Java
- Homepage:
- Size: 715 KB
- Stars: 2
- Watchers: 1
- Forks: 2
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Soccer analytics with Apache Spark
Playing with soccer datasets to illustrate some of Spark's features in Java.
If you are in interested in using Spark in Action, 2e, consider the **Spark in Action, 2e** book by Jean-Georges Perrin and published by Manning. Find out more about the book at: [https://www.manning.com/books/spark-with-java](https://www.manning.com/books/spark-with-java).
Designed for Apache Spark v3.0.0.
## Examples
Examples are making basic analytics with Spark, nothing you could not do without a SQL database, except maybe, the prediction of the winner of the soccer World Cup 2022.
## Datasets
Datasets used in those labs are coming from:
https://www.kaggle.com/jsppimentel99/coparussiajogos
* Cup.Russia.Matches.csv
* Cup.Russia.Teams.csvhttps://www.kaggle.com/abecklas/fifa-world-cup#WorldCups.csv
* WorldCupMatches.csv
* WorldCupPlayers.csv
* WorldCups.csv
https://data.world/sawya/football-world-cup-2018-dataset
* Fixture.csv
* Players_Score.csv
* Players_Stats.csv
* Players.csv
* Teams.csv