Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/schochastics/football-data
football (soccer) datasets
https://github.com/schochastics/football-data
data-analysis data-science data-visualization dataset football-data rstats soccer-data
Last synced: about 2 months ago
JSON representation
football (soccer) datasets
- Host: GitHub
- URL: https://github.com/schochastics/football-data
- Owner: schochastics
- Created: 2020-03-16T22:12:19.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2024-09-14T17:42:08.000Z (3 months ago)
- Last Synced: 2024-10-12T22:14:18.809Z (2 months ago)
- Topics: data-analysis, data-science, data-visualization, dataset, football-data, rstats, soccer-data
- Language: R
- Size: 16.1 MB
- Stars: 45
- Watchers: 2
- Forks: 7
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
**This repository is made available under the [Open Data Commons Attribution License](https://opendatacommons.org/licenses/by/1-0/index.html)**
# football-data
This repository contains some datasets around football (soccer). The main dataset contains
results from ~1 million top-tier games.# Football Results Dataset
`data/results` contains results of 1,078,214 football games in 207 top-tier domestic leagues and
20 international tournaments (UEFA EuroLeague/ChampionsLeague,etc.) from 1888-2019. The files are
split up by competition but all follow the same scheme.## Codebook
| variable | description |
|----------------|-----------------------------------------------------------------|
| home | home team name (not necessarily unique) |
| away | away team name (not necessarily unique) |
| date | date of match |
| gh | goals for home team (including extra time and penalties) |
| ga | goals for away team (including extra time and penalties) |
| full_time | "F"=game ended in 90', "E"=extra time, "P"=penalty shoot-out |
| competition | country name of league or name of international competition |
| home_ident | unique identifier of home team |
| away_ident | unique identifier of away team |
| home_country | country of home team |
| away_country | country of away team |
| home_code | country code of home team |
| away_code | country code of away team |
| home_continent | continent of home team |
| away_continent | continent of away team |
| continent | continent of competition |
| level | "national"= domestic league, "international"= international cup |## Rscripts
The folder `Rscripts` contains some R code with basic analyses which could get you going:
- `01_basic_stats.R`: very simple stats (number of games, goals, etc)
- `02_soccerverse.R`: some code for figures on soccerverse.com## Disclaimer
This dataset definitely contains errors, especially for older games, where sources are not
as reliable. A big issue are teams that merge/split/dissolve over time, which (I think) I did
not resolve consistently over time.I have gathered this dataset over the course of 8 years and put a lot of effort in
it (see [worldclubratings.com](worldclubratings.com)). If you use the data for any kind of project, please drop me a line
or ping me on [twitter](https://twitter.com/schochastics). I hapilly include your results on worldclubratings.com too.# formations and lineups
The folder `data/formations` contains lineups and formations datasets
# goals and times
The folder `data/goals_time` contains the details about goal scorers in several European Leagues
# Links
[worldclubratings.com](http://worldclubratings.com/)
Similar datasets:
- [European Soccer Database](https://www.kaggle.com/hugomathien/soccer) (Kaggle)
- [World Soccer](https://www.kaggle.com/sashchernuh/european-football) (Kaggle)
- [International Football Results](https://www.kaggle.com/martj42/international-football-results-from-1872-to-2017) (Kaggle)
- [engsoccerdata](https://github.com/jalapic/engsoccerdata) (R package)
- [football-data.co.uk](http://football-data.co.uk/)