Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/datasets/football-datasets
Major Europe leagues data (England, Spain, Italy, Germany and France)
https://github.com/datasets/football-datasets
csv data datasets football open soccer
Last synced: 7 days ago
JSON representation
Major Europe leagues data (England, Spain, Italy, Germany and France)
- Host: GitHub
- URL: https://github.com/datasets/football-datasets
- Owner: datasets
- Created: 2018-08-15T08:07:57.000Z (over 6 years ago)
- Default Branch: main
- Last Pushed: 2025-01-16T02:10:10.000Z (15 days ago)
- Last Synced: 2025-01-16T15:08:17.386Z (14 days ago)
- Topics: csv, data, datasets, football, open, soccer
- Language: Python
- Homepage: https://datahub.io/collections/football
- Size: 1.24 MB
- Stars: 51
- Watchers: 9
- Forks: 23
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Football datasets
This repository includes 5 major Europe leagues:
- English Premier League – https://datahub.io/core/english-premier-league
- Spanish La Liga – https://datahub.io/core/spanish-la-liga
- Italian Serie A – https://datahub.io/core/italian-serie-a
- German Bundesliga – https://datahub.io/core/german-bundesliga
- French Ligue 1 – https://datahub.io/core/french-ligue-1Each league has data for the all the seasons. The data is updated on monthly basis via Github-Actions.
## Data
The data is sourced from the `https://www.football-data.co.uk/` website, datasets range starts from 1993 up to current year.
## Preparation
You need to have Python version >=3.5:
- Install requirements using `pip install -r scripts/requirements.txt`
- Run the script `python scripts/process.py`
- Update datapackage `pyhton scripts/process.py`## Automation
Up-to-date (auto-updates every month) football dataset could be found on the datahub.io: https://datahub.io/core/football-datasets
## Packaging datasets
Each directory in `datasets/` directory is a data package. It has a common `schema.json` for all its resources. You need to run `python package.py` from root directory to generate `datapackage.json` for each data package.
## License
This Data Package is made available under the Public Domain Dedication and License v1.0 whose full text can be found at: http://www.opendatacommons.org/licenses/pddl/1.0/