https://github.com/datasets/football-datasets
Major Europe leagues data (England, Spain, Italy, Germany and France)
https://github.com/datasets/football-datasets
csv data datasets football open soccer
Last synced: 12 months ago
JSON representation
Major Europe leagues data (England, Spain, Italy, Germany and France)
- Host: GitHub
- URL: https://github.com/datasets/football-datasets
- Owner: datasets
- Created: 2018-08-15T08:07:57.000Z (over 7 years ago)
- Default Branch: main
- Last Pushed: 2025-03-28T02:23:36.000Z (12 months ago)
- Last Synced: 2025-03-28T03:24:15.253Z (12 months ago)
- Topics: csv, data, datasets, football, open, soccer
- Language: Python
- Homepage: https://datahub.io/collections/football
- Size: 1.18 MB
- Stars: 62
- Watchers: 8
- Forks: 25
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Football datasets
This repository includes 5 major Europe leagues:
- English Premier League – https://datahub.io/core/english-premier-league
- Spanish La Liga – https://datahub.io/core/spanish-la-liga
- Italian Serie A – https://datahub.io/core/italian-serie-a
- German Bundesliga – https://datahub.io/core/german-bundesliga
- French Ligue 1 – https://datahub.io/core/french-ligue-1
Each league has data for the all the seasons. The data is updated on daily basis via Github-Actions.
## Data
The data is sourced from the `https://www.football-data.co.uk/` website, datasets range starts from 1993 up to current year.
## Preparation
You need to have Python version >=3.5:
- Install requirements using `pip install -r scripts/requirements.txt`
- Run the script `python scripts/process.py`
- Update datapackage `pyhton scripts/process.py`
## Automation
Up-to-date (auto-updates every day) football dataset could be found on the datahub.io: https://datahub.io/core/football-datasets
## Packaging datasets
Each directory in `datasets/` directory is a data package. It has a common `schema.json` for all its resources. You need to run `python package.py` from root directory to generate `datapackage.json` for each data package.
## License
This Data Package is made available under the Public Domain Dedication and License v1.0 whose full text can be found at: http://www.opendatacommons.org/licenses/pddl/1.0/