https://github.com/davidfischer/gdx-downloader
A downloader for MLB Gameday raw data
https://github.com/davidfischer/gdx-downloader
Last synced: 3 months ago
JSON representation
A downloader for MLB Gameday raw data
- Host: GitHub
- URL: https://github.com/davidfischer/gdx-downloader
- Owner: davidfischer
- Created: 2013-08-08T02:14:43.000Z (almost 12 years ago)
- Default Branch: master
- Last Pushed: 2013-08-09T22:41:10.000Z (almost 12 years ago)
- Last Synced: 2025-01-03T07:03:44.360Z (5 months ago)
- Language: Shell
- Size: 133 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
GDX Downloader
==============This is a simple downloader for the data from the MLB
[Gameday website](http://gdx.mlb.com/components/game/mlb/).Usage
-----./gdxdownloader.sh
# Download a specific game
./gdxdownloader.sh http://gdx.mlb.com/components/game/mlb/year_2012/month_04/day_01/gid_2012_04_01_nyamlb_miamlb_1/
# Download all games for a specific day
./gdxdownloader.sh http://gdx.mlb.com/components/game/mlb/year_2012/month_04/day_01/
# Download all games for a specific month
./gdxdownloader.sh http://gdx.mlb.com/components/game/mlb/year_2012/month_04/Output goes to mlb/ in the current directory and starts at components/game/mlb.
If downloading a directory, the URL *MUST* end with a '/'. The downloader
goes [breadth-first](https://en.wikipedia.org/wiki/Breadth-first_search).Downloading a full day's data (~10 games) takes about 15 minutes and takes
about 20MB on disk.Directory structure
-------------------mlb/ # Major League Baseball
|-- year_(\d{4})/ # A specific year of games and player statistics
|-- batters/
| |-- (\d{6}).xml # Season stats for batters with the given MLB id
|-- month_(\d{2})/ # A specific month of games
| |-- day_(\d{2})/ # A specific day of games
| | |-- gid_(\d{4})_(\d{2})_(\d{2})_\w{3}mlb_\w{3}mlb_(\d) # YYYY_MM_DD, team abbrevs, doubleheader
| | | |-- batters/
| | | | |-- (\d{6}).xml # Batters' daily stats including player stats
| | | |-- bench.xml
| | | |-- boxscore.xml
| | | |-- game.xml
| | | |-- inning/
| | | | |-- inning_(\d+).xml # Contains atbats by inning (inc. pitch/fx)
| | | | |-- inning_hit.xml # Contains info about balls hit into play
| | | | |-- inning_Scores.xml # Contains info about Scoring plays
| | | |-- linescore.xml
| | | |-- pitchers/
| | | | |-- (\d{6}).xml # Pitchers' daily stats including player stats
| | | |-- players.xml
| | | |-- plays.xml
|--- pitchers/
|-- (\d{6}).xml # Season stats for pitchers with the given MLB id
aaa/ # Triple-A ball
aax/ # Double-A ball