https://github.com/pddiii/what-makes-a-hall-of-famer
MLB Hall of Fame Status Prediction Model
- Host: GitHub
- URL: https://github.com/pddiii/what-makes-a-hall-of-famer
- Owner: pddiii
- Created: 2023-12-01T00:09:32.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-08-22T22:20:43.000Z (4 months ago)
- Last Synced: 2024-08-23T00:00:43.382Z (4 months ago)
- Topics: baseball, baseball-analytics, baseballr, data-scraping, mlb, python, r
- Language: R
- Homepage:
- Size: 4.16 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
National Baseball Hall of Fame Status Prediction Model using Random Forest and Gradient Boosted Decision Trees.
# Contributors
**All commits made after 12/16/2023 are by Peter D. DePaul III.**
- **Peter D. DePaul III** - Data Cleaning, Batter's Models, FanGraphs Batting and Pitching Models, and Final Report
- **Nelson Duong** - Pitcher's Models & Exploratory Data Analysis
- **Jeffrey Gutierrez** - Final Report
- **Yuji Kusuyama** - Exploratory Data Analysis
- **Alan Wong** - Final Report
# Resources
- [tidyverse](https://www.tidyverse.org/)
- [tidymodels](https://www.tidymodels.org/)
- [Lahman Database for R](https://cran.r-project.org/web/packages/Lahman/index.html)
- [xgboost](https://cran.r-project.org/web/packages/xgboost/index.html)
- [doParallel](https://cran.r-project.org/web/packages/doParallel/index.html)
- [caret](https://cran.r-project.org/web/packages/caret/index.html)
- [vip](https://cran.r-project.org/web/packages/vip/index.html)

## Data Collection
[Baseball Reference Model Data]()
The "Data Cleaning.R" file contains the data cleaning and feature engineering process for both the Baseball Reference and FanGraphs models.
[FanGraphs Model Data](fg_data.py)
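
As a rough illustration of what a feature engineering step for a career-level model can look like, the sketch below collapses season-level rows into one row per player and derives a rate statistic from the totals. It is not taken from "Data Cleaning.R" or `fg_data.py`: the rows, the column names (`player_id`, `AB`, `H`, `HR`), and the `career_features` helper are all hypothetical, and the repository's own cleaning is done in R.

```python
from collections import defaultdict

# Hypothetical season-level rows; these column names are assumptions,
# not the repository's actual schema.
seasons = [
    {"player_id": "a01", "year": 2001, "AB": 500, "H": 150, "HR": 30},
    {"player_id": "a01", "year": 2002, "AB": 520, "H": 156, "HR": 25},
    {"player_id": "b02", "year": 2001, "AB": 300, "H": 60, "HR": 5},
]


def career_features(rows):
    """Collapse season rows into one feature row per player (illustrative helper)."""
    totals = defaultdict(lambda: {"seasons": 0, "AB": 0, "H": 0, "HR": 0})
    for row in rows:
        player = totals[row["player_id"]]
        player["seasons"] += 1
        for stat in ("AB", "H", "HR"):
            player[stat] += row[stat]

    features = {}
    for player_id, player in totals.items():
        # Career batting average from career totals, guarding against zero at-bats.
        avg = player["H"] / player["AB"] if player["AB"] else 0.0
        features[player_id] = {**player, "AVG": round(avg, 3)}
    return features


career = career_features(seasons)
print(career["a01"])
```

Computing rate statistics from career totals, rather than averaging per-season rates, keeps short seasons from being weighted the same as full ones.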
## Data Dictionary
### Baseball Reference Model
[Batter's Dictionary](Dictionary/Batter_Variables.md)
[Pitcher's Dictionary](Dictionary/Pitcher_Variables.md)
### FanGraphs Model
[Batter's Dictionary](Dictionary/fg_Batter_variables.md)
[Pitcher's Dictionary](Dictionary/fg_Pitcher_variables.md)
# Full Project Report
[Baseball Hall of Fame Prediction](Final-Report.pdf)