https://github.com/pddiii/what-makes-a-hall-of-famer

MLB Hall of Fame Status Prediction Model

baseball baseball-analytics baseballr data-scraping mlb python r

National Baseball Hall of Fame Status Prediction Model using Random Forest and Gradient Boosted Decision Trees.

# Contributors

**All commits after 12/16/2023 were made by Peter D. DePaul III.**

- **Peter D. DePaul III** - Data Cleaning, Batter's Models, FanGraphs Batting and Pitching models, and Final Report

- **Nelson Duong** - Pitcher's Models & Exploratory Data Analysis

- **Jeffrey Gutierrez** - Final Report

- **Yuji Kusuyama** - Exploratory Data Analysis

- **Alan Wong** - Final Report

# Resources

- [tidyverse](https://www.tidyverse.org/)
- [tidymodels](https://www.tidymodels.org/)
- [Lahman Database for R](https://cran.r-project.org/web/packages/Lahman/index.html)
- [xgboost](https://cran.r-project.org/web/packages/xgboost/index.html)
- [doParallel](https://cran.r-project.org/web/packages/doParallel/index.html)
- [caret](https://cran.r-project.org/web/packages/caret/index.html)
- [vip](https://cran.r-project.org/web/packages/vip/index.html)

## Data Collection

[Baseball Reference Model Data]()

The "Data Cleaning.R" file contains the data cleaning and feature engineering process for both the Baseball Reference and FanGraphs models.
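
As a rough illustration of the kind of feature engineering a cleaning script like this might perform, the sketch below collapses season-level batting rows into one career row per player and derives a career batting average. It is plain Python using only the standard library. The field names (`playerID`, `AB`, `H`, `HR`) follow Lahman-style conventions but are assumptions here, the sample rows are made up, and the actual steps in "Data Cleaning.R" may differ.

```python
from collections import defaultdict


def career_totals(season_rows):
    """Collapse season-level batting rows into one career row per player.

    Each input row is a dict with hypothetical keys: playerID, AB, H, HR.
    """
    totals = defaultdict(lambda: {"AB": 0, "H": 0, "HR": 0, "seasons": 0})
    for row in season_rows:
        career = totals[row["playerID"]]
        for stat in ("AB", "H", "HR"):
            career[stat] += row[stat]
        career["seasons"] += 1

    result = {}
    for player_id, career in totals.items():
        # Guard against players with no at-bats to avoid dividing by zero.
        avg = career["H"] / career["AB"] if career["AB"] else 0.0
        result[player_id] = {**career, "AVG": round(avg, 3)}
    return result


# Made-up rows for illustration only; not real player data.
seasons = [
    {"playerID": "player_a", "AB": 500, "H": 150, "HR": 20},
    {"playerID": "player_a", "AB": 400, "H": 100, "HR": 10},
    {"playerID": "player_b", "AB": 0, "H": 0, "HR": 0},
]
careers = career_totals(seasons)
```

Aggregating to one row per player matters because Hall of Fame status is a per-career label, so season-level tables have to be reduced to career-level features before a classifier can be fit.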

[FanGraphs Model Data](fg_data.py)

## Data Dictionary

### Baseball Reference Model

[Batter's Dictionary](Dictionary/Batter_Variables.md)

[Pitcher's Dictionary](Dictionary/Pitcher_Variables.md)

### FanGraphs Model

[Batter's Dictionary](Dictionary/fg_Batter_variables.md)

[Pitcher's Dictionary](Dictionary/fg_Pitcher_variables.md)

# Full Project Report

[Baseball Hall of Fame Prediction](Final-Report.pdf)