https://github.com/gordon801/xgb-cc-scores
Using provided LendingClub credit card data and the output from an XGBoost model to produce XGB prediction scores for each data observation. (Python)
https://github.com/gordon801/xgb-cc-scores
Last synced: 2 months ago
JSON representation
Using provided LendingClub credit card data and the output from an XGBoost model to produce XGB prediction scores for each data observation. (Python)
- Host: GitHub
- URL: https://github.com/gordon801/xgb-cc-scores
- Owner: gordon801
- Created: 2021-07-19T06:05:46.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2021-07-29T16:45:40.000Z (almost 4 years ago)
- Last Synced: 2025-01-21T20:09:03.852Z (4 months ago)
- Language: Python
- Homepage:
- Size: 478 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# XGBoost Modelling - Credit Card Scores
In this project, I used provided LendingClub data and the output from an XGBoost model to produce XGB prediction scores for each observation in the LendingClub dataset.## Inputs
* creditcard_3001.csv: LendingClub credit card data with 3001 observations.
* xgb_tree_1000.txt: Text file dump from an XGBoost model containing 1000 ensemble trees.
## Program
* xgboost_process.pyThis program parses the input files, creates binary trees from the xgb_tree data, and defines a traversal algorithm. To obtain the overall prediction score for a particular observation, it sums the individual prediction scores from traversing each of the 1000 ensemble trees. It repeats this process for each of the 3001 observations and outputs the resultant prediction scores to 'test_result.csv'.
## Output
* test_result.csv: List of 3001 prediction scores corresponding to each LendingClub observation.