Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/rkcosmos/santander-product-recommendation


https://github.com/rkcosmos/santander-product-recommendation

Last synced: 2 months ago
JSON representation

Awesome Lists containing this project

README

        

# Santander Product Recommendation

This is my first Kaggle competition. The best model is in the first top 7%. You can read the high-level explanation in Thai here (http://kittinaradorn.com/first_kaggle_competition/).

**Instruction:**

1. Download dataset from https://www.kaggle.com/c/santander-product-recommendation/data, unzip and put them into input folder.

2. Create preprocessed data by running all cells in the following notebooks
- MakeJuneExtraData.ipynb
- MakeJuneExtraDataMulticlass.ipynb
- MakeDataMulticlass2.ipynb
- MakeTestDatawithpast3.ipynb

3. Choose model to create prediction from the following notebooks:
- Baseline model: run MostProbableProductRecent2.ipynb
- Basic logistic regression: run CollaborativeFiltering.ipynb
- A bit more advanced logistic regression: run LogisticRegression2.ipynb
- Simple XGBoost model: run XGBmulticlass.ipynb
- XGBoost with feature engineering: run XGBmulticlass_withpast5.ipynb
- Basic Neural Network: run Keras1.ipynb
- Ensemble model: run Ensemble6_Keras1_XGB5_popular.ipynb

Special thanks to [anokas](https://www.kaggle.com/anokas) (for starter script), [BreakfastPirate](https://www.kaggle.com/breakfastpirate) (for contributing important insight to the community) and other people in the kaggle forum!