https://github.com/kmohamedalie/taiwan-clients-default-of-credit-card
  
  
    Predicting Customer credit card default 
    https://github.com/kmohamedalie/taiwan-clients-default-of-credit-card
  
banking classification credit-card finance hyperparameter-tuning taiwan
        Last synced: 9 months ago 
        JSON representation
    
Predicting Customer credit card default
- Host: GitHub
 - URL: https://github.com/kmohamedalie/taiwan-clients-default-of-credit-card
 - Owner: Kmohamedalie
 - License: mit
 - Created: 2023-08-14T12:12:00.000Z (about 2 years ago)
 - Default Branch: master
 - Last Pushed: 2023-10-19T21:10:02.000Z (about 2 years ago)
 - Last Synced: 2025-01-02T15:32:02.819Z (10 months ago)
 - Topics: banking, classification, credit-card, finance, hyperparameter-tuning, taiwan
 - Language: Jupyter Notebook
 - Homepage: https://github.com/Kmohamedalie/Taiwan-Clients-Default-of-Credit-Card
 - Size: 4.24 MB
 - Stars: 0
 - Watchers: 1
 - Forks: 0
 - Open Issues: 0
 - 
            Metadata Files:
            
- Readme: README.md
 - License: LICENSE
 
 
Awesome Lists containing this project
README
          # Taiwan Clients Default of Credit Card (classification)
This research aimed at the case of customers default payments in Taiwan and compares the predictive accuracy of probability of default among six data mining methods, However in this self hands on project I only applied 2 alogrithms (SVM and Boosting machines).

 
**Dataset:** UCI Machine Learning,   Kaggle
**Complete JupyterNotebook:** [Link](https://github.com/Kmohamedalie/Taiwan-Clients-Default-of-Credit-Card/blob/master/Notebook/TaiwanClients%20Default%20of%20Credit%20Card%20-%20SnapML(Random%20Forest%20vs%20Boosting%20Machine)%20.ipynb)
**Metrics Achieved:**  
| Algorithm |Accuracy | f1-score |
| --------- | -----------| ------- |
| Boosting Machines | 82% |79.32% |
**Additional Information about the dataset**
This research aimed at the case of customers default payments in Taiwan and compares the predictive accuracy of probability of default among six data mining methods. From the perspective of risk management, the result of predictive accuracy of the estimated probability of default will be more valuable than the binary result of classification - credible or not credible clients. Because the real probability of default is unknown, this study presented the novel Sorting Smoothing Methods to estimate the real probability of default. With the real probability of default as the response variable (Y), and the predictive probability of default as the independent variable (X), the simple linear regression result (Y = A + BX) shows that the forecasting model produced by artificial neural network has the highest coefficient of determination; its regression intercept (A) is close to zero, and regression coefficient (B) to one. Therefore, among the six data mining techniques, artificial neural network is the only one that can accurately estimate the real probability of default.