https://github.com/fbarffmann/credit-risk-classification

Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.
https://github.com/fbarffmann/credit-risk-classification

classification credit-risk data-analysis logistic-regression machine-learning model-evaluation pandas python scikit-learn

Last synced: about 22 hours ago
JSON representation

Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.

Host: GitHub
URL: https://github.com/fbarffmann/credit-risk-classification
Owner: fbarffmann
Created: 2024-09-14T15:05:38.000Z (7 months ago)
Default Branch: main
Last Pushed: 2025-04-13T16:30:32.000Z (11 days ago)
Last Synced: 2025-04-13T17:38:16.047Z (11 days ago)
Topics: classification, credit-risk, data-analysis, logistic-regression, machine-learning, model-evaluation, pandas, python, scikit-learn
Language: Jupyter Notebook
Homepage:
Size: 526 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Credit Risk Classification using Logistic Regression

Built and evaluated a machine learning model to classify loans as high-risk or healthy based on borrower data. Used logistic regression to predict loan status with a focus on precision and recall for high-risk loans.

## Tools & Technologies Used

- Python
- Pandas
- Scikit-learn (Logistic Regression)
- Classification Report
- Confusion Matrix
- Jupyter Notebooks

## File Structure

```text
.
├── Credit_Risk/
│ ├── credit_risk_classification.ipynb # Full model workflow
│ └── lending_data.csv # Loan dataset
```

## Skills Demonstrated

- Binary classification using Logistic Regression
- Data preparation and feature engineering
- Splitting data into training and testing sets
- Evaluating model accuracy, precision, recall, and F1-score
- Generating confusion matrix and classification report

## Key Findings

- Analyzed 19,384 loans, classifying them as healthy or high-risk.
- Model achieved 100% precision and F1-score for healthy loans (label 0).
- Model achieved 84% precision and 89% F1-score for high-risk loans (label 1), with 94% recall.
- Model tends to predict healthy loans extremely well but occasionally misclassifies high-risk loans as healthy.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/fbarffmann/credit-risk-classification

Awesome Lists containing this project

README