An open API service indexing awesome lists of open source software.

https://github.com/abhinav330/credit-score-classification-problem

This project aims to analyze credit card customer data, clean the dataset using libraries such as Pandas and NumPy then create a machine learning model using sklearn
https://github.com/abhinav330/credit-score-classification-problem

credit-card credit-card-fraud credit-card-fraud-detection credit-risk credit-scoring data-science machinelearning numpy pandas python sklearn

Last synced: 28 days ago
JSON representation

This project aims to analyze credit card customer data, clean the dataset using libraries such as Pandas and NumPy then create a machine learning model using sklearn

Awesome Lists containing this project

README

          

[![Codacy Badge](https://app.codacy.com/project/badge/Grade/a6197298f3a143d591577dd7505ce3cd)](https://app.codacy.com/gh/Abhinav330/Creadit-Score-Classification-problem/dashboard?utm_source=gh&utm_medium=referral&utm_content=&utm_campaign=Badge_grade)
![GitHub Pipenv locked dependency version](https://img.shields.io/github/pipenv/locked/dependency-version/Abhinav330/Creadit-Score-Classification-problem/matplotlib?color=blue)
![GitHub Pipenv locked dependency version](https://img.shields.io/github/pipenv/locked/dependency-version/Abhinav330/Creadit-Score-Classification-problem/numpy?color=yellow)
![GitHub Pipenv locked dependency version](https://img.shields.io/github/pipenv/locked/dependency-version/Abhinav330/Creadit-Score-Classification-problem/pandas?color=silver)
![GitHub Pipenv locked dependency version](https://img.shields.io/github/pipenv/locked/dependency-version/Abhinav330/Creadit-Score-Classification-problem/scikit-learn?color=red)
![GitHub Pipenv locked dependency version](https://img.shields.io/github/pipenv/locked/dependency-version/Abhinav330/Creadit-Score-Classification-problem/scipy?color=red)
![GitHub Pipenv locked dependency version](https://img.shields.io/github/pipenv/locked/dependency-version/Abhinav330/Creadit-Score-Classification-problem/seaborn?color=green)
![GitHub Pipenv locked Python version](https://img.shields.io/github/pipenv/locked/python-version/Abhinav330/Creadit-Score-Classification-problem?color=dark%20green)
![GitHub repo size](https://img.shields.io/github/repo-size/Abhinav330/Creadit-Score-Classification-problem)

# Credit Card Customer Analysis

## Description

This project aims to analyze credit card customer data, clean the dataset using libraries such as Pandas and NumPy, and utilize advanced features of Pandas like `groupby` `df.astype(dtype)` `str.split(separator)` `replace(to_replace, value)` . The project is divided into three parts:

### Part 1: Data Cleaning

In this part, a Jupyter Notebook file (`Credit score classification_Data_Cleaning_part1.ipynb`) is provided, which focuses on cleaning the credit card customer dataset. The notebook utilizes Pandas and NumPy libraries to perform various data cleaning tasks such as handling missing values, removing duplicates, and standardizing data formats. The cleaned dataset serves as the foundation for subsequent analysis.

### Part 2: Exploratory Data Analysis

The second part of the project, Jupyter Notebook file (`Credit score classification_EDA_part2.ipynb`) involves exploratory data analysis (EDA) on the credit card customer data. The primary objective is to gain insights into the factors driving credit scores of customers. Various statistical and visual techniques will be employed to explore relationships, identify trends, and uncover patterns within the dataset.

### Part 3: Feature Engineering and ML Model Creation (In Progress)

The final part of the project, yet to be started, will focus on feature engineering and building a machine learning (ML) model. Feature engineering techniques will be applied to transform the existing dataset, creating new features that may improve the model's predictive power. A suitable ML algorithm will be selected and trained using the processed dataset to predict credit scores for new customers based on their provided information.

## Installation

1. Clone the repository:

```bash
[git clone https://github.com/your-username/credit-card-customer-analysis.git](https://github.com/Abhinav330/Data-Science-Projects.git)
```

2. Install the required dependencies:

```bash
pip install pandas numpy matplotlib seaborn scikit-learn
```

3. Run the Jupyter Notebook files in the following order:

- `Credit score classification_Data_Cleaning_part1.ipynb`
- `Credit score classification_EDA_part2.ipynb`

## Usage

To replicate the data cleaning process, follow the steps outlined in `Credit score classification_Data_Cleaning_part1.ipynb` and in `Credit score classification_EDA_part2.ipynb`. Modify the notebook as needed for your specific dataset.