https://github.com/abhinav330/credit-score-classification-problem
This project aims to analyze credit card customer data, clean the dataset using libraries such as Pandas and NumPy then create a machine learning model using sklearn
https://github.com/abhinav330/credit-score-classification-problem
credit-card credit-card-fraud credit-card-fraud-detection credit-risk credit-scoring data-science machinelearning numpy pandas python sklearn
Last synced: 28 days ago
JSON representation
This project aims to analyze credit card customer data, clean the dataset using libraries such as Pandas and NumPy then create a machine learning model using sklearn
- Host: GitHub
- URL: https://github.com/abhinav330/credit-score-classification-problem
- Owner: Abhinav330
- Created: 2024-08-25T22:29:55.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-08-31T00:45:55.000Z (about 1 year ago)
- Last Synced: 2025-06-22T03:38:03.706Z (4 months ago)
- Topics: credit-card, credit-card-fraud, credit-card-fraud-detection, credit-risk, credit-scoring, data-science, machinelearning, numpy, pandas, python, sklearn
- Language: Jupyter Notebook
- Homepage:
- Size: 864 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
[](https://app.codacy.com/gh/Abhinav330/Creadit-Score-Classification-problem/dashboard?utm_source=gh&utm_medium=referral&utm_content=&utm_campaign=Badge_grade)







# Credit Card Customer Analysis
## Description
This project aims to analyze credit card customer data, clean the dataset using libraries such as Pandas and NumPy, and utilize advanced features of Pandas like `groupby` `df.astype(dtype)` `str.split(separator)` `replace(to_replace, value)` . The project is divided into three parts:
### Part 1: Data Cleaning
In this part, a Jupyter Notebook file (`Credit score classification_Data_Cleaning_part1.ipynb`) is provided, which focuses on cleaning the credit card customer dataset. The notebook utilizes Pandas and NumPy libraries to perform various data cleaning tasks such as handling missing values, removing duplicates, and standardizing data formats. The cleaned dataset serves as the foundation for subsequent analysis.
### Part 2: Exploratory Data Analysis
The second part of the project, Jupyter Notebook file (`Credit score classification_EDA_part2.ipynb`) involves exploratory data analysis (EDA) on the credit card customer data. The primary objective is to gain insights into the factors driving credit scores of customers. Various statistical and visual techniques will be employed to explore relationships, identify trends, and uncover patterns within the dataset.
### Part 3: Feature Engineering and ML Model Creation (In Progress)
The final part of the project, yet to be started, will focus on feature engineering and building a machine learning (ML) model. Feature engineering techniques will be applied to transform the existing dataset, creating new features that may improve the model's predictive power. A suitable ML algorithm will be selected and trained using the processed dataset to predict credit scores for new customers based on their provided information.
## Installation
1. Clone the repository:
```bash
[git clone https://github.com/your-username/credit-card-customer-analysis.git](https://github.com/Abhinav330/Data-Science-Projects.git)
```2. Install the required dependencies:
```bash
pip install pandas numpy matplotlib seaborn scikit-learn
```3. Run the Jupyter Notebook files in the following order:
- `Credit score classification_Data_Cleaning_part1.ipynb`
- `Credit score classification_EDA_part2.ipynb`## Usage
To replicate the data cleaning process, follow the steps outlined in `Credit score classification_Data_Cleaning_part1.ipynb` and in `Credit score classification_EDA_part2.ipynb`. Modify the notebook as needed for your specific dataset.