https://github.com/ginga1402/hbfc_bank_analysis
Personal loan data analysis – Exploratory data analysis
https://github.com/ginga1402/hbfc_bank_analysis
data-cleaning data-manipulation data-visualization data-wrangling excel exploratory-data-analysis
Last synced: 9 months ago
JSON representation
Personal loan data analysis – Exploratory data analysis
- Host: GitHub
- URL: https://github.com/ginga1402/hbfc_bank_analysis
- Owner: Ginga1402
- Created: 2023-05-18T14:00:48.000Z (almost 3 years ago)
- Default Branch: master
- Last Pushed: 2023-06-24T11:55:06.000Z (almost 3 years ago)
- Last Synced: 2025-03-30T03:14:35.216Z (about 1 year ago)
- Topics: data-cleaning, data-manipulation, data-visualization, data-wrangling, excel, exploratory-data-analysis
- Homepage:
- Size: 2.13 MB
- Stars: 1
- Watchers: 1
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# HBFC_Bank_Analysis
## Problem Statement:
The goal of HBFC bank is to sell more personal loans to their savings account holders. The bank wants to start a campaign to sell the personal loans, but before that they want to analyze last marketing campaign data to understand the profile of potential loan customers. This will help them in doing a targeted approach to the prospective customers in future. The bank has approached you, to help them with the analysis of the previous campaign data.
The bank basically has two customers,
• Liability customers – They deposit the money in the bank and pays interest against
the deposited money.
• Asset customers - They borrow money (take different types of loan) from the bank
and the bank charges interest against the borrowed money.
At present the bank has small number of asset/loan customers. The bank wants to increase their income by increasing the customer base of “asset customers”. Last year the bank ran a
campaign where they successfully converted 9% of the existing “liability customers” to “asset customers”. This has encouraged the bank to have a better targeted marketing campaign to increase the success ratio with minimum budget.

## Data Description
The file contains data of 5000 customers. It includes,
• Customer demographic information (age, income, etc.),
• Relationship with the bank (mortgage, securities account, etc.)
• Customer response to the last personal loan campaign (Personal Loan).
• Among the 5000 customers, only 480 (9.6%) customers have accepted the personal loan in the last year campaign.


## Objective (Task):
• As a consultant, you must perform preliminary data analysis (EDA) and visualization to understand the profile of customers having savings account, who took personal loan in the last marketing campaign VS customers who didn’t take it up.
• Using EDA identify profile of customers whom bank can target for selling personal loan.
In order to do this analysis, you are expected to solve these questions:
1) What percentage of the bank’s customers (according to the data) have availed Personal Loans vs the ones who have not availed it?
2) Generate a table with min, max, median & average for all numeric variables (age, experience, income, family members, CCAvg, Mortgage)
3) Create a new categorical variable for Experience using 4 categories –
• 0 to 10 years
• 11 to 20 years
• 21 to 30 years
• 30+ years.
Plot a bar graph for this new categorical variable
[Hint – You may make use of if else/nested if statements to accomplish this task. You can refer how Income_Category has been created in the dataset]
4) Create a scatter plot of the Age and the Experience variable. What do you observe?
5) What are the top 3 areas (ZIP Codes) where the bank’s customers are located?
6) How many customers have a combination of Fixed Deposits and Credit Cards but not Personal Loan?
7) What is the median income of the customers who have availed personal loans and compare it with the median income of those customers who have not availed personal loans? What do you infer?
8) Create 4 separate Pivot Tables. Summarize your data by percentage values.
• Education vs Personal Loan
• TD Account Vs Personal Loan
• Online vs Personal Loan
• Income_Category vs Personal Loan
[Hint: Please drag Personal Loan to the Columns area while creating the Pivot Table to get the required values]
9) Analyze the Pivot tables created in the previous question and state any anomaly that you observe. Which categorical variables appear most important for your further study if you want to analyze which customers are most likely to take personal loans and why?
10) In the last campaign, bank reached out to 5000 customers out of which 480 customers accepted the personal loan offer. The bank incurred a huge cost in running a marketing campaign to reach out to so many customers. This is where you as a strategic business consultant step in. You are tasked to optimize the cost of this campaign by identifying
the correct target base (without significant reduction in number of acceptances of offers). The bank can then send Personal Loan offers to these target customers who have a higher chance of accepting the offer. Based on your analysis, what strategy would you suggest to the management of HBFC bank?
## Observation:
Referring to the analysis and the source data provided I realized that:
1. In order to efficiently use our resource & get our product sold to the customer's, we need to target some specific groups of customers instead of putting all our resources at whole population.
2. Considering the Education Criteria we should focus on graduates & professional's . If we observe the maximum customers as classified on education criteria are undergraduates but the conversion rate is the least in this. So we should focus on graduates & Professionals.
3. The customers earning more than $100,000 have the maximum chances of buying our product.
4. While analyzing, I came to find out that the customers who are not using online banking tend to take more personal loan rather than the one's who are using online banking.We should focus more on them .
A table is presented in the worksheet which will help you to have a more clear picture of my analysis.
With respect to maximizing our resources & convert Liability Customers to Asset Customers we should focus on the above samples of customers rather than the whole population.