Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/anhvu2201/rfm-analysis
RFM Analysis using Python for analyzing customers based on their most recent purchase, transaction frequency, and spending value, helping businesses identify high-value customer segments and optimize marketing strategies more effectively.
https://github.com/anhvu2201/rfm-analysis
marketing-analytics python-3 rfm-analysis
Last synced: 28 days ago
JSON representation
RFM Analysis using Python for analyzing customers based on their most recent purchase, transaction frequency, and spending value, helping businesses identify high-value customer segments and optimize marketing strategies more effectively.
- Host: GitHub
- URL: https://github.com/anhvu2201/rfm-analysis
- Owner: anhvu2201
- Created: 2024-11-17T09:14:13.000Z (about 1 month ago)
- Default Branch: main
- Last Pushed: 2024-11-17T10:16:06.000Z (about 1 month ago)
- Last Synced: 2024-11-17T11:22:20.292Z (about 1 month ago)
- Topics: marketing-analytics, python-3, rfm-analysis
- Language: Jupyter Notebook
- Homepage:
- Size: 21.4 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# RFM-Analysis
# I. Introduction
## 1. Introduction to RFM analysis:
- RFM analysis is a popular customer analysis technique in marketing and customer relationship management (CRM). It evaluates customers based on three factors:
- Recency (R): How recently did the customer make a purchase? The more recent, the higher their likelihood to engage.
- Frequency (F): How often does the customer make purchases? Frequent buyers tend to be more loyal.
- Monetary (M): How much has the customer spent in total? Higher spending indicates potentially valuable customers.
- By analyzing these factors, RFM helps businesses segment customers to optimize marketing strategies, personalize promotions, and enhance customer retention, driving data-driven decision-making efficiently.
## 2. Project Purpose:
- Determine RFM scores to segment customers into different groups.
- Analyze the company - Superstore's performance status and provide recommendations for the marketing department.
- Identify which of the three metrics (R, F, and M) should be prioritized.
# II. Data Visualization Using Python
## 1. Distribution of Recency:
![R](https://github.com/user-attachments/assets/d0ef484f-d93a-4faa-9ace-7d896f27db30)
- The distribution of Recency is right-skewed. As the Recency increases, there is a steep decline in the number of customers.
- The histogram shows that most customers have made recent purchases (<=100 days). About 1,700 customers have already bought something at Superstore in the last 50 days.
- It indicates that most of the Superstore's customers are active customers who tend to make purchases recently.
## 2. Distribution of Frequency:
![F](https://github.com/user-attachments/assets/29d3405a-cee8-40f5-82ae-8a4164a705ce)
- The distribution of Frequency is highly right-skewed. As the Frequency increases, there is a significant drop in the number of customers.
- This historam shows that most of customers have fewer than 20 transactions. For particular, more than 3,500 customers have made 1 to 10 purchases, when only few hundreds of customers have placed more than 10 orders and barely any have placed 20 or more.
- It indicates that the majority of Superstore's customers are low-frequency purchasers who do not make purchases often.
## 3. Distribution of Monetary:
![M](https://github.com/user-attachments/assets/fdde9212-e3da-4c89-81b6-4f470ca2d5e5)
- The distribution of Monetary is highly right-skewed. As the Monetary increases, there is a significant drop in the number of customers.
- This historam shows that most of customers have fewer than 10,000 monetary values. For particular, more than 3,500 customers have spent less than 5,000, when only hundreds of customers have spent more than 5,000. Virtually none have spent more than 10,000.
- It indicates that most of Superstore customers have low spending, while a small portion of customers contributes to the high monetary value segment.
## 4. Customer Segmentation By Total Sale:
![Seg By Total Sales](https://github.com/user-attachments/assets/4b4e03e5-1db3-4cee-885e-77c9a15122c6)
- Ranking the customer segmentation by total sales:
1.Champions
2.Loyal
3.At Risk
4.Need Attention
5.Hibernating Customers
6.Potential Loyalist
7.Cannot Lose Them
8.Lost Customers
9.Promising
10.About To Sleep
11.New Customers
## 5. Customer Segmentation By Customer Value:
![Seg By Customer Value](https://github.com/user-attachments/assets/be74fc49-d574-42b4-871a-79db099811fd)
- Ranking the customer segmentation by total sales:
1.Champions
2.Hibernating Customers
3.Lost Customers
4.Loyal
5.Potential Loyalist
6.At Risk
7.Need Attention
8.About To Sleep
9.New Customers
10.Promising
11.Cannot Lose Them
## 6. Distribution Of Customer Across Segments:
![Dis Across Seg](https://github.com/user-attachments/assets/425b6680-07d3-4548-b7a8-d35ea5831840)
- Customer segments can be categorized into 3 groups:
- High-Value Customers (HVC): Champions, Loyal, Potential Loyalist, New Customers, Promising.
- At-Risk Customers (ARC): Need Attention. About To Sleep, At Risk, Cannot Lose Them.
- Low-Value Customers (LVC): Hibernating Customers, Lost Customers.
![image](https://github.com/user-attachments/assets/dbc6a3f0-87e4-447a-9c40-b96edddd0a14)
- Observation:
- HVC category contributes the most, with 1871 customers.
- LVC category contributes the second - 1075 customers.
- ARC category, while the smallest, still contributes a considerable amount of 974 customers.
- It indicates that the business is in a growing state despite facing with several problems.
# III. Insights
![image](https://github.com/user-attachments/assets/a3ee9c85-adf9-4ba4-a79d-4e6aad470888)
- [Link](https://docs.google.com/spreadsheets/d/1MBt3b48lT-RzD44xsbMsgwtVO-JmIJjU/edit?usp=sharing&ouid=107825711284033293753&rtpof=true&sd=true)
# IV. Recommendation
![image](https://github.com/user-attachments/assets/92dfe72f-e37a-480e-9b63-8b4bd6fb47bb)
![image](https://github.com/user-attachments/assets/46e756d9-7165-4708-9b25-8d02b35f0feb)
![image](https://github.com/user-attachments/assets/f0130773-8e9b-4546-80c7-1534e9cafc9a)
- [Link](https://docs.google.com/spreadsheets/d/1MBt3b48lT-RzD44xsbMsgwtVO-JmIJjU/edit?usp=sharing&ouid=107825711284033293753&rtpof=true&sd=true)