Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/flytomarsz/us-regional-sales-analysis
This analysis project aim to identify bike rental's behavior in 2020 from USA Regional Area. This project is part of my Independent Study at RevoU.
https://github.com/flytomarsz/us-regional-sales-analysis
data-analytics data-visualization mysql powerbi spreadsheet
Last synced: 2 days ago
JSON representation
This analysis project aim to identify bike rental's behavior in 2020 from USA Regional Area. This project is part of my Independent Study at RevoU.
- Host: GitHub
- URL: https://github.com/flytomarsz/us-regional-sales-analysis
- Owner: Flytomarsz
- Created: 2025-01-28T10:49:22.000Z (13 days ago)
- Default Branch: main
- Last Pushed: 2025-01-28T12:46:48.000Z (13 days ago)
- Last Synced: 2025-01-28T13:48:05.391Z (13 days ago)
- Topics: data-analytics, data-visualization, mysql, powerbi, spreadsheet
- Homepage: https://www.kaggle.com/datasets/talhabu/us-regional-sales-data/data
- Size: 13 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# US Regional Sales Analysis 📊
## Project Background
Analyzing factors in sales trend fluctuations during December 2020. This project aiming to give some actionable insights which useful for strategic decision-making in order to increase product sales.Factors causing the decline and increase in sales across various states:
- **Marketing Analysis:** Sales team performance across different sales channels, including In-Store, Online, Distributor, and Wholesale.
- **Geography Factors:** Climate and weather influences on product sales, uneven population distribution on each different country, household income differences on each country.
- **Product Performance:** Price rate every product is not align with product demand.
- **Company Services:** The delivery time is too slow.Issue Tree, SMART Analysis, and DARCI can be access here [https://whimsical.com/milestone-1-QDj4xuQHMEJwJcfjP1PP14]
ERD Diagram can be access here [https://dbdiagram.io/d/US-Regional-Sales-ERD-6798a274263d6cf9a044336a]
US Regional Sales Dataset on Kaggle [https://www.kaggle.com/datasets/talhabu/us-regional-sales-data/data]
## Tools
- Google Sheets
- Google Bigquery, MySQL Workbench
- Microsoft Power BI# Data Structure & Initial Checks
Table description:
- **Sales_Order:** Information for each order.
- **Customers:** Customers detail information.
- **Store_Location:** Information of store location with different types of income and geography informations.
- **Product:** Detail of products information.
- **Region:** Detail of area in region.
- **Sales_team:** Sales team information.Entity Relationship Diagram:
![image](https://github.com/user-attachments/assets/018461e6-d047-46a6-85a5-84be46085ce4)# Executive Summary
### Overview of Findings
In 2020 California achieved the highest overall sales approaching 3.000. While other states had sales under than 1.500, even many states are under 500. An analysis reveals significant drop in December 2020 with many states has reported zero sales, aside the other 5 states has maintained stable performance. The company want to investigate the factors affecting these sales spikes and declines.Dashboard overview
![image](https://github.com/user-attachments/assets/3467e9c8-2942-4ead-9480-a8873168375c)### Business Insight Overview
![image](https://github.com/user-attachments/assets/bc09705c-4dd1-46ed-bed6-ec91b211e841)
1. Total Order Quantity: 13.3K and total Revenue: $26.6M, suggesting strong revenue from these orders. The average revenue per order = $26.6M / 13.3K ≈ $2,000 per order. This implies a high-value product/service or bulk purchases.
2. Total Profit: $8M and total Unit Cost: $4.2M. Profit Margin = (Profit / Revenue) × 100 = (8M / 26.6M) × 100 ≈ 30%. The 30% profit margin is generally strong, indicating healthy pricing strategies and cost management.
3. Total Population: 892.1M and total Household Income: $320.5M. Average Household Income per Person = $320.5M / 892.1M ≈ $359 per person. This suggests a market with moderate spending power, meaning pricing strategies should align with customer affordability.### Delivery Time Analysis
![image](https://github.com/user-attachments/assets/21496f9a-a737-4114-9682-b5e8e89e5871)
1. The dots on the scatter plot are randomly distributed without any clear pattern & relatively flat, indicating that the land area size does not influence delivery time.
2. Showing average delivery time between 15 to 25 days.![image](https://github.com/user-attachments/assets/b11ede01-702b-425b-ab3e-b303cd4bf00d)
1. With this bar chart, it shows that majority of delivery time take more than 30 days (1,478 deliveries), while the fastest delivery time 0-5 days tracked for only 72 deliveries.### Price and Demand Analysis
![image](https://github.com/user-attachments/assets/9ace2867-f76c-461c-9a2f-8ddebd7a6cb0)
1. Based on the scatter plot, price influences order volume, but it is not the primary factor.
2. As order quantity increases, the unit price tends to rise, although with significant variation.### Product Performance
![image](https://github.com/user-attachments/assets/a4af3e5f-fdc3-41fe-8121-83e84410c7a2)
1. This chart shows that order quantity impacts the revenue.
2. The higher order quantity per product = the higher chance revenue per peroduct.### Sales Team Performance
![image](https://github.com/user-attachments/assets/5eee4bca-e1ed-4c4f-a65e-51b8f5cf51f9)
1. The number of orders impacts total revenue and average revenue for each state.
2. This chart shows that the sales with the highest orders do not necessarily generate the highest revenue### Population & Household Income Analysis
![image](https://github.com/user-attachments/assets/7303cb2e-86bf-4961-988f-45c91094b1f8)
1. **California** has the highest population (180.6M) and generates the highest revenue ($5.74M), indicating its dominance across multiple metrics. **New York** and **Texas** rank second and third in population, but their revenues differ significantly caused by differences in population number.
2. States like California and New York have very high household incomes ($58.9M and $57.4M, respectively), but the revenue they generate shows significant differences. This suggests differing consumer behaviors or market dynamics in these states.
3. California not only has the highest order quantity (2,945) but also the highest revenue, showing a strong correlation between orders and revenue.
4. New York have fewer orders (686) but still generate significant revenue, suggesting higher average order values.
5. Tennessee, Virginia, and Indiana have the lowest revenues (all below $500K) and relatively small order quantities.### Demography by Population Analysis
![image](https://github.com/user-attachments/assets/79950817-d98a-4e2e-a860-f85c22eb7e31)
![image](https://github.com/user-attachments/assets/3db909b0-a86e-4efd-a689-77e7c8e48c52)
1. The Geomap shows that show that areas with large populations have larger circles. Example, California: Population 180 million with household income 58 million, order quantity 2,945 and revenue 5.74 million.
2. It shows Positive Correlation: The higher the population and household income, the greater the transaction and income opportunities.### Order Quantity Month-over-Month
![image](https://github.com/user-attachments/assets/8305901c-3242-4e62-ac87-156e51f00bae)
1. Based on the line chart, the Order Qty affect Revenue.
2. Stable Early Year: January to May remained relatively stable, with orders ranging from 1,100 to 1,255.
3. July Surge: July recorded the highest order volume, although revenue was not the highest. This suggests possible discounts/promotions that boosted sales volume but lowered the average price.
4. December has the most significant drop in both orders and revenue. This could be due to seasonal factors or changes in business strategy.## Recommendations:
Based on the insights and findings above, I would recommend the Marketing & Sales Team to consider the following:
1. **Delivery Time**: Build multi-warehouses in the states and understanding the state's regional areas to better identify the suitable types of shipping for each region.
2. **Revenue**: Countries with lower sales compared to those with higher sales for the same product should adopt the sales strategies of the countries with higher sales.
3. **Product & Demand**: The most popular items will be produced in larger quantities, while the less popular items will be discounted for clearance.
4. **Population & Household Income**: Partnering with local supermarkets, distributors, or online shops to increase product recognition with consument behavior approaches.
5. **Sales Performance**: Top-performing salesman can receive bonuses, while lower performers get benefit for training and sharing sessions with high-revenue achievers.## Conclutions:
1. Delivery time is not affected by area size or climate, highlighting the need for optimized logistics & customized delivery methods for different regions.
2. The volume of order directly affects revenue; Therefore, the low-performing regions can adopt strategies from high-performing regions to improve their results.
3. Population size & household income both are influence the demand of orders also revenue.
4. The higher product prices do not significantly affect demand of orders.
5. Build multiple warehouses, effective sales strategies, and local partnerships are crucial for improving sales in low-performing regions.