An open API service indexing awesome lists of open source software.

https://github.com/weonlyliveonce13/off-price-analytics

A data-driven approach to understanding what factors drives off-price sales. Conducting data cleaning, data preparation, exploratory data analysis, feature engineering, model experiments, and evaluation.
https://github.com/weonlyliveonce13/off-price-analytics

analytics exploratory-data-analysis feature-engineering feature-selection python quarto quarto-pub scikitlearn-machine-learning uv

Last synced: 4 months ago
JSON representation

A data-driven approach to understanding what factors drives off-price sales. Conducting data cleaning, data preparation, exploratory data analysis, feature engineering, model experiments, and evaluation.

Awesome Lists containing this project

README

          

# Background
You have just joined an off-price retailer that specialises in selling branded products at
discounted prices. The business wants to understand which product types, brands, and
suppliers move fastest, how pricing affects inventory turnover, and how customer shopping
habits vary by region and time.

P.S: This is still a work in progress, intending to add more scripts and automate everything.

# Task

## Data Cleaning and Preparation
- Load the data from offprice_transactions.csv.
- Check for and address missing, inconsistent, or outlier values.
- Convert columns to appropriate data types, especially for dates and prices.
- Summarise the main steps you took and any challenges encountered.

## Exploratory Data Analysis (EDA)
Address the following business questions:
- Which product categories, brands, and suppliers sell the most units, and which
generate the most revenue?
- How does discount percentage (OriginalPrice vs. DiscountedPrice) relate to sales
volume for top brands and suppliers?
- What is the average inventory turnover per store (units sold per store per week)?
- How does clearance pricing impact sales volume and returns?
- Compare customer purchase behavior across different regions.

## Predictive Task
Predict whether a given product will be sold on clearance or not.
- Create a target variable: ClearanceFlag (Yes/No).
- Select appropriate features (category, brand, supplier, region, discount, etc.).
- Split the data into training and test sets.
- Train a simple classification model (e.g., logistic regression, decision tree).
- Report on key metrics (accuracy, precision, recall) and explain your process