An open API service indexing awesome lists of open source software.

https://github.com/rohitblaze10/amazon_prime_analysis-kaggle

Exploratory data analysis on Amazon Prime users to uncover trends in subscriptions, engagement, and customer behavior. Key insights include age group distribution, renewal patterns, device usage, and customer feedback. Visualizations and data-driven insights help optimize user experience and retention strategies
https://github.com/rohitblaze10/amazon_prime_analysis-kaggle

analysis eda python python-script

Last synced: about 2 months ago
JSON representation

Exploratory data analysis on Amazon Prime users to uncover trends in subscriptions, engagement, and customer behavior. Key insights include age group distribution, renewal patterns, device usage, and customer feedback. Visualizations and data-driven insights help optimize user experience and retention strategies

Awesome Lists containing this project

README

          

# Amazon Prime User Analysis

## Overview
This project focuses on analyzing Amazon Prime user data to uncover insights related to user demographics, subscription behavior, and engagement trends. The dataset is processed using Python, and the analysis results are visualized for better understanding.

## Project Structure
- **Amazon_prime_analysis.xlsx**: Contains raw and processed Amazon Prime user data.
- **Cleaned_amazon_prime_users.csv**: Cleaned dataset used for further analysis.
- **Updated_amazon_prime_users.csv**: Updated dataset with refined data.
- **Amazon_Prime_Final_EDA_Output.ipynb**: Jupyter notebook containing exploratory data analysis (EDA) and key insights.

## Getting Started
### Prerequisites
Ensure you have the following installed:
- Python 3.x
- Jupyter Notebook
- Pandas
- Matplotlib / Seaborn (for visualization)

### Installation
1. Clone this repository:
```bash
git clone https://github.com/rohitblaze10/amazon-prime-analysis.git
cd amazon-prime-analysis
```
2. Install the required Python libraries:
```bash
pip install pandas matplotlib seaborn jupyter
```

## Usage
1. Open Jupyter Notebook:
```bash
jupyter notebook
```
2. Run `Amazon_Prime_Final_EDA_Output.ipynb` to process and analyze the data.
3. View results in `Amazon_prime_analysis.xlsx` or the cleaned CSV files.

## Key Findings
- **Subscription Plans:**
- The majority of users subscribe to the **Monthly** plan, followed by **Annual** and **Family** plans.
- **User Engagement:**
- Users with **auto-renewal enabled** tend to have higher engagement metrics.
- **Smart TVs** are the most used devices for accessing Prime services.
- **Popular Purchase Categories:**
- The top three purchase categories among Prime users are **Electronics, Books, and Clothing**.
- **Customer Support Interactions:**
- Users with **low feedback ratings** tend to contact customer support more frequently.
- **Demographics:**
- The dataset includes users across various age groups and locations, with balanced gender representation.

## Visualizations
Below are some key visualizations from our analysis:
1. **Subscription Plan Distribution** – Highlights the most popular plans.
![sub](https://github.com/user-attachments/assets/839ecb5c-078c-4620-9d21-1a3b58aa09b2)

2. **Device Usage for Prime Access** – Shows preferred devices for streaming.
![device](https://github.com/user-attachments/assets/aae5cadb-77d5-42ac-aab5-73805a432ad3)

3. **Most Purchased Categories** – Displays frequently bought product categories.
![NO](https://github.com/user-attachments/assets/e51328c4-fb1d-4f0b-b604-55538a767f48)
nd more...
## Future Improvements
- Improve data cleaning and preprocessing techniques.
-
- Implement predictive modeling to forecast user retention.
- Add more advanced visualizations.

## Contributing
Feel free to fork this repository and make improvements. Pull requests are welcome!

## License
[Specify a license, e.g., MIT, Apache 2.0]