https://github.com/nagar2nd/apple_store_reviews---statistics-
Analyzed Apple Store reviews using statistical methods to explore user ratings, likes, and satisfaction trends. The project provides insights into engagement patterns and actionable recommendations to enhance user experience.
https://github.com/nagar2nd/apple_store_reviews---statistics-
matplotlib pandas python seaborn statistics
Last synced: about 2 months ago
JSON representation
Analyzed Apple Store reviews using statistical methods to explore user ratings, likes, and satisfaction trends. The project provides insights into engagement patterns and actionable recommendations to enhance user experience.
- Host: GitHub
- URL: https://github.com/nagar2nd/apple_store_reviews---statistics-
- Owner: Nagar2nd
- Created: 2025-01-04T07:53:06.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-01-04T08:08:15.000Z (over 1 year ago)
- Last Synced: 2025-11-18T09:20:21.906Z (7 months ago)
- Topics: matplotlib, pandas, python, seaborn, statistics
- Language: Jupyter Notebook
- Homepage:
- Size: 13.3 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Apple Store Reviews Analysis
This repository contains a comprehensive statistical analysis of the Apple Store Reviews dataset.
The analysis explores user ratings, purchase amounts, likes, and correlations to draw meaningful insights.
All steps are implemented using Python, and the notebook includes visualizations and key statistical methods.
---
## **Statistical Analysis Overview**
### **1. Central Tendency Analysis**
- **Objective:** Calculate the mean, median, and mode of app ratings and determine the best measure of central tendency.
- **Results:**
- **Mean:** 2.869
- **Median:** 3.0
- **Mode:** 1.0
- **Conclusion:** The **median** best represents the central tendency, as it is unaffected by outliers and provides a reliable measure for the dataset.
### **2. Spread of Data**
- **Objective:** Calculate the range and interquartile range (IQR) of the purchase amounts to understand the spread of the data.
- **Results:**
- **Range:** 19.97
- **IQR:** 10.19
- **Conclusion:** The **IQR** highlights the spread of the middle 50% of purchase amounts, showing moderate variability in typical spending.
### **3. Variability Analysis**
- **Objective:** Calculate the variance and standard deviation for the number of likes on reviews.
- **Results:**
- **Standard Deviation:** 28.69
- **Variance:** 822.85
- **Conclusion:** The moderate standard deviation indicates variability in user engagement, with some reviews receiving significantly more likes than others.
### **4. Correlation Analysis**
- **Objective:** Determine the correlation between the number of likes and app ratings.
- **Results:**
- **Correlation Coefficient:** 0.8425
- **Conclusion:** A strong positive correlation exists, suggesting that higher ratings are associated with more likes, indicating user preference for highly rated content.
### **5. Distribution Analysis**
- **Objective:** Plot the distribution of app ratings and analyze the skewness.
- **Results:**
- **Skewness:** 0.102
- **Conclusion:** The distribution is approximately symmetrical with a slight positive skew, indicating balanced user satisfaction with a tendency for higher ratings.
### **6. Hypothesis Testing**
- **Objective:** Test if Instagram's average rating is significantly higher than WhatsApp's at a 95% confidence level.
- **Results:**
- **T-Statistic:** -0.797
- **P-Value:** 0.787
- **Conclusion:** There is no statistically significant difference between the average ratings of Instagram and WhatsApp.
### **7. Central Limit Theorem (CLT)**
- **Objective:** Take random samples of ratings, calculate their means, and create a sampling distribution to demonstrate the CLT.
- **Results:**
- **Population Mean:** 2.869
- **Sampling Mean:** 2.878
- **Standard Error:** 0.268
- **Conclusion:** The sampling distribution is normal, supporting the CLT. This allows reliable inference about the population mean from sample means.
---
## **How to Use This Repository**
1. **Dataset:** The dataset used for this analysis is included in the `Apple_Store_Reviews.csv` folder.
2. **Notebook:** The Python analysis notebook (`Apple_store_reviews_analysis.ipynb`) contains the complete code, visualizations, and statistical computations.
3. **Requirements:**
- Python
- Libraries: `pandas`, `numpy`, `scipy`, `matplotlib`, `seaborn`
---
## **Connect**
Feel free to reach out with any questions or feedback:
- **Email:** shivaninagarofficial@gmail.com
- **LinkedIn:** https://www.linkedin.com/in/shivani-nagar12/
---
Thank you for exploring this analysis!