https://github.com/dimitryzub/walmart-stores-coffee-analysis
Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡
https://github.com/dimitryzub/walmart-stores-coffee-analysis
analysis analytics data data-visualization matplotlib pandas python pythonanalysis seaborn
Last synced: about 1 month ago
JSON representation
Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡
- Host: GitHub
- URL: https://github.com/dimitryzub/walmart-stores-coffee-analysis
- Owner: dimitryzub
- License: bsd-3-clause
- Created: 2022-10-25T06:42:41.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-10-25T10:09:01.000Z (over 3 years ago)
- Last Synced: 2025-02-07T12:39:14.517Z (over 1 year ago)
- Topics: analysis, analytics, data, data-visualization, matplotlib, pandas, python, pythonanalysis, seaborn
- Language: Jupyter Notebook
- Homepage: https://serpapi.com/blog/serpapi-demo-project-walmart-coffee-exploratory-data-analysis/
- Size: 1.05 MB
- Stars: 1
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Walmart Stores Coffee Analysis
This Python demo project is a practical showcase of using [SerpApi's](https://serpapi.com/) [Walmart Search Engine Results API](https://serpapi.com/walmart-search-api) plus how extracted data could be used in exploratory data analysis.
This repo covers:
1. [Extracting data from Walmart Organic results](https://github.com/dimitryzub/walmart-stores-coffee-analysis/blob/74d06e8016c1903ff53bfbde0263f98264baca1c/script/extraction.py).
2. [Extracting data from all store pages using SerpApi pagination](https://github.com/dimitryzub/walmart-stores-coffee-analysis/blob/74d06e8016c1903ff53bfbde0263f98264baca1c/script/extraction.py#L190-L195).
3. [Extracting data from 500 Walmart stores](https://github.com/dimitryzub/walmart-stores-coffee-analysis/blob/74d06e8016c1903ff53bfbde0263f98264baca1c/script/walmart-stores.json). [SerpApi provides JSON Walmart Stores Locations](https://serpapi.com/walmart-stores) with 4.460 stores in total.
4. [My full process of exploratory data analysis](https://github.com/dimitryzub/walmart-stores-coffee-analysis/blob/68a28ee3fe55372194076a2177a75d7502b26f5a/analysis/walmart-coffee-analysis.ipynb).
There's also a full blog post that also shows data extraction steps: https://serpapi.com/blog/serpapi-demo-project-walmart-coffee-exploratory-data-analysis/#extracting-walmart-data
Kaggle Dataset: https://www.kaggle.com/datasets/dimitryzub/walmart-coffee-listings-from-500-stores
## Key Takeaways
1. The most popular coffee seller is Walmart.
2. The most popular coffee type is medium roast.
3. More weight (grams) doesn't equal higher price.
- A lower gram coffee may cost more than a higher gram coffee.
4. The highest coffee weight is 2835 grams (2.8 kg).
5. "Folgers classic roast ground coffee" has 15k+ reviews which is the maximum value from data set.
6. ~300-500 grams is the most frequent weight.
7. The highest coffee price is $77 (Lavazza perfetto single-serve k-cup)
## Data Sources
Walmart. Data was extracted using [Walmart Search Engine Results API](https://serpapi.com/walmart-search-api) from SerpApi.
## Plots


### Most Frequent Weight (~300-500 grams)


### Higher weight = higher price?

