https://github.com/satvikx/amzn-web-scraper
This project is a web scraper designed to extract product details from Amazon product pages using BeautifulSoup.
https://github.com/satvikx/amzn-web-scraper
beautifulsoup python scraper web
Last synced: 11 months ago
JSON representation
This project is a web scraper designed to extract product details from Amazon product pages using BeautifulSoup.
- Host: GitHub
- URL: https://github.com/satvikx/amzn-web-scraper
- Owner: satvikx
- Created: 2025-03-31T07:42:45.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-31T07:47:37.000Z (over 1 year ago)
- Last Synced: 2025-03-31T08:30:54.572Z (over 1 year ago)
- Topics: beautifulsoup, python, scraper, web
- Language: Python
- Homepage:
- Size: 0 Bytes
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Amazon Products Scraper (using BeautifulSoup)
**This project is a web scraper designed to extract product details from Amazon product pages using Python and the BeautifulSoup library. The scraper fetches information such as product name, price, ratings, reviews, images, and other relevant details. The extracted data is saved in both JSON and CSV formats for further analysis or use.**
## Features
- Extracts product details such as:
- Product name
- Price and discounts
- Ratings and number of reviews
- Product description and specifications
- Images
- Bank offers and promotional details
- Saves the extracted data in:
- `amazon_product.json` (structured JSON format)
- `amazon_product.csv` (tabular format for easy analysis)
- Handles dynamic user-agent headers to avoid detection.
- Modular and object-oriented design for better maintainability.
## Setup
[**Note** - I use uv as python package manager and Git Bash as default terminal]
1. Create Virtual Environment `uv venv venv`.
2. Activate Environment `source venv/Scripts/activate`.
3. Install Dependencies `uv pip install -r requirements.txt`.
4. Insert the product url in `main.py` file.
5. Run using `python main.py`.
6. Results are saved in [amazon_product.csv](amazon_product.csv) and [amazon_product.json](amazon_product.json).