https://github.com/rahulkrsharma2004/athlead-assignment
A web scraper built to extract detailed product information from any Smart TV listing on Amazon India. This tool captures key product details, pricing, offers, customer reviews, and images for a structured and comprehensive overview.
https://github.com/rahulkrsharma2004/athlead-assignment
axios cors css3 expressjs html5 mern-stack nodejs puppeteer react-icons react-router reactjs tailwindcss
Last synced: 2 months ago
JSON representation
A web scraper built to extract detailed product information from any Smart TV listing on Amazon India. This tool captures key product details, pricing, offers, customer reviews, and images for a structured and comprehensive overview.
- Host: GitHub
- URL: https://github.com/rahulkrsharma2004/athlead-assignment
- Owner: Rahulkrsharma2004
- Created: 2025-03-21T12:27:30.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-24T12:37:37.000Z (about 1 year ago)
- Last Synced: 2025-07-16T08:45:59.160Z (11 months ago)
- Topics: axios, cors, css3, expressjs, html5, mern-stack, nodejs, puppeteer, react-icons, react-router, reactjs, tailwindcss
- Language: JavaScript
- Homepage: https://amazon-web-scraper-psi.vercel.app
- Size: 162 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# AthLead-Assignment
# ποΈ Amazon India Smart TV Web Scraper
This project is a **fully functional web scraper** that extracts **detailed product information** from **Smart TV listings** on **Amazon India**. Using **Puppeteer**, it fetches data such as product name, price, discounts, bank offers, ratings, reviews, and images.
The extracted data is then displayed in an **Amazon-like UI**, built with **React.js** and **Tailwind CSS**. This tool helps users analyze product details efficiently **without manually browsing Amazon**.
## π Features
βοΈ **Scrapes Key Product Details** β Product Name, Rating, Price, Number of Ratings
βοΈ **Captures Discounts & Bank Offers** β Fetches total discount & banking deals
βοΈ **Extracts "About This Item" & "Product Information"** β All technical details included
βοΈ **Collects Amazon Product Images** β Fetches images from the product page (excluding videos)
βοΈ **Extracts "From the Manufacturer" Images** β Retrieves images from the manufacturer section
βοΈ **AI-Generated Customer Review Summary** β Captures Amazonβs AI-generated insights
βοΈ **User-Friendly UI** β Displays extracted data in a clean, Amazon-like interface
βοΈ **Optimized for Performance** β Efficient and fast scraping using Puppeteer
## πΈ Preview

## π οΈ Tech Stack
### πΉ **Frontend**
- **React.js** β Component-based UI development
- **Tailwind CSS** β Utility-first styling framework
- **React Icons** β For attractive UI icons
### πΉ **Backend**
- **Node.js** β JavaScript runtime for server-side logic
- **Express.js** β Fast, unopinionated web framework
- **Puppeteer** β Headless Chrome browser for web scraping
### πΉ **Data Handling**
- **Axios** β For API calls
- **JSON** β Stores and structures scraped data
### πΉ **Deployment & Dev Tools**
- **Vercel** β Frontend deployment
- **Vercel** β Backend hosting options
- **GitHub** β Version control and collaboration
- **ESLint & Prettier** β Code quality and formatting
- **Thunderclient** β API testing
## β‘ Getting Started
### 1οΈβ£ Clone the Repository
```bash
git clone https://github.com/Rahulkrsharma2004/AthLead-Assignment.git
cd Backend-Scraper
cd Frontend-Scraper