https://github.com/rahulkrsharma2004/athlead-assignment
- Host: GitHub
- URL: https://github.com/rahulkrsharma2004/athlead-assignment
- Owner: Rahulkrsharma2004
- Created: 2025-03-21T12:27:30.000Z (2 months ago)
- Default Branch: main
- Last Pushed: 2025-03-21T19:27:09.000Z (2 months ago)
- Last Synced: 2025-03-21T19:44:35.243Z (2 months ago)
- Language: JavaScript
- Size: 0 Bytes
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# AthLead-Assignment
# Amazon India Smart TV Web Scraper
This project is a **fully functional web scraper** that extracts **detailed product information** from **Smart TV listings** on **Amazon India**. Using **Puppeteer**, it fetches data such as product name, price, discounts, bank offers, ratings, reviews, and images.
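As a rough illustration of the scraping step, the sketch below reads a few fields from a product page through Puppeteer's `page.$eval`. The selectors and the helper names `readText`, `scrapeProduct`, and `scrapeUrl` are assumptions made for this example and are not taken from this repository; Amazon's markup changes often, so treat the selectors as placeholders.

```javascript
// Hypothetical selectors; Amazon's markup changes, so verify before use.
const SELECTORS = {
  name: '#productTitle',
  price: '.a-price .a-offscreen',
  ratingCount: '#acrCustomerReviewText',
};

// Read trimmed text for a selector, or null when it cannot be read.
// page.$eval rejects if nothing matches, so any failure is mapped to null.
async function readText(page, selector) {
  try {
    return await page.$eval(selector, (el) => el.textContent.trim());
  } catch {
    return null;
  }
}

// Collect the fields into one plain object that serializes cleanly to JSON.
async function scrapeProduct(page) {
  const product = {};
  for (const [field, selector] of Object.entries(SELECTORS)) {
    product[field] = await readText(page, selector);
  }
  return product;
}

// Entry point, assuming `puppeteer` is installed (npm install puppeteer).
async function scrapeUrl(url) {
  const puppeteer = require('puppeteer');
  const browser = await puppeteer.launch();
  try {
    const page = await browser.newPage();
    await page.goto(url, { waitUntil: 'domcontentloaded' });
    return await scrapeProduct(page);
  } finally {
    await browser.close();
  }
}
```

Because `scrapeProduct` only needs an object with a `$eval` method, it can be exercised with a stub page in tests without launching a browser.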
The extracted data is then displayed in an **Amazon-like UI**, built with **React.js** and **Tailwind CSS**. This tool helps users analyze product details efficiently **without manually browsing Amazon**.

## Features
- ✔️ **Scrapes Key Product Details**: Product Name, Rating, Price, Number of Ratings
- ✔️ **Captures Discounts & Bank Offers**: Fetches total discount and banking deals
- ✔️ **Extracts "About This Item" & "Product Information"**: All technical details included
- ✔️ **Collects Amazon Product Images**: Fetches images from the product page (excluding videos)
- ✔️ **Extracts "From the Manufacturer" Images**: Retrieves images from the manufacturer section
- ✔️ **AI-Generated Customer Review Summary**: Captures Amazon's AI-generated insights
- ✔️ **User-Friendly UI**: Displays extracted data in a clean, Amazon-like interface
- ✔️ **Optimized for Performance**: Efficient and fast scraping using Puppeteer
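The discount listed among the scraped fields comes down to comparing a list price with a deal price. Here is a minimal sketch, assuming prices arrive as strings such as `₹32,999` and that the discount is the percentage saved relative to the list price. `parseRupees` and `discountPercent` are hypothetical helpers, not functions from this repository, and the prices are made-up sample values.

```javascript
// Convert a price string such as "₹32,999.00" into a number of rupees.
// Returns null when the text contains no usable digits.
function parseRupees(text) {
  const cleaned = String(text).replace(/[^0-9.]/g, '');
  if (cleaned === '') return null;
  const value = Number.parseFloat(cleaned);
  return Number.isNaN(value) ? null : value;
}

// Percentage saved relative to the list price, rounded to a whole number.
// Returns null when the list price is missing or not positive.
function discountPercent(listPrice, dealPrice) {
  if (!(listPrice > 0) || dealPrice == null) return null;
  return Math.round(((listPrice - dealPrice) / listPrice) * 100);
}

const listPrice = parseRupees('₹49,999');
const dealPrice = parseRupees('₹32,999');
console.log(discountPercent(listPrice, dealPrice)); // 34
```

Returning `null` instead of throwing keeps one missing price from aborting the scrape of an otherwise complete product record.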
## 🛠️ Tech Stack
### 🔹 **Frontend**
- **React.js**: Component-based UI development
- **Tailwind CSS**: Utility-first styling framework
- **React Icons**: For attractive UI icons

### 🔹 **Backend**
- **Node.js**: JavaScript runtime for server-side logic
- **Express.js**: Fast, unopinionated web framework
- **Puppeteer**: Headless Chrome browser for web scraping

### 🔹 **Data Handling**
- **Axios**: For API calls
- **JSON**: Stores and structures scraped data

### 🔹 **Deployment & Dev Tools**
- **Vercel**: Frontend deployment and backend hosting
- **GitHub**: Version control and collaboration
- **ESLint & Prettier**: Code quality and formatting
- **Thunder Client**: API testing

## ⚡ Getting Started
### 1️⃣ Clone the Repository
```bash
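git clone https://github.com/Rahulkrsharma2004/AthLead-Assignment.git
cd AthLead-Assignment

# Assumption: the backend and frontend are sibling folders inside the clone.
cd Backend-Scraper        # scraper API; or, from the repository root:
# cd Frontend-Scraper     # React UI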