Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/donotknowwhy/scrape-by-puppeteer


https://github.com/donotknowwhy/scrape-by-puppeteer

puppeteer scraping

Last synced: 10 days ago
JSON representation

Awesome Lists containing this project

README

        

# Web Scraping Project: thegioididong.com Product Data Extraction

## Overview

This project aims to scrape product data from the website [thegioididong.com](https://www.thegioididong.com/), a popular electronics retailer in Vietnam. The script uses Puppeteer, a Node.js library, to automate the process of navigating through the website, handling pagination, and extracting product information. The extracted data is then formatted into a JSON array containing the product name, original price, sale price, and discount rate.

## Features

- **Automated Navigation**: The script automatically navigates through the product listing pages, handling the "Load More" button to load additional products.
- **Data Extraction**: Extracts detailed product information including name, original price, sale price, and discount rate.
- **JSON Output**: Converts the extracted data into a JSON array for easy consumption and further processing.