Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sharif-minhaz/web-scraping-with-js
Web scraper extracts data from websites. In JavaScript, jsdom and Puppeteer are powerful tools for this task. jsdom is a lightweight DOM implementation that lets user manipulate and query static pages. Puppeteer is a Node.js library for controlling headless Chrome/Chromium, ideal for dynamic websites. It can simulate user interactions and take ss
https://github.com/sharif-minhaz/web-scraping-with-js
bun jsdom puppeteer web-scraping
Last synced: about 17 hours ago
JSON representation
Web scraper extracts data from websites. In JavaScript, jsdom and Puppeteer are powerful tools for this task. jsdom is a lightweight DOM implementation that lets user manipulate and query static pages. Puppeteer is a Node.js library for controlling headless Chrome/Chromium, ideal for dynamic websites. It can simulate user interactions and take ss
- Host: GitHub
- URL: https://github.com/sharif-minhaz/web-scraping-with-js
- Owner: Sharif-Minhaz
- Created: 2024-08-02T17:58:23.000Z (3 months ago)
- Default Branch: master
- Last Pushed: 2024-09-14T13:16:53.000Z (about 2 months ago)
- Last Synced: 2024-10-17T22:27:42.318Z (19 days ago)
- Topics: bun, jsdom, puppeteer, web-scraping
- Language: JavaScript
- Homepage:
- Size: 59.6 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Web Scraping Project
This project demonstrates web scraping using `jsdom` and `Puppeteer`. Follow the instructions below to set up and run the project.
## Installation
First, ensure you have [Yarn](https://yarnpkg.com/) installed on your machine. Then, run the following command to install the necessary packages:
```sh
yarn install
```## Running the Project
This project is set to run with [Bun](https://bun.sh/) runtime for improved performance so, if you don't want to use `bun` you can simple go to the `package.json` and modify `start` script like that:
```sh
# "start": "bun index.js",
"start": "node index.js"
```To start the project, use the following command:
```sh
yarn start
```