Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/ndom91/pw-web-scraper

🌐 Basic Playwright Web-scraper
https://github.com/ndom91/pw-web-scraper

nodejs playwright web-scraper

Last synced: 22 days ago
JSON representation

🌐 Basic Playwright Web-scraper

Awesome Lists containing this project

README

        

# 🌐 Web Scraper

Basic [`playwright`](https://playwright.dev)/[`apify`](https://sdk.apify.com/) based web-scraper!

## 🕹ī¸ Setup

1. Clone repository and install dependencies

```
$ git clone [email protected]:ndom91/web-scraper-berlin.git
$ cd web-scraper-berlin
$ npm install
```

2. Paste your list of URLs to be scraped into `sites.txt`

3. Double check the `SEARCH_TERM` variable towards the top of `index.js`. This is the term which will trigger sites to be written to `output.txt` during the scraping process.

4. Run `npm run scrape` :tada:

## 📝 License

MIT