An open API service indexing awesome lists of open source software.

https://github.com/iurianu/simple-web-scraper

A simple web scraper with axios and express
https://github.com/iurianu/simple-web-scraper

axios cheerio cors express node scraper

Last synced: about 2 months ago
JSON representation

A simple web scraper with axios and express

Awesome Lists containing this project

README

          

# Axios & Express Web Scraper

> A simple web scraper that gets all articles from [the guardian](https://www.theguardian.com/uk)

## Sources

- [Build a Web Scraper](https://www.youtube.com/watch?v=-3lqUHeZs_0)
by [Ania Kubów](https://www.youtube.com/c/AniaKub%C3%B3w)
- [Get Data from Backend (NodeJS) to Frontend](https://www.youtube.com/watch?v=1wXYg8Eslnc&t=0s)
by [Ania Kubów](https://www.youtube.com/c/AniaKub%C3%B3w)

## Dependencies

- `axios v0.27.2`
- `cheerio v1.0.0-rc.11`
- `cors v2.8.5`
- `express v4.18.1`

## Installation

- clone this repo to your computer
- cd simple-web-scraper
- run `npm install`

## Usage

- run `npm run start` in the `./simple-web-scraper` folder

## Details

> The app is configured to scrape the data from [https://www.theguardian.com/uk](https://www.theguardian.com/uk)

> The app runs on http://localhost:8000

> You can see the resulted json file on http://localhost:8000/results

> You can see the resulted html by opening the index.html file in the browser