https://github.com/iurianu/simple-web-scraper
A simple web scraper with axios and express
https://github.com/iurianu/simple-web-scraper
axios cheerio cors express node scraper
Last synced: about 2 months ago
JSON representation
A simple web scraper with axios and express
- Host: GitHub
- URL: https://github.com/iurianu/simple-web-scraper
- Owner: iurianu
- Created: 2022-06-01T15:36:49.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2022-06-01T15:50:43.000Z (about 4 years ago)
- Last Synced: 2025-01-13T16:27:18.427Z (over 1 year ago)
- Topics: axios, cheerio, cors, express, node, scraper
- Language: JavaScript
- Homepage:
- Size: 11.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Axios & Express Web Scraper
> A simple web scraper that gets all articles from [the guardian](https://www.theguardian.com/uk)
## Sources
- [Build a Web Scraper](https://www.youtube.com/watch?v=-3lqUHeZs_0)
by [Ania Kubów](https://www.youtube.com/c/AniaKub%C3%B3w)
- [Get Data from Backend (NodeJS) to Frontend](https://www.youtube.com/watch?v=1wXYg8Eslnc&t=0s)
by [Ania Kubów](https://www.youtube.com/c/AniaKub%C3%B3w)
## Dependencies
- `axios v0.27.2`
- `cheerio v1.0.0-rc.11`
- `cors v2.8.5`
- `express v4.18.1`
## Installation
- clone this repo to your computer
- cd simple-web-scraper
- run `npm install`
## Usage
- run `npm run start` in the `./simple-web-scraper` folder
## Details
> The app is configured to scrape the data from [https://www.theguardian.com/uk](https://www.theguardian.com/uk)
> The app runs on http://localhost:8000
> You can see the resulted json file on http://localhost:8000/results
> You can see the resulted html by opening the index.html file in the browser