https://github.com/waridrox/stackoverflow-questions
A concurrent node.js scraper that gets stack overflow question stats and saves it in a CSV file.
https://github.com/waridrox/stackoverflow-questions
Last synced: 10 months ago
JSON representation
A concurrent node.js scraper that gets stack overflow question stats and saves it in a CSV file.
- Host: GitHub
- URL: https://github.com/waridrox/stackoverflow-questions
- Owner: waridrox
- Created: 2022-02-02T09:40:47.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2022-02-02T13:21:07.000Z (over 4 years ago)
- Last Synced: 2025-06-18T09:49:42.246Z (about 1 year ago)
- Language: JavaScript
- Size: 44.9 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# stackoverflow-questions
A concurrent node.js scraper that gets stack overflow question stats and saves it in a CSV file.
## Setup
```sh
git clone https://github.com/waridrox/stackoverflow-questions.git
cd stackoverflow-questions
npm i
```
## Usage
```sh
npm start
```
## Features
Scrapes the stackoverflow website for questions sorted by popularity (50 per page) using a concurrent queue of 5 processes.
Stores the data on MongoDB Atlas.
When the user quits, the data is exported to a CSV file.
When the user starts the scraper again, the new data is appended to the db, also updating the CSV file.