https://github.com/waridrox/stackoverflow-questions

A concurrent Node.js scraper that collects Stack Overflow question stats and saves them to a CSV file.

Last synced: 10 months ago

Host: GitHub
URL: https://github.com/waridrox/stackoverflow-questions
Owner: waridrox
Created: 2022-02-02T09:40:47.000Z (over 4 years ago)
Default Branch: main
Last Pushed: 2022-02-02T13:21:07.000Z (over 4 years ago)
Last Synced: 2025-06-18T09:49:42.246Z (about 1 year ago)
Language: JavaScript
Size: 44.9 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

README

# stackoverflow-questions
A concurrent Node.js scraper that collects Stack Overflow question stats and saves them to a CSV file.

## Setup

```sh
git clone https://github.com/waridrox/stackoverflow-questions.git
cd stackoverflow-questions
npm i
```

## Usage
```sh
npm start
```

## Features

- Scrapes the Stack Overflow website for questions sorted by popularity (50 per page) using a concurrent queue of 5 processes.
- Stores the scraped data on MongoDB Atlas.
- Exports the data to a CSV file when the user quits.
- On the next run, appends the new data to the database and updates the CSV file.
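
The concurrency limit and the CSV export described above can be illustrated with a minimal sketch. This is not the repository's code: `runWithConcurrency`, `toCsv`, `escapeCsv`, `fakeScrapePage`, and the column names (`title`, `votes`, `answers`, `views`) are hypothetical names chosen for illustration, and the scraping step is replaced by a stub so the example runs without network or database access.

```javascript
// Hypothetical helper: run `worker` over `items` with at most `limit` calls in flight.
async function runWithConcurrency(items, limit, worker) {
  const results = new Array(items.length);
  let next = 0; // index of the next item to hand out

  // Each runner keeps claiming the next unclaimed item until none remain.
  async function runner() {
    while (next < items.length) {
      const index = next++;
      results[index] = await worker(items[index], index);
    }
  }

  const runnerCount = Math.min(limit, items.length);
  await Promise.all(Array.from({ length: runnerCount }, () => runner()));
  return results; // same order as `items`
}

// Quote a field only when it contains a comma, quote, or line break.
function escapeCsv(value) {
  const text = String(value);
  return /[",\r\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Turn an array of row objects into CSV text with a header line.
function toCsv(rows, columns) {
  const header = columns.join(",");
  const lines = rows.map((row) =>
    columns.map((column) => escapeCsv(row[column])).join(",")
  );
  return [header, ...lines].join("\n") + "\n";
}

// Stub standing in for a real page scrape (assumption: one row per page here).
async function fakeScrapePage(page) {
  return [
    {
      title: `Question from page ${page}`,
      votes: page * 10,
      answers: page,
      views: page * 1000,
    },
  ];
}

async function demo() {
  const pages = [1, 2, 3, 4, 5, 6, 7, 8];
  // At most 5 pages are processed at the same time.
  const perPage = await runWithConcurrency(pages, 5, fakeScrapePage);
  const csv = toCsv(perPage.flat(), ["title", "votes", "answers", "views"]);
  console.log(csv);
}

demo().catch(console.error);
```

In a real scraper the stub would be replaced by an HTTP request plus HTML parsing, and the rows would be written to the database before being exported. The runner-based design keeps the limit fixed at 5 without any external queue library.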