https://github.com/srijanmukherjee/dataharvestors

# DataHarvesters
Various web scrapers for collecting data

## How to run

Create a virtual environment
```console
python -m venv env
```

[Activate](https://docs.python.org/3/library/venv.html#how-venvs-work) the environment and install the dependencies
```console
pip install -r requirements.txt
```
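If the dependencies seem to land in the system-wide Python instead of `env`, a quick check like the one below may help. It is only a sketch: the helper name `in_virtualenv` is made up for illustration and is not part of this repository. It relies on the documented behaviour that `sys.prefix` differs from `sys.base_prefix` when the interpreter belongs to a virtual environment.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the running interpreter belongs to a virtual environment.

    Inside a venv, sys.prefix points at the environment directory while
    sys.base_prefix still points at the Python installation it was created from.
    """
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    # Hypothetical helper for illustration only; not shipped with the project.
    print("virtual environment active:", in_virtualenv())
    print("interpreter prefix:", sys.prefix)
```

Run it with the same `python` that will run `pip`; if it reports `False`, the environment is probably not activated in the current shell.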

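### goodreads

Run the spider and write the scraped items to `output/books.json` (`-O` overwrites an existing file)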
```console
scrapy runspider goodreads/BookSpider.py -O output/books.json
```
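Once the spider has finished, the export can be inspected from Python. The sketch below assumes only that `output/books.json` holds a JSON array, which is what Scrapy's JSON feed export writes. The function name `load_items` is hypothetical, and no field names are assumed because they depend on what `BookSpider` yields.

```python
import json
from pathlib import Path


def load_items(path):
    """Load the JSON array written by Scrapy's JSON feed export."""
    with Path(path).open(encoding="utf-8") as fh:
        items = json.load(fh)
    if not isinstance(items, list):
        raise ValueError(f"expected a JSON array in {path}")
    return items


if __name__ == "__main__":
    # Path taken from the command above; adjust it if you export elsewhere.
    output = Path("output/books.json")
    if output.exists():
        items = load_items(output)
        print(f"{len(items)} items scraped")
        if items and isinstance(items[0], dict):
            # Field names are whatever the spider emits; they are not assumed here.
            print("fields in first item:", sorted(items[0]))
    else:
        print(f"{output} not found; run the spider first")
```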