https://github.com/darwinz/js-cli-web-crawlers

Some web crawlers run through CLI, developed in JavaScript

# JS CLI Web Crawlers

### Prerequisites

* [Node.js](https://nodejs.org/en/) 8.5+
* npm 5.3+

### Setup

Once the prerequisites are installed, install the package dependencies with npm:

```bash
$ npm install
```

Then run the config helper, which prompts for configuration options and uses them to set up the environment:

```bash
$ npm run config:helper
```

### Running

Once setup is complete, crawlers can be run either directly from the shell with `node` or through a Node.js web service.

##### Example (running from CLI)

```bash
$ node crawlers/vivint_solar --run
```
or
```bash
$ npm run crawlers:solar
```
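
For readers adding their own crawler, the `--run` flag used above could be handled with a guard like the one below. This is a minimal sketch, not code from this repository: `shouldRun` and `crawl` are hypothetical names, and the actual crawlers may parse their arguments differently.

```javascript
// Hypothetical crawler entry point; names and structure are assumptions,
// not taken from this repository.

// Returns true when the --run flag was passed on the command line.
function shouldRun(argv) {
  // argv[0] is the node binary and argv[1] the script path,
  // so user-supplied flags start at index 2.
  return argv.slice(2).includes('--run');
}

// Placeholder for the real crawling logic.
function crawl() {
  return 'crawl finished';
}

// Only crawl when explicitly asked to, so the module can also be
// required by other code (for example a web service) without side effects.
if (shouldRun(process.argv)) {
  console.log(crawl());
}

module.exports = { shouldRun, crawl };
```

With a guard like this, the same file can be started from the shell with `--run` or required from another module that calls `crawl()` itself.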

##### Example (running as web service)

```bash
$ npm start
```

With the server running, query a crawler endpoint from another shell:

```bash
$ curl -XGET http://localhost:3000/crawlers/solar/production
```
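
The same request can be made from Node.js instead of `curl`. The sketch below uses only the built-in `http` module. `getCrawlerResult` is a hypothetical helper name, and because the response format is not documented here, the body is returned as a raw string rather than parsed.

```javascript
const http = require('http');

// Hypothetical helper (not part of this repository): sends a GET request
// to a crawler endpoint and resolves with the status code and raw body.
function getCrawlerResult(url) {
  return new Promise((resolve, reject) => {
    http
      .get(url, (res) => {
        let body = '';
        res.setEncoding('utf8');
        // Accumulate the response as it streams in.
        res.on('data', (chunk) => {
          body += chunk;
        });
        res.on('end', () => resolve({ status: res.statusCode, body }));
      })
      // Connection errors (for example, the server is not running) reject the promise.
      .on('error', reject);
  });
}

// Example usage, assuming the web service was started with `npm start`:
//
// getCrawlerResult('http://localhost:3000/crawlers/solar/production')
//   .then(({ status, body }) => console.log(status, body))
//   .catch((err) => console.error('Request failed:', err.message));
```

Returning the raw body keeps the sketch independent of whatever format the service responds with; callers can parse it once the format is known.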

### Crawlers

* Vivint Solar - check production output
* Google Analytics - check real-time analytics data

### Testing

Integration tests can be run with npm:
```bash
$ npm test
```