https://github.com/agentender/openpayrollcrawler

A simple webcrawler targeted at https://openpayrolls.com

# OpenPayroll Crawler

This is a simple web crawler meant to extract payroll data from OpenPayroll (https://openpayrolls.com).

## Dependencies ##
* Scrapy (install with `pip install scrapy`)

## How to run: ##
Running the crawler is simple:
* Open `crawler.py`.
* Edit line 44 to set the location you want to crawl (check http://openpayrolls.com for valid locations).
* Optionally, change the output file name there as well.
* Run `crawler.py` directly as the main script (for example, `python crawler.py`).
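
For readers unfamiliar with running Scrapy from a plain script, the steps above can be sketched roughly as follows. This is not the contents of `crawler.py`: the names `LOCATION`, `OUTPUT_FILE`, `build_start_url`, `run`, and `PayrollSpider` are hypothetical, the URL pattern is an assumption, and the `parse` method is only a placeholder. The Scrapy calls used (`scrapy.Spider`, `CrawlerProcess`, the `FEEDS` setting) are part of Scrapy's public API; `FEEDS` needs Scrapy 2.1 or later.

```python
# Hypothetical sketch of a script-run Scrapy crawl. None of these names are
# taken from crawler.py; they only illustrate "edit a location, then run".
from urllib.parse import quote

LOCATION = "example-location"  # assumption: a location slug listed on the site
OUTPUT_FILE = "payroll.csv"    # assumption: any writable output path


def build_start_url(location: str) -> str:
    """Build the start URL for a location (the URL pattern is an assumption)."""
    return "https://openpayrolls.com/" + quote(location.strip().lower())


def run(location: str, output_file: str) -> None:
    """Start a one-off crawl and write the scraped items to output_file as CSV."""
    # Imported here so build_start_url stays usable without Scrapy installed.
    import scrapy
    from scrapy.crawler import CrawlerProcess

    class PayrollSpider(scrapy.Spider):
        name = "payroll_sketch"
        start_urls = [build_start_url(location)]

        def parse(self, response):
            # Placeholder extraction: yields only the page URL and title.
            yield {
                "url": response.url,
                "title": response.css("title::text").get(),
            }

    process = CrawlerProcess(settings={
        "FEEDS": {output_file: {"format": "csv"}},
    })
    process.crawl(PayrollSpider)
    process.start()  # blocks until the crawl finishes
```

Calling `run(LOCATION, OUTPUT_FILE)` under an `if __name__ == "__main__":` guard gives the "run as main" behaviour described above; changing the two constants then corresponds to editing the location and output file name.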

## *WARNING*: Running this for large locations will likely get your IP address banned from OpenPayroll ##

### A VPN is strongly recommended ###
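
Alongside a VPN, slowing the crawl down may lower the chance of a ban, though it does not guarantee anything. The sketch below collects Scrapy's built-in throttling settings in one dictionary. The setting names are real Scrapy settings; the values, and the names `POLITE_SETTINGS` and `with_polite_settings`, are illustrative assumptions and are not taken from `crawler.py`.

```python
# Real Scrapy setting names; the values are untuned, illustrative assumptions.
POLITE_SETTINGS = {
    "ROBOTSTXT_OBEY": True,               # honour the site's robots.txt
    "DOWNLOAD_DELAY": 2.0,                # base delay between requests, in seconds
    "CONCURRENT_REQUESTS_PER_DOMAIN": 1,  # one request at a time per domain
    "AUTOTHROTTLE_ENABLED": True,         # adapt the delay to server latency
    "AUTOTHROTTLE_START_DELAY": 2.0,      # initial AutoThrottle delay, in seconds
    "AUTOTHROTTLE_MAX_DELAY": 30.0,       # upper bound on the adaptive delay
}


def with_polite_settings(base: dict) -> dict:
    """Return a copy of base with the throttling settings layered on top."""
    merged = dict(base)
    merged.update(POLITE_SETTINGS)
    return merged
```

The merged dictionary can be passed as the `settings` argument of `scrapy.crawler.CrawlerProcess`, or the same keys can be set in a spider's `custom_settings`.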