Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/khaliqgant/data-scrubber
Collection of scripts to pull data from remote API sources
- Host: GitHub
- URL: https://github.com/khaliqgant/data-scrubber
- Owner: khaliqgant
- Created: 2016-09-25T17:30:36.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2016-10-11T01:49:03.000Z (over 8 years ago)
- Last Synced: 2024-11-10T03:42:21.421Z (2 months ago)
- Language: Python
- Size: 6.84 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
Data Scrubber
=============
Collection of scripts to pull remote data.
Google Sheet of target datasets can be found [here](https://docs.google.com/spreadsheets/d/1a3Immizb2NfMQt1bjieMG3W0sLo6PV8ZDPYmzmRG5dQ/edit?usp=sharing).

## Getting Started
* `make init` to spin up a virtual environment and install required packages

## Python
Contains Python scripts to pull datasets:
* **yelp_neighborhood.py:** For any given neighborhood (e.g. 'Fort Greene, Brooklyn'), creates a JSON file (or pandas DataFrame) of all food establishments within the neighborhood.
* **nyc_open_business_pull.py:** Pulls business information for all businesses within a given ZIP code. Data comes from the City of New York Open Data project.
* **zip_lookup.py:** Outputs a Zip Code lookup CSV from http://federalgovernmentzipcodes.us/.
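
The scripts themselves are not reproduced here, so the following is only a minimal sketch of the kind of post-processing they describe: building a ZIP code lookup from CSV text and filtering business records by ZIP code before serializing them to JSON. The function names, the column names (`Zipcode`, `City`, `State`, `zip_code`), and the sample rows are all assumptions made for illustration, not the repository's actual code or the real schema of any data source.

```python
import csv
import io
import json


def build_zip_lookup(csv_text, zip_field="Zipcode", city_field="City", state_field="State"):
    """Map each ZIP code in CSV text to a (city, state) tuple.

    The column names are hypothetical defaults; adjust them to the real file.
    ZIP codes stay strings so that leading zeros are preserved.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row[zip_field]: (row[city_field], row[state_field]) for row in reader}


def businesses_in_zip(records, zip_code, zip_field="zip_code"):
    """Return only the records whose ZIP field matches zip_code.

    Values are compared as stripped strings, so 11205 and "11205" both match.
    """
    target = str(zip_code).strip()
    return [r for r in records if str(r.get(zip_field, "")).strip() == target]


def to_json(records):
    """Serialize records as indented JSON text with sorted keys."""
    return json.dumps(records, indent=2, sort_keys=True)


# Illustrative sample data only; not taken from any real dataset.
SAMPLE_CSV = "Zipcode,City,State\n00501,Holtsville,NY\n11205,Brooklyn,NY\n"
SAMPLE_BUSINESSES = [
    {"name": "Example Cafe", "zip_code": "11205"},
    {"name": "Example Deli", "zip_code": "10001"},
]

if __name__ == "__main__":
    lookup = build_zip_lookup(SAMPLE_CSV)
    matches = businesses_in_zip(SAMPLE_BUSINESSES, "11205")
    print(lookup["11205"])
    print(to_json(matches))
```

Keeping ZIP codes as strings is a deliberate choice in this sketch: parsing them as integers would turn a code such as `00501` into `501`.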