https://github.com/thisisparker/datasets
Datasets I've cleaned up or compiled from public sources
- Host: GitHub
- URL: https://github.com/thisisparker/datasets
- Owner: thisisparker
- Created: 2017-03-17T03:59:16.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2018-02-09T20:20:54.000Z (over 8 years ago)
- Last Synced: 2025-01-24T21:41:15.265Z (over 1 year ago)
- Size: 3.87 MB
- Stars: 4
- Watchers: 2
- Forks: 4
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Datasets
These are datasets that I've created, cleaned up, or compiled from public sources.
## Pomological
This is a listing of the images in the Pomological Watercolors Collection housed in the US Department of Agriculture's National Agricultural Library. A version of this dataset powers [the @pomological Twitter bot](https://twitter.com/pomological). [David Riordan](https://twitter.com/riordan) helped with the initial scraping.
## NYC neighborhoods
There are a bunch of problems with using ZIP codes as geographical boundaries, but if you want to throw caution to the wind, this is a set of named neighborhoods in New York City and some corresponding ZIP codes. It is a little stale and could use an update, but here it is.
## Dogs
Collected information about registered dogs in different cities (currently New York and San Francisco). These are snapshots of each city's registration database obtained through public records requests, which I've converted to JSON. Each city provides slightly different information, but names, breeds, and ZIP codes are present in both.
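
Because the dog registrations are JSON, a few lines of Python are enough to start exploring them. The sketch below counts the most common values of one field. It assumes each file parses to a list of objects, and the keys `name`, `breed`, and `zip` are hypothetical stand-ins, since each city labels its fields differently. The sample records are invented placeholders, not rows from the datasets.

```python
import json
from collections import Counter


def top_values(records, field, n=3):
    """Return the n most common non-empty values of `field` across records."""
    counts = Counter(
        # Normalize case and whitespace so "BELLA" and "bella" count together.
        str(rec[field]).strip().title()
        for rec in records
        if rec.get(field)  # skip records where the field is missing or empty
    )
    return counts.most_common(n)


# Placeholder records, invented for illustration; key names are assumptions.
sample = json.loads("""
[
  {"name": "BELLA", "breed": "Beagle", "zip": "10001"},
  {"name": "Max", "breed": "Poodle", "zip": "94110"},
  {"name": "bella", "breed": "Pug", "zip": "11201"},
  {"name": "", "breed": "Mixed", "zip": "10002"}
]
""")

print(top_values(sample, "name"))
```

To try it on a real snapshot, replace `sample` with the result of `json.load` on whichever file you've downloaded, and swap in that city's actual field names.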