https://github.com/christabor/plantstuff
Warning! messy/unstable! :herb: :evergreen_tree: :maple_leaf: :leaves: :hibiscus: Utilities for retrieving, computing, organizing, and creating plant/horticulture data from various sources.
https://github.com/christabor/plantstuff
biology datasets horticulture plants
Last synced: 3 months ago
JSON representation
Warning! messy/unstable! :herb: :evergreen_tree: :maple_leaf: :leaves: :hibiscus: Utilities for retrieving, computing, organizing, and creating plant/horticulture data from various sources.
- Host: GitHub
- URL: https://github.com/christabor/plantstuff
- Owner: christabor
- License: mit
- Created: 2017-12-28T22:04:22.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2020-03-03T04:54:17.000Z (over 5 years ago)
- Last Synced: 2025-04-12T13:13:04.211Z (6 months ago)
- Topics: biology, datasets, horticulture, plants
- Language: Python
- Homepage:
- Size: 915 KB
- Stars: 6
- Watchers: 2
- Forks: 0
- Open Issues: 12
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# plantstuff
:herb: :evergreen_tree: :maple_leaf: :leaves: :hibiscus: Utilities for retrieving, computing, organizing, and creating plant/horticulture data from various sources.## DISCLAIMER
This repo is a work-in-progress. The ultimate goals are not yet defined, so everything is still very messy and NOT production ready in any capactiy (whatever that means here).
## LEGAL COPYRIGHT DISCLAIMER
No scraped data is stored here until content copyright is verified.
## Scraping spiders
The following scrapy spiders have been created and have been tested to generate real uniform data. All spiders are under `scraping.scrapers.spiders`:
* theplantlist
* springhillnursery
* provenwinners
* wikipedia (basic categorical lists for now)**Note**: unless otherwise noted, these are not considered exhaustive - but they typically do retrieve most all urls and handle pagination.
### Works in progress
* Monrovia
* Perennials.com
* Plantlust