https://github.com/rdmpage/index-animalium
Data from uBio and Smithsonian on Sherborn Index Animalium
- Host: GitHub
- URL: https://github.com/rdmpage/index-animalium
- Owner: rdmpage
- Created: 2017-05-16T12:38:20.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2023-05-26T13:13:27.000Z (about 3 years ago)
- Last Synced: 2025-12-26T15:12:31.247Z (6 months ago)
- Language: HTML
- Size: 31.6 MB
- Stars: 2
- Watchers: 1
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Index Animalium
Data from uBio and Smithsonian on Sherborn Index Animalium
For the uBio project see http://www.ubio.org/Sherborne/index.html. The data dumps come from http://www.sil.si.edu/DigitalCollections/indexanimalium/Datasets/
For background see:
Richard, J. M., Pilsk, S. C., & Kalfatovic, M. R. (2016, January 7). Unlocking Index Animalium: From paper slips to bytes and bits. ZooKeys. Pensoft Publishers. [https://doi.org/10.3897/zookeys.550.9673](https://doi.org/10.3897/zookeys.550.9673)
## ION
The folder `ion` contains the results of scraping the ION web site to retrieve the Index Animalium links on ION’s pages.
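
As a rough illustration of this kind of link extraction (not the code in this repository), the sketch below collects `href` values from a saved HTML page using only the Python standard library. The function name `extract_links` and the `marker` substring used to recognise an Index Animalium link are assumptions made for the example; the actual URL pattern on ION's pages would need to be checked.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values of <a> tags whose URL contains a marker string."""

    def __init__(self, marker):
        super().__init__()
        self.marker = marker  # hypothetical substring identifying the links we want
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            # value is None for attributes written without a value
            if name == "href" and value and self.marker in value:
                self.links.append(value)


def extract_links(html, marker):
    """Return all matching link targets found in an HTML string, in page order."""
    parser = LinkCollector(marker)
    parser.feed(html)
    parser.close()
    return parser.links
```

Calling `extract_links(page_html, "indexanimalium")` on the text of a saved page would return the matching URLs in the order they appear, where `"indexanimalium"` is only a placeholder marker.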
## uBio
The `bio` folder contains content from the Wayback Machine detailing uBio’s parsing efforts.
## Smithsonian
The folder `smithsonian` has the data dumps from the Smithsonian. Despite the name, these are not really CSV files: the citations contain commas, which breaks CSV parsing.
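
One common way to cope with commas inside a citation is to limit the number of splits, so that everything after the leading fields is kept as a single value. The sketch below shows that idea only; it is not the parser in this repository. It assumes the citation is the last field and that a known number of comma-free fields precede it. The function name `split_record`, the default of three leading fields, and the sample line are all hypothetical, and the real dump layout may differ.

```python
def split_record(line, n_leading=3, sep=","):
    """Split a line into n_leading fields plus one trailing citation field.

    Assumes (hypothetically) that only the final field contains the separator.
    """
    # maxsplit=n_leading stops splitting after the leading fields,
    # so commas inside the citation are left untouched.
    parts = line.rstrip("\r\n").split(sep, n_leading)
    if len(parts) != n_leading + 1:
        raise ValueError(f"expected {n_leading + 1} fields, got {len(parts)}")
    return [p.strip() for p in parts]


# Made-up example line, not taken from the Smithsonian data
fields = split_record("1,Aus,bus,Smith, J. 1900, Proc. Zool. Soc., 12")
```

If commas can also occur in earlier fields, this approach is not sufficient and the lines would need a pattern-based parser instead.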
## Code
My code to parse ION HTML pages, and to parse the Smithsonian “CSV” files.
## Global Names BHL CoL
The file http://opendata.globalnames.org/bhlnames/ may also be relevant here, depending on how much overlap there is between Catalogue of Life and Index Animalium.