https://github.com/rdmpage/world-spider-catalog
Harvesting the World Spider Catalog
https://github.com/rdmpage/world-spider-catalog
Last synced: 3 months ago
JSON representation
Harvesting the World Spider Catalog
- Host: GitHub
- URL: https://github.com/rdmpage/world-spider-catalog
- Owner: rdmpage
- Created: 2017-05-19T11:41:36.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2017-09-30T09:59:35.000Z (over 8 years ago)
- Last Synced: 2025-03-10T18:48:10.151Z (about 1 year ago)
- Language: HTML
- Size: 8.31 MB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# world-spider-catalog
Harvesting the World Spider Catalog
## Linking names to references
The LSID XML doesn’t include the URL for the reference, but we can get these (it seems) like this:
```sql
SELECT * FROM names INNER JOIN `references` ON names.namePublishedIn = `references`.citation LIMIT 10;
```
## Mapping tasks
### Generate dump of names to match to ION
```
SELECT id, nameComplete, REPLACE(taxonAuthor, ",","") FROM names WHERE taxonAuthor IS NOT NULL and taxonAuthor <> “” LIMIT 100;
```
### Map to ION
```
SELECT worldspiders.id, names.id, `worldspiders`.nameComplete, `worldspiders`.taxonAuthor FROM `worldspiders` JOIN names WHERE `worldspiders`.nameComplete = `names`.nameComplete AND `worldspiders`.taxonAuthor = `names`.taxonAuthor limit 100;
```