https://github.com/dpriskorn/wikidataorcidscraper
This tool helps scrape DOIs from https://orcid.org/ and curate them using Scholia
- Host: GitHub
- URL: https://github.com/dpriskorn/wikidataorcidscraper
- Owner: dpriskorn
- License: agpl-3.0
- Created: 2024-02-27T17:00:08.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2024-07-15T19:40:43.000Z (about 1 year ago)
- Last Synced: 2025-04-02T13:48:49.519Z (6 months ago)
- Topics: curation, doi, orcid, scholia, scientific-articles, wikidata
- Language: Python
- Homepage:
- Size: 274 KB
- Stars: 4
- Watchers: 2
- Forks: 0
- Open Issues: 12
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Wikidata Orcid Scraper
This tool helps scrape DOIs from https://orcid.org/ and curate them using Scholia
## Features
* lookup of works on orcid.org and extraction of works with DOIs
* lookup of DOIs on Wikidata to determine whether each has been imported
* presentation of a table with this information to the user, helping them import missing articles
* lookup of citation counts in OpenCitations, with links to the results

## Documentation
https://www.wikidata.org/wiki/Wikidata:Tools/Wikidata_Orcid_Scraper

## Choice of SPARQL endpoint
This tool uses the QLever Wikidata SPARQL
endpoint to detect if a DOI is missing in Wikidata.
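
As an illustration of that check, here is a minimal sketch rather than the tool's actual code. It assumes the public QLever Wikidata endpoint URL shown below, the Wikidata DOI property `P356`, and the convention that Wikidata stores DOIs in upper case. The function names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumption: public QLever endpoint for Wikidata; the tool may use another URL.
QLEVER_ENDPOINT = "https://qlever.cs.uni-freiburg.de/api/wikidata"


def build_doi_query(doi: str) -> str:
    """Return a SPARQL query selecting items whose DOI (P356) equals `doi`."""
    # Assumption: Wikidata conventionally stores DOIs in upper case.
    normalised = doi.strip().upper()
    # Escape backslashes and double quotes so the DOI is a valid string literal.
    escaped = normalised.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX wdt: <http://www.wikidata.org/prop/direct/>\n"
        f'SELECT ?item WHERE {{ ?item wdt:P356 "{escaped}" . }} LIMIT 1'
    )


def is_missing(sparql_json: dict) -> bool:
    """True if a SPARQL JSON result has no bindings, i.e. no item has the DOI."""
    return len(sparql_json["results"]["bindings"]) == 0


def doi_missing_in_wikidata(doi: str, endpoint: str = QLEVER_ENDPOINT) -> bool:
    """Query the endpoint (needs network access) and report if the DOI is absent."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": build_doi_query(doi)})
    request = urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return is_missing(json.load(response))
```

Because the answer reflects whatever snapshot the endpoint currently serves, a DOI imported very recently could still be reported as missing.
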
The endpoint is currently updated weekly from the official Wikidata dumps.

The QLever endpoint was chosen because it is faster and more
reliable than the currently available alternatives; see
https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/WDQS_backend_update/WDQS_backend_alternatives#A_performance_evaluation_of_QLever,_Virtuoso,_Blazegraph,_GraphDB,_Stardog,_Jena,_and_Oxigraph

## Development
Run it like so outside Docker:
`$ gunicorn app:app`

## License
GPLv3+