https://github.com/dpriskorn/wikidataorcidscraper

This tool helps scrape DOIs from https://orcid.org/ and curate them using Scholia
https://github.com/dpriskorn/wikidataorcidscraper

curation doi orcid scholia scientific-articles wikidata

Last synced: 30 days ago
JSON representation

This tool helps scrape DOIs from https://orcid.org/ and curate them using Scholia

Host: GitHub
URL: https://github.com/dpriskorn/wikidataorcidscraper
Owner: dpriskorn
License: agpl-3.0
Created: 2024-02-27T17:00:08.000Z (over 1 year ago)
Default Branch: master
Last Pushed: 2024-07-15T19:40:43.000Z (about 1 year ago)
Last Synced: 2025-04-02T13:48:49.519Z (6 months ago)
Topics: curation, doi, orcid, scholia, scientific-articles, wikidata
Language: Python
Homepage:
Size: 274 KB
Stars: 4
Watchers: 2
Forks: 0
Open Issues: 12
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Wikidata Orcid Scraper
This tool helps scrape DOIs from https://orcid.org/ and curate them using Scholia

![image](https://github.com/dpriskorn/WikidataOrcidScraper/assets/68460690/33e917a1-09f8-4e22-8830-85b178964c58)

## Features
* lookup on works on orcid.org and extraction of work with DOIs
* lookup of DOIs on Wikidata to determine if it has been imported or not
* presenting a table with information to the user and helping them import missing articles
* lookup of citation count in OpenCitations and linking to the result

## Documentation
https://www.wikidata.org/wiki/Wikidata:Tools/Wikidata_Orcid_Scraper

## Choice of SPARQL endpoint
This tool uses the QLever Wikidata SPARQL
endpoint to detect if a DOI is missing in Wikidata.
The endpoint is currently updated weekly from the official Wikidata dumps.

The QLever endpoint was chosen because it is faster and more
reliable than current available alternatives, see
https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/WDQS_backend_update/WDQS_backend_alternatives#A_performance_evaluation_of_QLever,_Virtuoso,_Blazegraph,_GraphDB,_Stardog,_Jena,_and_Oxigraph.

## Development
Run it like so outside Docker:
`$ gunicorn app:app`

## License
GPLv3+

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/dpriskorn/wikidataorcidscraper

Awesome Lists containing this project

README