{"id":23472981,"url":"https://github.com/caltechlibrary/irdm_harvester","last_synced_at":"2025-04-13T05:49:54.450Z","repository":{"id":172873556,"uuid":"632079685","full_name":"caltechlibrary/irdm_harvester","owner":"caltechlibrary","description":"Automatically harvest publications for an InvenioRDM repository","archived":false,"fork":false,"pushed_at":"2025-04-11T15:18:42.000Z","size":756,"stargazers_count":1,"open_issues_count":5,"forks_count":0,"subscribers_count":5,"default_branch":"main","last_synced_at":"2025-04-13T05:49:48.769Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/caltechlibrary.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGES.md","contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":"CITATION.cff","codeowners":null,"security":null,"support":"SUPPORT.md","governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":"codemeta.json","zenodo":null}},"created_at":"2023-04-24T17:04:14.000Z","updated_at":"2025-04-11T15:18:46.000Z","dependencies_parsed_at":"2023-10-17T05:29:53.820Z","dependency_job_id":"f6a89a16-a115-45ef-b7e3-72fdddbe7e43","html_url":"https://github.com/caltechlibrary/irdm_harvester","commit_stats":null,"previous_names":["caltechlibrary/irdm_harvester"],"tags_count":2,"template":false,"template_full_name":"caltechlibrary/template","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/caltechlibrary%2Firdm_harvester","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/caltechlibrary%2Firdm_harvester/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/caltechlibrary%2Firdm_harvester/releases","manifests_url
":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/caltechlibrary%2Firdm_harvester/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/caltechlibrary","download_url":"https://codeload.github.com/caltechlibrary/irdm_harvester/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248670501,"owners_count":21142901,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-24T17:14:36.727Z","updated_at":"2025-04-13T05:49:54.420Z","avatar_url":"https://github.com/caltechlibrary.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# InvenioRDM Harvester\n\nThis is a harvester that can automatically collect and submit works to an\nInvenioRDM repository. 
It currently works with the CaltechAUTHORS repository and harvests from CrossRef and ORCID.\n\n[![License](https://img.shields.io/badge/License-BSD--like-lightgrey)](https://choosealicense.com/licenses/bsd-3-clause)\n[![Latest\nrelease](https://img.shields.io/github/v/release/caltechlibrary/irdm_harvester.svg?color=b44e88)](https://github.com/caltechlibrary/irdm_harvester/releases)\n[![DOI](https://img.shields.io/badge/dynamic/json.svg?label=DOI\u0026query=$.pids.doi.identifier\u0026uri=https://data.caltech.edu/api/records/c14ab-m2d78/versions/latest)](https://data.caltech.edu/records/c14ab-m2d78/latest)\n\n## Table of contents\n\n* [Introduction](#introduction)\n* [Usage](#usage)\n* [Installation](#installation)\n* [Known issues and limitations](#known-issues-and-limitations)\n* [Getting help](#getting-help)\n* [Contributing](#contributing)\n* [License](#license)\n* [Authors and history](#authors-and-history)\n* [Acknowledgments](#acknowledgments)\n\n\n## Introduction\n\nThe harvester currently collects works from:\n\n* CrossRef, by ROR\n* ORCID\n* CrossRef, by DOI\n\n## Usage\n\nHarvests are typically run through [GitHub Actions](https://github.com/caltechlibrary/irdm_harvester/actions),\nbut they can also be run from the command line.\n\nCommand-line use requires a CaltechAUTHORS token in the environment variable\n`RDMTOK`. 
For a CrossRef ROR harvest, type:\n\n```bash\npython harvest.py crossref\n```\n\nTo harvest a specific DOI, type:\n\n```bash\npython harvest.py -doi 10.7717/peerj-cs.1023\n```\n\nFor an ORCID harvest, type:\n\n```bash\npython harvest.py orcid -orcid 0000-0001-9266-5146\n```\n\nAll harvests accept an `-actor` flag, whose value is included in the message when the record is added to the queue.\n\n## Installation\n\nFor command-line use you need the latest version of `irdmtools` installed:\n\n`curl https://caltechlibrary.github.io/irdmtools/installer.sh | sh`\n\nThen install the Python requirements with:\n\n`pip install -r requirements.txt`\n\n## Known issues and limitations\n\nWhile this approach should work for any InvenioRDM repository, it has only been tested on\nCaltechAUTHORS. If you're interested in using it with a different repository, reach out;\nwe would be happy to make it more flexible.\n\nPublishers use a wide variety of URLs for licenses. We are currently adding\nvariants to the `license.csv` file, a custom file that maps license URLs to\nInvenioRDM license names. It is almost certainly incomplete.\n\n## Getting help\n\nOpen an issue in the repository's Issues tab.\n\n## Contributing\n\nPull requests are appreciated.\n\n## License\n\nSoftware produced by the Caltech Library is Copyright © 2022 California Institute of Technology.  This software is freely distributed under a BSD-style license.  Please see the [LICENSE](LICENSE) file for more information.\n\n## Authors and history\n\nGitHub action created by Tom Morrell. 
Robert Doiel and Tom Morrell wrote\nthe underlying `irdmtools` package.\n\n## Acknowledgments\n\nThis work was funded by the California Institute of Technology Library.\n\n\n\u003cdiv align=\"center\"\u003e\n  \u003cbr\u003e\n  \u003ca href=\"https://www.caltech.edu\"\u003e\n    \u003cimg width=\"100\" height=\"100\" src=\"https://raw.githubusercontent.com/caltechlibrary/template/main/.graphics/caltech-round.png\"\u003e\n  \u003c/a\u003e\n\u003c/div\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcaltechlibrary%2Firdm_harvester","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcaltechlibrary%2Firdm_harvester","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcaltechlibrary%2Firdm_harvester/lists"}