{"id":50915427,"url":"https://github.com/open-sci/2023-2024-atreides-code","last_synced_at":"2026-06-16T14:04:19.285Z","repository":{"id":237537662,"uuid":"781322894","full_name":"open-sci/2023-2024-atreides-code","owner":"open-sci","description":"The repository for the team Atreides of the Open Science course a.a. 2023/2024","archived":false,"fork":false,"pushed_at":"2025-08-07T10:21:04.000Z","size":21133,"stargazers_count":0,"open_issues_count":0,"forks_count":1,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-09-05T09:59:18.493Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"isc","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/open-sci.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":"CITATION.cff","codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2024-04-03T06:59:29.000Z","updated_at":"2025-08-07T10:21:08.000Z","dependencies_parsed_at":"2024-05-07T21:29:24.093Z","dependency_job_id":"200f050a-a967-4f7c-a01d-f242521fa525","html_url":"https://github.com/open-sci/2023-2024-atreides-code","commit_stats":null,"previous_names":["open-sci/2023-2024-atreides-code"],"tags_count":5,"template":false,"template_full_name":null,"purl":"pkg:github/open-sci/2023-2024-atreides-code","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/open-sci%2F2023-2024-atreides-code","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/open-sci%2F2023-2024-atreides-code/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/open-sci%2F2023-2024-atreides-code/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/open-sci%2F2023-2024-atreides-code/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/open-sci","download_url":"https://codeload.github.com/open-sci/2023-2024-atreides-code/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/open-sci%2F2023-2024-atreides-code/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":34408796,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-16T02:00:06.860Z","response_time":126,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2026-06-16T14:04:18.468Z","updated_at":"2026-06-16T14:04:19.270Z","avatar_url":"https://github.com/open-sci.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 2023-2024-atreides-code\nThe repository for the team Atreides of the Open Science course a.a. 2023/2024\n\n[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.11262416.svg)](https://doi.org/10.5281/zenodo.11262416)\n[![SWH](https://archive.softwareheritage.org/badge/origin/https://github.com/open-sci/2023-2024-atreides-code/)](https://archive.softwareheritage.org/browse/origin/?origin_url=https://github.com/open-sci/2023-2024-atreides-code)\n\n## Usage\n### Installation\n\n```sh\n# Clone the repository\ngit clone https://github.com/open-sci/2023-2024-atreides-code\n\n# Move to the repository folder\ncd 2023-2024-atreides-code\n\n# Install required dependencies using uv\n# uv install options: https://docs.astral.sh/uv/getting-started/installation/\nuv sync\n\n# Activate the virtual environment\nsource .venv/bin/activate\n```\n\n### Run the software\n\nCreate the necessary datasets ('IRIS in Meta' and 'IRIS in Index' are required to answer the research questions) by running the following command:\n\n```sh\npython3 -m scripts.create_datasets -meta \u003cpath_to_meta_zip\u003e -iris \u003cpath_to_iris_zip\u003e [-index \u003cpath_to_index_zip\u003e] \u003cdataset_of_choice\u003e [--year_cutoff \u003cyear\u003e]\n```\n\n#### Arguments\n\n- `-meta, --meta_path`: Required. The path to the folder (or zip file) containing the OpenCitations Meta dump.\n- `-iris, --iris_path`: Required. The path to the folder (or zip file) containing the IRIS CSV files.\n- `-index, --index_path`: The path to the OpenCitations Index dump folder (or zip).\n- `-iim, --iris_in_meta`: Create the \"Iris In Meta\" dataset, which contains all the entities with external IDs in IRIS that are in Meta.\n- `-iii, --iris_in_index`: Create the \"Iris In Index\" dataset, which contains all the entities with external IDs in IRIS that are in the OpenCitations Index.\n- `-inim, --iris_not_in_meta`: Create the \"Iris Not In Meta\" dataset, which contains all the entities with external IDs in IRIS that are not in Meta.\n- `-inoid, --iris_no_id`: Create the \"Iris No ID\" dataset, which contains all the entities with no external IDs in IRIS.\n- `-yc, --year-cutoff`: (Optional) Specify a year cutoff for the mapping of IRIS data. Only entities published prior or during this year will be included in the new datasets.\n- `--search_for_titles`: (Experimental) Try to reconcile the IRIS entities without PIDs using their title in OC Meta. \u003cins\u003eThis can take around 3 hours to complete.\u003c/ins\u003e\n\nAlternatively, you can download the processed datasets from the links provided below and place them in the `data/` directory of the repository folder.\n\nUse the following command to get the answers to the research questions:\n\n```sh\npython3 -m scripts.answer_research_questions [-rq \u003cresearch_question_number\u003e]\n```\n\n- `-rq \u003cresearch_question_number\u003e`: (Optional) Specify the research question number to answer a specific question.\n\n\nFor more detailed guidelines consult the protocol for the software:\n\n[![protocols.io](https://a11ybadges.com/badge?logo=protocolsdotio)](https://dx.doi.org/10.17504/protocols.io.g6xmbzfk7)\n\n\n## Research questions:\n\n1) What is the coverage of the publications available in IRIS, that strictly concern research conducted within the University of Bologna, in OpenCitations Meta?\n2) What are the types of publications that are better covered in OpenCitations Meta?\n3) What is the amount of citations (according to OpenCitations Index) the IRIS publications included in OpenCitations Meta are involved in (as citing entity and as cited entity)?\n4) How many of these citations come from and go to publications not included in IRIS?\n5) How many of these citations involve publications in IRIS as both citing and cited entities?\n\n## Download original datasets\n\n- UNIBO IRIS bibliographic data dump, dated 30 May 2025, updated on 3 July 2025: [https://doi.org/10.6092/unibo/amsacta/8427](https://doi.org/10.6092/unibo/amsacta/8427)\n\n- OpenCitations Meta CSV dataset of all bibliographic metadata (June 2025): [https://doi.org/10.5281/zenodo.15625651](https://doi.org/10.5281/zenodo.15625651)\n\n- OpenCitations Index CSV dataset of all the citation data (July 2025): [https://doi.org/10.6084/m9.figshare.24356626.v6](https://doi.org/10.6084/m9.figshare.24356626.v6)\n\n## Output datasets\n\n- IRIS in Meta: [https://doi.org/10.6084/m9.figshare.25879420.v3](https://doi.org/10.6084/m9.figshare.25879420.v3)\n\n- IRIS in Index: [https://doi.org/10.6084/m9.figshare.25879441.v3](https://doi.org/10.6084/m9.figshare.25879441.v3)\n\n- IRIS Not in Meta: [https://doi.org/10.6084/m9.figshare.25897708.v3](https://doi.org/10.6084/m9.figshare.25897708.v3)\n\n- IRIS No ID: [https://doi.org/10.6084/m9.figshare.25897759.v3](https://doi.org/10.6084/m9.figshare.25897759.v3)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fopen-sci%2F2023-2024-atreides-code","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fopen-sci%2F2023-2024-atreides-code","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fopen-sci%2F2023-2024-atreides-code/lists"}