{"id":20837793,"url":"https://github.com/astrazeneca/onto_merger","last_synced_at":"2026-02-20T01:31:17.358Z","repository":{"id":37724169,"uuid":"488946363","full_name":"AstraZeneca/onto_merger","owner":"AstraZeneca","description":"OntoMerger is an ontology alignment library for deduplicating knowledge graph nodes that represent the same domain.","archived":false,"fork":false,"pushed_at":"2024-01-11T19:22:08.000Z","size":34352,"stargazers_count":96,"open_issues_count":5,"forks_count":6,"subscribers_count":6,"default_branch":"main","last_synced_at":"2025-06-11T05:03:41.770Z","etag":null,"topics":["algorithm","alignment","biological-networks","biology","graph","kg","knowledge","knowledge-graph","mapping","ontology","ontology-alignment"],"latest_commit_sha":null,"homepage":"","language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/AstraZeneca.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2022-05-05T11:33:45.000Z","updated_at":"2025-05-03T10:03:17.000Z","dependencies_parsed_at":"2022-08-08T21:30:30.320Z","dependency_job_id":"7b0eee08-c0be-4ce6-9f79-339c19f53c56","html_url":"https://github.com/AstraZeneca/onto_merger","commit_stats":{"total_commits":214,"total_committers":5,"mean_commits":42.8,"dds":"0.20560747663551404","last_synced_commit":"67db652c1fa691847f28f745587bd208fa8e7c32"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/AstraZeneca/onto_merger","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AstraZeneca%2Fonto_merger","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHu
b/repositories/AstraZeneca%2Fonto_merger/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AstraZeneca%2Fonto_merger/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AstraZeneca%2Fonto_merger/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/AstraZeneca","download_url":"https://codeload.github.com/AstraZeneca/onto_merger/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AstraZeneca%2Fonto_merger/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":262067911,"owners_count":23253698,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["algorithm","alignment","biological-networks","biology","graph","kg","knowledge","knowledge-graph","mapping","ontology","ontology-alignment"],"created_at":"2024-11-18T01:08:36.212Z","updated_at":"2025-10-17T15:26:48.071Z","avatar_url":"https://github.com/AstraZeneca.png","language":"HTML","readme":"[pypi-image]: https://badge.fury.io/py/onto_merger.svg\n[pypi-url]: https://pypi.python.org/pypi/onto_merger\n[size-image]: https://img.shields.io/github/repo-size/AstraZeneca/onto_merger.svg\n[size-url]: https://github.com/AstraZeneca/onto_merger/archive/main.zip\n[build-image]: https://github.com/AstraZeneca/onto_merger/workflows/CI/badge.svg\n[build-url]: https://github.com/AstraZeneca/onto_merger/actions?query=workflow%3ACI\n[docs-image]: https://readthedocs.org/projects/ontomerger/badge/?version=latest\n[docs-url]: 
https://ontomerger.readthedocs.io/en/latest/?badge=latest\n[coverage-image]: https://codecov.io/gh/AstraZeneca/onto_merger/branch/main/graph/badge.svg\n[coverage-url]: https://codecov.io/github/AstraZeneca/onto_merger?branch=main\n\n\u003cp align=\"center\"\u003e\n  \u003cimg width=\"90%\" src=\"https://github.com/AZ-AI/onto_merger/blob/main/images/onto_merger_logo.jpg?sanitize=true\" /\u003e\n\u003c/p\u003e\n\n--------------------------------------------------------------------------------\n\n[![PyPI Version][pypi-image]][pypi-url]\n[![Docs Status][docs-image]][docs-url]\n[![Code Coverage][coverage-image]][coverage-url]\n[![Build Status][build-image]][build-url]\n[![Arxiv](https://img.shields.io/badge/ArXiv-2206.02238-orange.svg)](https://arxiv.org/abs/2206.02238)\n\n**[Paper](https://arxiv.org/abs/2206.02238)** | **[Documentation](https://ontomerger.readthedocs.io)** | **[External Resources](https://ontomerger.readthedocs.io/en/latest/notes/resources.html)**\n\n--------------------------------------------------------------------------------\n\n*OntoMerger* is an ontology alignment library for **deduplicating** knowledge\ngraph nodes (i.e. ontology concepts) that represent the *same domain*, e.g. diseases,\nand **connecting** them to form a single directed acyclic hierarchical graph (DAG) (i.e. an ontology\nclass hierarchy).  
The library implements a pipeline that takes *nodes, mappings and\n(disconnected) hierarchies* as input and produces *node merges* and a *connected hierarchy*.\nIt also provides analysis and data testing for fine-tuning the inputs to\nfurther reduce duplication and increase connectivity.\n\n--------------------------------------------------------------------------------\n\n**Citing**\n\n\nIf you find *OntoMerger* useful in your work or research, please consider adding the following citation:\n\n```bibtex\n@misc{ontomerger,\n  doi = {10.48550/ARXIV.2206.02238},\n  author = {Geleta, David and Nikolov, Andriy and ODonoghue, Mark and Rozemberczki, Benedek and Gogleva, Anna and Tamma, Valentina and Payne, Terry R.},\n  title = {OntoMerger: An Ontology Integration Library for Deduplicating and Connecting Knowledge Graph Nodes},\n  publisher = {arXiv},\n  year = {2022},\n}\n```\n--------------------------------------------------------------------------------\n\n**Getting Started**\n\n```python\n\nfrom onto_merger.pipeline import Pipeline\n\n# initialise the pipeline\npipeline = Pipeline(project_folder_path=\"../path/to/project\")\n\n# run the process\npipeline.run_alignment_and_connection_process()\n\n# view results in \"../path/to/project/output/report/index.html\"\n```\n\n--------------------------------------------------------------------------------\n\n**Running tests**\n\n```\n$ tox -e py\n```\n--------------------------------------------------------------------------------\n\n**License**\n\n- [Apache 2.0 License](https://github.com/AstraZeneca/onto_merger/blob/main/LICENSE)\n\n--------------------------------------------------------------------------------\n\n**Credit**\n\nThe **Onto Merger** logo is based on:\n\n- [Galguna Font](https://www.dafont.com/galguna.font)\n- [Noun Project 
Icons](https://thenounproject.com/)\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fastrazeneca%2Fonto_merger","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fastrazeneca%2Fonto_merger","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fastrazeneca%2Fonto_merger/lists"}