{"id":29016362,"url":"https://github.com/karrlab/datanator","last_synced_at":"2025-09-02T16:34:00.975Z","repository":{"id":76161063,"uuid":"80694368","full_name":"KarrLab/datanator","owner":"KarrLab","description":"Toolkit for discovering and aggregating data for whole-cell modeling","archived":false,"fork":false,"pushed_at":"2022-01-19T01:00:34.000Z","size":77443,"stargazers_count":14,"open_issues_count":7,"forks_count":4,"subscribers_count":7,"default_branch":"master","last_synced_at":"2023-10-20T20:54:42.329Z","etag":null,"topics":["cells","data-aggregation","data-discovery","data-integration","mathematical-modeling","systems-biology"],"latest_commit_sha":null,"homepage":"http://www.datanator.info","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/KarrLab.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2017-02-02T04:50:51.000Z","updated_at":"2023-10-20T20:55:38.259Z","dependencies_parsed_at":null,"dependency_job_id":"ba4284c1-8c36-4717-982a-b7c0fd4110ef","html_url":"https://github.com/KarrLab/datanator","commit_stats":null,"previous_names":[],"tags_count":0,"template":null,"template_full_name":null,"purl":"pkg:github/KarrLab/datanator","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/KarrLab%2Fdatanator","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/KarrLab%2Fdatanator/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/KarrLab%2Fdatanator/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/KarrLab%2Fdatanator/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/KarrLab","download_url":"https://codeload.github.com/KarrLab/datanator/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/KarrLab%2Fdatanator/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":261960448,"owners_count":23236575,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cells","data-aggregation","data-discovery","data-integration","mathematical-modeling","systems-biology"],"created_at":"2025-06-25T22:08:00.134Z","updated_at":"2025-06-25T22:08:01.858Z","avatar_url":"https://github.com/KarrLab.png","language":"Python","readme":"[![Documentation](https://readthedocs.org/projects/datanator/badge/?version=latest)](http://docs.karrlab.org/datanator)\n[![Test results](https://circleci.com/gh/KarrLab/datanator.svg?style=shield)](https://circleci.com/gh/KarrLab/datanator)\n[![Test coverage](https://coveralls.io/repos/github/KarrLab/datanator/badge.svg)](https://coveralls.io/github/KarrLab/datanator)\n[![Code analysis](https://api.codeclimate.com/v1/badges/e9b796130e29aee4672f/maintainability)](https://codeclimate.com/github/KarrLab/datanator)\n[![License](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)\n\n# Datanator: Toolkit for discovering and aggregating data for whole-cell modeling\n\n## Contents\n* [Overview](#overview)\n* [Installation instructions and documentation](#installation-instructions-and-documentation)\n* [Testing Datanator](#testing-datanator)\n* [License](#license)\n* [Development team](#development-team)\n* [Questions and comments](#questions-and-comments)\n\n## Overview\nExtensive data is needed to build comprehensive predictive models of cells. Although the literature and public repositories contain extensive data about cells, this data is hard to utilize for modeling because it is scattered across a large number of sources; because it is described with inconsistent identifiers, units, and data models; and because there are few tools for finding relevant data for modeling specific species and environmental conditions. \n\nDatanator is a software tool for discovering, aggregating, and integrating the data needed for modeling cells. This includes metabolite, RNA, and protein abundances; protein complex compositions; transcription factor binding motifs; and kinetic parameters. Datanator is particularly useful for building large models, such as whole-cell models, that require large amounts of data to constrain large numbers of parameters.\n\nThis package contains the source code for Datanator. The data aggregated with Datanator is available at [https://www.datanator.info](https://www.datanator.info). The data is also available for download as MongoDB snapshot from [Zenodo](https://doi.org/10.5281/zenodo.3971048).\n\n## Installation instructions and documentation\nPlease see the [documentation](http://docs.karrlab.org/datanator) for installation instructions, user instructions, and code documentation. \n\nNote, Datanator only supports Python 3. \n\nIf one needs to use the datanator database hosted by Karr Lab, one will need `karr_lab_build_config` repository saved\nas `.wc` in the user home directory.\n\n\n## Testing Datanator\nTo ensure Datanator works properly, we have developed extensive units tests of every aspect of `datanator`. We recommend using `pytest` to run these tests as follows:\n\n```\npython3 -m pytest tests\n```\n\n## License\nWe aim to provide data and tools for working with this data with no additional restrictions beyond those imposed by the third-party data sources and software libraries used to construct Datanator.\n\nThe content of the Datanator database is a compilation of data curated by the Datanator Team and data aggregated from third-party databases. The copyrightable content curated by the Datanator Team is released under the Creative Commons 1.0 Universal (CC0) [License](LICENSE-DATA). The content from third-party databases is available under the licenses summarized [here](LICENSE-THIRD-PATRY-DATA).\n\nThe structure of the database is released under the CC0 [License](LICENSE-DATABASE-STRUCTURE). This software is released open-source under the MIT [License](LICENSE).\n\n## Development team\nThe model was developed by the [Karr Lab](https://www.karrlab.org) at the Icahn School of Medicine at Mount Sinai in New York, US.\n\n* Yosef Roth\n* Zhouyang Lian\n* Saahith Pochiraju\n* Balazs Szigeti\n* Jonathan Karr\n\n## Questions and comments\nPlease contact the [Karr Lab](https://www.karrlab.org) with any questions or comments.\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkarrlab%2Fdatanator","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkarrlab%2Fdatanator","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkarrlab%2Fdatanator/lists"}