Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/microbiomedata/nmdc-schema
National Microbiome Data Collaborative (NMDC) unified data model
https://github.com/microbiomedata/nmdc-schema
ecoinformatics fair-data genomics linkml metabolomics metadata microbiome microbiomedata mixs nmdc proteomics samples schema standards transcriptomics
Last synced: 3 days ago
JSON representation
National Microbiome Data Collaborative (NMDC) unified data model
- Host: GitHub
- URL: https://github.com/microbiomedata/nmdc-schema
- Owner: microbiomedata
- License: cc0-1.0
- Created: 2021-04-03T00:54:27.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2025-01-16T21:29:04.000Z (11 days ago)
- Last Synced: 2025-01-17T12:15:21.778Z (10 days ago)
- Topics: ecoinformatics, fair-data, genomics, linkml, metabolomics, metadata, microbiome, microbiomedata, mixs, nmdc, proteomics, samples, schema, standards, transcriptomics
- Language: Python
- Homepage: https://microbiomedata.github.io/nmdc-schema/
- Size: 136 MB
- Stars: 28
- Watchers: 9
- Forks: 8
- Open Issues: 371
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
# National Microbiome Data Collaborative Schema
[![PyPI - License](https://img.shields.io/pypi/l/nmdc-schema)](https://github.com/microbiomedata/nmdc-schema/blob/main/LICENSE)
[![PyPI version](https://badge.fury.io/py/nmdc-schema.svg?any-param=to-cause-a-cache-miss)](https://badge.fury.io/py/nmdc-schema)The mission of the NMDC is to build a FAIR microbiome data sharing network, through infrastructure, data standards,
and community building, that addresses pressing challenges in environmental sciences. The NMDC platform is built on
top of a unified data model (schema) that weaves together existing standards and ontologies to provide a systematic
representation of all aspects of the microbiome data life cycle.This repository mainly defines a [LinkML](https://github.com/linkml/linkml) schema for managing metadata from
the [National Microbiome Data Collaborative (NMDC)](https://microbiomedata.org/).## Documentation
The documentation for the NMDC schema can be found at [https://microbiomedata.github.io/nmdc-schema/](https://microbiomedata.github.io/nmdc-schema/).
This documentation is aimed at consumers of NMDC data and metadata, it describes the different data elements used to describe studies, samples,
sample processing, data generation, workflows, and downstream data objects.The NMDC [Introduction to metadata and ontologies](https://microbiomedata.org/introduction-to-metadata-and-ontologies/)
primer provides some the context for this project.The remainder of this page is primary for the internal maintainers and contributors to the NMDC schema
## Repository Contents Overview
Some products that are maintained, and tasks orchestrated within this repository are:
- Maintenance of LinkML YAML that specifies the NMDC Schema
- [src/schema/nmdc.yaml](src/schema/nmdc.yaml)
- and various other YAML schemas imported by it,
like [prov.yaml](src/schema/prov.yaml), [annotation.yaml](src/schema/annotation.yaml), etc. all which you can find
in the [src/schema](src/schema/) folder
- Makefile targets for converting the schema from it's native LinkML YAML format to other artifact
like [JSON Schema](project/jsonschema/nmdc.schema.json)
- Build, deployment and distribution of the schema as a PyPI package
- Automatic publishing of refreshed documentation upon change to the schema,
accessible [here](https://microbiomedata.github.io/nmdc-schema/)## Maintaining the Schema
See [DEVELOPMENT.md](DEVELOPMENT.md) for instructions on setting up a development environment.
See [MAINTAINERS.md](MAINTAINERS.md) for instructions on using that development environment to maintain the schema.
## Makefiles
Makefiles are text files people can use to tell [`make`](https://www.gnu.org/software/make/manual/make.html#Introduction) (a computer program) how it can _make_ things (or—in general—_do_ things). In the world of Makefiles, those _things_ are called _targets_.
This repo contains 2 Makefiles:
- `Makefile`, based on the generic Makefile from the [LinkML cookiecutter](https://github.com/linkml/linkml-project-cookiecutter)
- `project.Makefile`, which contains _targets_ that are specific to this projectHere's an example of using `make` in this repo:
```shell
# Deletes all files in `examples/output`.
make examples-clean
```
> The `examples-clean` _target_ is defined in the `project.Makefile`. In this repo, the `Makefile` `include`s the `project.Makefile`. As a result, `make` has access to the _targets_ defined in both files.## Data downloads
The NMDC's metadata about biosamples, studies, bioinformatics workflows, etc. can be obtained from our nmdc-runtime API.
Try entering "biosample_set" or "study_set" into the `collection_name` box
at https://api.microbiomedata.org/docs#/metadata/list_from_collection_nmdcschema__collection_name__getOr use the API programmatically! Note that some collections are large, so the responses are paged.
You can learn about the other available collections at https://microbiomedata.github.io/nmdc-schema/Database/