https://github.com/cmungall/dasher

Base framework for creating semantic QC dashboards
https://github.com/cmungall/dasher

kg knowledge-graph obo owl qc quality-control semantic semweb

Last synced: about 1 year ago
JSON representation

Base framework for creating semantic QC dashboards

Host: GitHub
URL: https://github.com/cmungall/dasher
Owner: cmungall
License: cc0-1.0
Created: 2020-07-03T00:04:36.000Z (almost 6 years ago)
Default Branch: master
Last Pushed: 2020-07-03T02:06:33.000Z (almost 6 years ago)
Last Synced: 2025-02-10T09:45:05.665Z (over 1 year ago)
Topics: kg, knowledge-graph, obo, owl, qc, quality-control, semantic, semweb
Language: Python
Homepage: http://cmungall.github.io/dasher
Size: 104 KB
Stars: 1
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Dasher: Semantic QC Framework

Base framework for running QC checks over multiple knowledge sources. Potential examples include:

- OBO Dashboard
- GO Rules
- Monarch
- KG frameworks

See
https://docs.google.com/document/d/1ld73pVz_BIH22jRBZuV0RVDeSiuGyQpD1u_F9Yv9gg0/edit#

The core components:

- a schema for representing both metrics themselves and the results of evaluating them
- TODO: code for downloading sources and loading them into a database
- TODO: code for executing these metrics on these sources
- TODO: lightweight view layer to generate an HTML grid summary

## Schema

Browse the schema here: [http://cmungall.github.io/dasher](http://cmungall.github.io/dasher)

See the [schema/](schema/) folder

The source is in YAML (biolinkml)

Currently the main derived artefacts of interest are:

- [JSON Schema](src/schema/semqc.schema.json)
- [Python dataclasses](src/schema/semqc_datamodel.py)

## Download/Preparation

TODO

The idea is to use blazegraph-runner here.

Assume each source is trivially convertable to RDF

## Executing Metrics

TODO

Assumption that a number of generic mechanisms can be used:

- OWL: Reasoning
- Shapes/ShEx
- JSON-Schema validation (requires JSON-LD framing)
- SPARQL (e.g. robot checks)

Additionally, each implementation using this framework is free to add in their own procedural checks, e.g. in Python, although declarative is preferred

The code will generate json/yaml-ld conforming to the schema above

Implementing pipelines e.g Jenkins can choose to do things like fail-fast based on queries of result set

## Display

TODO

Something simple such as mustache / liquid templates over the above json-ld

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/cmungall/dasher

Awesome Lists containing this project

README