Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-fair

A curated list of awesome stuff around the FAIR principles for (scientific) data, i.e that data is findable, accessable, interoperable and re-usable.
https://github.com/Materials-Data-Science-and-Informatics/awesome-fair

Last synced: 3 days ago
JSON representation

  • Resources about the FAIR principles

  • FAIR assessment

      • F-uji - An (online) tool which can provide a FAIR score for a given PID based on a metric created by FAIRsFAIR, [code](https://github.com/pangaea-data-publisher/fuji).
      • FAIR Evaluation Services - A FAIR assessment tool from FAIRsharing, [code](https://github.com/FAIRMetrics/Metrics).
  • Organizations and Communities

      • EuDat - Collaborative European data infrastructure.
      • FAIRsharing - A curated resource on data and metadata standards, inter-related to databases and data policies.
      • Research Data Alliance - International organization and communication platform for establishing standards and recommendations concerning research data publication.
      • The Turing Way - Handbook and community for reproducible, ethical and collaborative data science.
      • The Turing Way - Handbook and community for reproducible, ethical and collaborative data science.
  • Metadata formats and standards

      • DataCite - Metadata schema developed by international community with increasing adoption by repositories
      • Data Catalog (DCAT) - RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web.
      • Dublin Core Metadata Initiative Terms - Dublin Core Metadata Element Set, is a set of fifteen "core" elements for describing resources.
      • JSON LD Playground - Convert JSON-LD data between various representations.
      • JSON Schema - Standard for the description of structural constraints in order to do validation of JSON objects.
      • Resource Description Framework (RDF) - RDF is a standard model for data interchange on the Web.
      • Schema.org - Well-established and industry-accepted vocabulary providing semantics for common entities like Person, Organization, Dataset, etc.
      • SKOS - The Simple Knowledge Organization System (SKOS) is a common data model for sharing and linking knowledge organization systems via the Semantic Web.
      • Provenance Primer (PROV) - This primer document provides an accessible introduction to the PROV data model for provenance interchange on the Web.
  • Ontology services

      • Ontobee - A linked ontology data server to support ontology term dereferencing, linkage, query and integration.
      • publication
      • Ontology Lookup Service - OLS is a repository for biomedical ontologies that aims to provide a single point of access to the latest ontology versions.
  • Finding datasets and software

      • Datacite commons - Search through the metadata indexed by Datacite.
      • EuDat B2find - Search through metadata of datasets accumulated by EuDat.
      • OpenAIRE explorer - Search through the metadata indexed by openaire.
      • Schole explorer - A data literature interlinking service (former scholix), indexes links between data and journal publications. It also provides interfaces and APIs to query the graph.
      • Research Software Repository - Aggregates research software from various sources with information about the problem it solves and its scientific domain.
  • Software and software publications

      • CITATION.CFF - Plain text files with human- and machine-readable citation information for software (and datasets). Supported by GitHub, Zenodo, Zotero.
      • Citable code with Zenodo & GitHub - Make GitHub repositories citable with Zenodo DOI.
      • CodeMeta - CodeMeta works on providing a minimal metadata schema for science software and code, in JSON and XML to create a concept vocabulary that can be used to standardize the exchange of software metadata across repositories and organizations.
      • HERMES - A CI based workflow to create and publish software publications to well known repositories.
  • Provenance tracking

      • AiiDA - Automated Interactive Infrastructure and Database for Computational Science (AiiDA) to automatically track provenance of simulation workflows and all associated data, [code](https://github.com/aiidateam/aiida-core).
      • DataLad - A free and open-source distributed data management system for everyone. It is based on git-annex with manual to automatic provenance tracking, [code](https://github.com/datalad/datalad).
      • MLflow - Tool to track the provenance of machine learning applications, [code](https://github.com/mlflow/mlflow/).
      • CWL - Domain-agnostic and community-driven open standard for description and execution of research workflows that supports provenance tracking ([CWLProv](https://github.com/common-workflow-language/cwltool/blob/main/CWLProv.rst)) in a standard-compliant way using the existing RO Crate, PROV and BagIt standards.
  • Your own repository setup

      • Dataverse - Open source research data repository software [code](https://github.com/IQSS/dataverse).
      • EuDat B2share - A repository by EuDat, but the software is open sourc, bases in invenio and one can setup own instances of it, [code](https://github.com/EUDAT-B2SHARE/b2share).
      • Invenio - Open source customizable software to setup large scale digital repositories, library systems and data repositories, [code](https://github.com/inveniosoftware/invenio).
      • InvenioRDM - The turn-key research data management repository based on Invenio framework and Zenodo.
  • Awesome meta data sources

      • Microsoft academy graph - All the data and links from Mircosoft academy (shutdown end of 2021).
      • Openaire graph - All metadata contained in the openaire graph.
      • Scholix - A schema for scholarly links. Implemented and deployed by several scholarly link providers.
      • CrossRef - Organization building connections between related entities, building a queryable graph.