An open API service indexing awesome lists of open source software.

https://github.com/codekow/demo-containerized-dataset

How to make your dataset available and immutable via containers
https://github.com/codekow/demo-containerized-dataset

containers data-science datasets docker

Last synced: about 1 year ago
JSON representation

How to make your dataset available and immutable via containers

Awesome Lists containing this project

README

          

# Containerized Datasets

The purpose of this repo show options around serving and
maintaining datasets used for training machine learning models
with common tools.

We can use the benefits of containers, immutability and versioning, to support [reproducibility and replicability](https://www.ncbi.nlm.nih.gov/books/NBK547546/) which are key
to **science**.

Disclaimer: This is one method among many and may not meet your needs or be the best option. This is **a way**, NOT **the way**.