https://github.com/zillow/intake-nested-yaml-catalog

Supports a single YAML file hierarchical catalog to organize datasets and avoid a data swamp.

.. image:: https://travis-ci.org/zillow/intake-nested-yaml-catalog.svg?branch=master
   :target: https://travis-ci.org/zillow/intake-nested-yaml-catalog

.. image:: https://coveralls.io/repos/github/zillow/intake-nested-yaml-catalog/badge.svg?branch=master
   :target: https://coveralls.io/github/zillow/intake-nested-yaml-catalog?branch=master

.. image:: https://readthedocs.org/projects/intake-nested-yaml-catalog/badge/?version=latest
   :target: https://intake-nested-yaml-catalog.readthedocs.io/en/latest/?badge=latest
   :alt: Documentation Status

Welcome to Intake plugin for nested YAML catalogs
==================================================

This is an Intake plugin that supports a single, hierarchical YAML catalog
for organizing datasets and avoiding a data swamp.

The following example organizes datasets by business domain entity:

.. code-block:: yaml

   metadata:
     hierarchical_catalog: true
   entity:
     customer:
       customer_attributes:
         args:
           urlpath: s3://foo
         driver: parquet
     user:
       user_profile:
         args:
           urlpath: s3://foo
         driver: parquet

A dataset in this catalog can be accessed as:

.. code-block:: python

   df = catalog.entity.customer.customer_attributes.read()
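
To make the mapping between the nested YAML keys and the dotted access path concrete, here is a minimal sketch that walks a plain dictionary mirroring the example catalog and lists the path of every data source. It is not the plugin's implementation: ``CATALOG``, ``iter_entries`` and ``list_entries`` are hypothetical names, and the rule that a mapping with a ``driver`` key is a data source is an assumption made for illustration.

```python
from typing import Any, Dict, Iterator, List, Tuple

# Plain-dict mirror of the YAML example (roughly what a YAML parser would return).
CATALOG: Dict[str, Any] = {
    "metadata": {"hierarchical_catalog": True},
    "entity": {
        "customer": {
            "customer_attributes": {
                "args": {"urlpath": "s3://foo"},
                "driver": "parquet",
            }
        },
        "user": {
            "user_profile": {
                "args": {"urlpath": "s3://foo"},
                "driver": "parquet",
            }
        },
    },
}


def iter_entries(
    node: Dict[str, Any], prefix: Tuple[str, ...] = ()
) -> Iterator[Tuple[str, Dict[str, Any]]]:
    """Yield (dotted_path, entry) for every data source below ``node``.

    Assumption: a mapping containing a "driver" key is a data source;
    any other mapping is treated as a namespace and descended into.
    """
    for name, child in node.items():
        if not isinstance(child, dict):
            continue  # scalar values are not part of the hierarchy
        path = prefix + (name,)
        if "driver" in child:
            yield ".".join(path), child
        else:
            yield from iter_entries(child, path)


def list_entries(catalog: Dict[str, Any]) -> List[str]:
    """Return the dotted path of every data source, skipping top-level metadata."""
    tree = {key: value for key, value in catalog.items() if key != "metadata"}
    return [path for path, _ in iter_entries(tree)]


if __name__ == "__main__":
    for dotted_path in list_entries(CATALOG):
        print(dotted_path)
```

Each printed path corresponds to the attribute chain used on the catalog object, so ``entity.customer.customer_attributes`` is the entry read in the example above.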