Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/SiggiGue/hdfdict

Helps h5py to dump and load dictionaries.
https://github.com/SiggiGue/hdfdict

Last synced: 3 months ago
JSON representation

Helps h5py to dump and load dictionaries.

Host: GitHub
URL: https://github.com/SiggiGue/hdfdict
Owner: SiggiGue
License: mit
Created: 2015-02-27T12:25:29.000Z (almost 10 years ago)
Default Branch: master
Last Pushed: 2024-05-24T08:20:50.000Z (9 months ago)
Last Synced: 2024-09-21T23:51:16.320Z (5 months ago)
Language: Python
Size: 32.2 KB
Stars: 22
Watchers: 3
Forks: 9
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # hdfdict helps h5py to dump and load python dictionaries

[![CodeFactor](https://www.codefactor.io/repository/github/siggigue/hdfdict/badge)](https://www.codefactor.io/repository/github/siggigue/hdfdict)

[![Coverage Status](https://coveralls.io/repos/github/SiggiGue/hdfdict/badge.svg?branch=master)](https://coveralls.io/github/SiggiGue/hdfdict?branch=master)

If you have a hierarchical data structure of e.g. numpy arrays in a dictionary for example, you can use this tool to save this dictionary into a h5py `File()` or `Group()` and load it again.

This tool just maps the hdf `Groups` to dict `keys` and the `Datset` to dict `values`.

Only types supported by h5py can be used.

The dictionary-keys need to be strings.

A lazy loading option is activated per default. So big h5 files are not loaded at once. Instead, a dataset gets only loaded if it is accessed from the LazyHdfDict instance.

## Example

```python

import hdfdict

import numpy as np

d = {

    'a': np.random.randn(10),

    'b': [1, 2, 3],

    'c': 'Hallo',

    'd': np.array(['a', 'b']).astype('S'),

    'e': True,

    'f': (True, False),

}

fname = 'test_hdfdict.h5'

hdfdict.dump(d, fname)

res = hdfdict.load(fname)

print(res)

```

Output:

`

{'a': , 'b': , 'c': , 'd': , 'e': , 'f': }

`

This are all lazy loding fields in the result `res`.

Just call `res.unlazy()` or `dict(res)` to get all fields loaded.

If you only want to load specific fields, just use item access e.g. `res['a']` so only field 'a' will be loaded from the file.

```python

print(dict(res))

```

Output:

`

{'a': array([ 1.20991242,  0.74938763, -0.02199212, -0.08664085, -0.11950787,

       -0.12527781, -1.26821192, -1.20105904, -0.37933725, -0.16289392]), 'b': [1, 2, 3], 'c': 'Hallo', 'd': array([b'a', b'b'], dtype='|S1'), 'e': True, 'f': (True, False)}

`

## Installation

+ `pip install git+https://github.com/SiggiGue/hdfdict.git`