Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/syoyo/eson

ESON, Exa-scale Storage Object Notation
https://github.com/syoyo/eson

Last synced: about 2 months ago
JSON representation

ESON, Exa-scale Storage Object Notation

Awesome Lists containing this project

README

        

# ESON, Exa-scale Storage Object Notation

ESON is simple but powerful schema-less binary data format designed to handle Exa-scale data. Example includes graphics(e.g. geometry, volume and textures) and may also applicable for in-memory database and scientific data.

ESON is also designed to handle large-scale data efficiently for comming NVM(non-volatile memory) or SCM(storage-class memory) era.

C++ and python API is primarily provided.

# Version

* 0.3.2 (Nov 2015) C++ API is now header only!
* 0.3.1 (Aug 2015) Initial python2 and python3 binding(no native module compilation required)
* 0.3.0 (Mar 2015) Initial support of ARRAY in C/C++ API
* 0.2.0 (Jan 2015)
* 0.1.0 (Jul 2013)

# Status

Very unstable. Spec and API will change in the future.

## Quick tutorial

$ make
$ ./eson_test

## Specification

See [SPECIFICATION.md](SPECIFICATION.md)

## Design and implementation references

ESON design is strongly affected by

* BSON http://bsonspec.org/

ESON C++ API is strongly affected by

* picojson https://github.com/kazuho/picojson

## Example in C++

```
// Define this only in *one* .cc
#define ESON_IMPLEMENTATION
#include "eson.h"

#include
#include
#include

static void
ESONTest()
{
eson::Value v;
double dbl = 1.234;
eson::Value vd(dbl);

double dbl2 = 3.4;
eson::Value vd2(dbl2);

int64_t i = 144;
eson::Value ival(i);

std::string name("jojo");
eson::Value sval(name);

char bindata[12];
for (int i = 0; i < 12; i++) {
bindata[i] = i;
}
eson::Value bval((const uint8_t*)bindata, 12);

eson::Object o;
o["abora"] = vd;
o["muda"] = vd2;
o["dora"] = ival;
o["name"] = sval;
o["bin"] = bval;

v = eson::Value(o);

// First calcuate required size for serialized data.
int64_t sz = v.Size();

uint8_t* buf = new uint8_t[sz]; // or use mmap() if sz is large.
uint8_t* ptr = &buf[0];

ptr = v.Serialize(ptr);
assert((ptr-&buf[0]) == sz);

FILE* fp = fopen("output.eson", "wb");
fwrite(buf, 1, sz, fp);
fclose(fp);

eson::Value ret;
std::string err = eson::Parse(ret, buf);
if (!err.empty()) {
std::cout << "err:" << err << std::endl;
}

eson::Value dval = ret.Get("muda");
printf("muda = %f\n", dval.Get());

eson::Binary bin = ret.Get("bin").Get();
printf("bin len = %d\n", bin.size);
for (int i = 0; i < bin.size; i++) {
printf(" bin[%d] = %d\n", i, bin.ptr[i]);
}

delete buf;
}
```

## Example in JavaScript(node.js)

```
var eson = require('eson-binary');
var fs = require('fs');

if (process.argv.length < 3) {
console.log("needs input.eson");
process.exit(-1);
}

var buf = fs.readFileSync(process.argv[2])
var b = eson.parse(buf);

console.log(b)
```

## Note

### How to handle int16, int32, fp16, float32 data?

Please use BINARY data type.

## TODO

* [ ] Support Endianness.
* [ ] Efficiently serialize key table for better search performance.
* [ ] Make API Zero-Copy to reduce memory.
* [ ] Add serialize API in JavaScript API.
* [ ] Support 2GB+ size in JavaScript API.

## Compression

Currently we are planning to use zstd or LZ4 compression for lossless binary data.
Lossy compression for floating point data is interesting direction to explore.
There are zfp an fpzip for lossy floating point compression.

https://computation.llnl.gov/projects/floating-point-compression

## Author(s)

* Syoyo Fujita([email protected])
* Yasutoshi Mori(https://github.com/mirageym) Python binding.

## License

ESON C++ library and JavaScript library is licensed under MIT license.
ESON python binding is licensed under 3-clause BSD.

### Third-party licenses

* BSON Python binding. 3-clause BSD.
* lz4 is licensed under 2-clause BSD license.