Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/furstenheim/jsonb_deep_sum

Sum over jsonb rows
https://github.com/furstenheim/jsonb_deep_sum

Last synced: 5 days ago
JSON representation

Sum over jsonb rows

Awesome Lists containing this project

README

        

## Jsonb deep sum

![Build Status](https://travis-ci.org/furstenheim/jsonb_deep_sum.svg?branch=master)

jsonb_deep_sum is a PostgreSQL extension to easily add jsonb numeric

SELECT jsonb_deep_add('{"a": 1}', '{"a": 2}');
> '{"a": 3}'

SELECT jsonb_deep_add('{"a": {"b": 1}}', '{"a": {"b": 1}}')
> '{"a": {"b": 2}}'

It also provides an aggregation function `jsonb_deep_sum`

## INSTALLATION

In the directory where you downloaded jsonb_deep_sum, run

make && make install

Make install will copy the extension files to the postgres folder, so make sure that you have the necessary permissions.
It might also happen that pgxs is not found. For that you might need to install postgresql-server-dev-all and postgresql-common [[link](https://github.com/travis-ci/travis-ci/issues/2864)].

Once you have successfully compiled the extension log into postgresql and do:

CREATE EXTENSION jsonb_deep_sum;

### Tests

You can run the tests without installing postgres

```
docker run -it --rm --mount "type=bind,src=$(pwd),dst=/repo" pgxn/pgxn-tools sh -c 'cd /repo && pg-start 12 && pg-build-test'

```

## EXAMPLE

CREATE TABLE simple_nested (data jsonb);
INSERT INTO simple_nested VALUES ('{"a": 1}'), ('{"a": 2, "b": 1}'), ('{"a": 5}'), ('{"a": 3, "b": 1}'), (NULL);
SELECT jsonb_deep_sum(data) FROM simple_nested;
> {"a": 11, "b": 2}

## TESTING

To run the tests use:

make install && make installcheck

All the tests are in the sql directory.

## BENCHMARKING

The exact queries can be found in `./example`

| Numbers of rows | Level of nesting | jsonb_deep_sum (ms) | SQL (ms)
| --------------- | ---------------- | --------------- | ------- |
| 2 million | 0 | 3705,791 | 11083,089 |
| 2 million | 1 | 12014,711 | ?

It would be nice to have benchmarks using custom sql aggregate, using PL/V8 and storing the info in plain columns à la hstore.

## INTERNALS

JSONB is internall represented as a tree in which all levels are sorted.
Postgres provides iterators to walk this tree in DFS which respects this sorting. The algorithm uses this order to perform a [sorted merge join](https://en.wikipedia.org/wiki/Sort-merge_join).


## LIMITATIONS

Right now the algorithm only allows numeric types and objects. If nulls, strings, arrays or booleans are present it will raise an error.
In the future I might change this so that nulls is ignored in the sum or that strings are kept. Arrays are not supported because there isn't an obvious way to merge them.