https://github.com/gigaquads/store

A transactional in-memory SQL-like object store for long running processes, games, analytics, realtime processing and other applications.
https://github.com/gigaquads/store

database-management databases object-storage object-store query-engine query-language simulation store stores virtualization

Last synced: 3 months ago
JSON representation

A transactional in-memory SQL-like object store for long running processes, games, analytics, realtime processing and other applications.

Host: GitHub
URL: https://github.com/gigaquads/store
Owner: gigaquads
License: mit
Created: 2021-05-11T14:28:20.000Z (about 5 years ago)
Default Branch: main
Last Pushed: 2021-11-19T19:24:27.000Z (over 4 years ago)
Last Synced: 2025-09-20T23:24:10.569Z (10 months ago)
Topics: database-management, databases, object-storage, object-store, query-engine, query-language, simulation, store, stores, virtualization
Language: Python
Homepage:
Size: 89.8 KB
Stars: 1
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # Store

This library provides a `Store` datatype for Python. Each store looks and feels

like an ORM, but unlike an ORM, there is no database on the other end. Instead,

all data lives in memory, in the form of plain Python dicts and B-tree indices.

Stores support **SQL-like _select_ statements** in the style of SQLAlchemy,

**atomic transactions** and **multithreading**.

The source code aims to be rebustly documented, as we encourage open-source

collaboration on this Project.

## Use-cases

- Long-running interactive applications, like games.

- Automated trading systems with complex internal state management requirements.

- Stream-processing applications that perform fast _ad hoc_ queries on stream buffers.

## An Example

Imagine a system that generates user input events, like _mouse click_ and _key

press_. In the following example, we delete _click events_ created after a

specified time and capitalize the character asssociated with each _key press_

within a transaction.

```python

from store import Store

events = Store()

# insert fictitious "event" records

events.create_many([

    {'event_type': 'press', 'char': 'x', 'time': 1},

    {'event_type': 'click', 'button': 'L', 'position': (5, 8), 'time': 2},

    {'event_type': 'click', 'button': 'R', 'position': (3, 4), 'time': 3},

    {'event_type': 'press', 'char': 'y', 'time': 4},

    {'event_type': 'press', 'char': 'p', 'time': 5},

])

with events.transaction() as transaction:

    # delete "click" events after specified time

    transaction.select().where(

        events.row.event_type == 'click',

        events.row.time > 2

    ).delete()

    # capitalize the "char" for each selected "press" event

    get_press_events = transaction.select().where(

        x.event_type == 'press',

        x.char.one_of(['x', 'y', 'z'])

    )

    for event in get_press_events(dtype=list):

        event['char'] = event['char'].upper()

```

## State Dicts

Store methods, like `create` and `update`, return _state dicts_. Unlike regular

dicts, any change to the keys or values of a state dict results in an update to

the store. For example, suppose that `user` is a state dict. As such,

`user['name'] ='John'` generates a call to `store.update` under the hood. When

this happens, any existing reference to the same `user` immediately reflect this

change. There is no need to refresh each reference manually (as they are all

actually the same object). The same is true for other methods, like `update`,

`setdefault`, etc.

Let's illustrate with an example:

```python

frank_1 = store.create({'id': 1, 'name': 'frank'})

frank_2 = store.get(1)

# the store manages a singleton reference to frank's StateDict

# in its internal so-called identity set.

assert frank_1 is frank_2

# frank_1 and frank_2 are references to the same object,

# so they should both reflect the same change.

frank_1['name'] = 'Franklin'

assert frank_2['name'] == 'Franklin'

# likewise, any subsequent reference should reflect the same change

frank_3 = store.get(1)

assert frank_3['name'] == 'Franklin'

```

### Stateful Methods

Here is a list of each `dict` method that has been extended to result in an

update to store as a side-effect. On the lefthand side of each arrow is the

`dict` method. On the righthand side is the corresponding `store` call.

- `state.update(mapping)` ➞ `store.update(state, mapping.keys())`

- `state.setdefault(key, default)` ➞ `store.update(state, {key})`

- `state[key] = value` ➞ `store.update(state, {key})`

- `del state[key]` ➞ `store.delete(state, {key})`

### Indexes

By default, all `StateDict` keys are indexed, including those with non-scalar

values -- like lists, sets, dicts, etc. This means that that queries are fast.

## Queries

You can query a store like a SQL database, using _select_, _where_, _order_by_,

_limit_ and _offset_ constraints.

### Symbols

Select statements are written with the help of a class called `Symbol`. A symbol

is a variable used to express what you want to select and how. Suppose you had a

store of _user_ records. Then, using a symbol, You could write a query to

selects all users, created after a certain cut-off date.

```python

user = user_store.symbol()

get_users = user_store.select(

    user.first_name,

    user.email

).where(

    user.created_at > cutoff_date

)

for user in get_users(dtype=list):

    send(message=f'Hello, {user["first_name"]}!', email=user['email'])

```

An alternative to instantiating a new symbol for each query is to use a built-in

property, `store.row`. The following query is identical to the one above:

```python

get_users = user_store.select(

    user_store.row.first_name,

    user_store.row.email

).where(

    user_store.row.created_at > cutoff_date

)

```

### Select

By default, an empty select will select everything, like `select * from...` in

SQL; however, if you're only interested in a subset of fields, you can

explicitly enumerate them.

#### Selecting Everything

```python

query = store.select()

```

#### Selecting Specific Fields

```python

query = store.select(store.row.name, store.row.email)

```

### Where (Filtering)

You can constrain queries to select only records whose values match a given

logical predicate. Predicates can be arbitrarily nested in compound boolean

expressions. This is similar to the "where" clause in SQL select statements.

### Filtering Non-scalars Values

Unlike a SQL database, with a store, you can apply predicate logic not only to

scalar values, like numbers and strings, but also non-scalar types, like dicts,

lists, and sets.

For example, this is possible:

```python

# imagine you have a store with user dicts, and each user dict

# has a nested dog dict with an "age" value.

get_users = store.select().where(store.row.dog <= {'age': 10})

for user in get_users():

    assert user['dog']['age'] <= 10

```

Using a symbol, here are some example:

#### Conditional Expressions

```python

user = store.symbol()

# equality

predicate = (user.email == 'elon.musk@gmail.com')

predicate = (user.email != 'elon.musk@gmail.com')

# inequality

predicate = (user.age >= 50)

# containment

predicate = (user.favorite_color.in(['red', 'blue'])

# logical conjunction (AND)

predicate = (user.scent == 'smelly') & (user.income <= 20000)

# logical disjunction (OR)

predicate = (user.scent == 'smelly') | (user.income <= 20000)

# logical conjunction and disjunction combined

predicate = (

    ((user.scent == 'smelly') | (user.age <= 20)) & (user.name == 'Bob')

)

```

Moreover, predicates can be built up gradually, like so:

```python

predicate = (user.age <= 20)

if some_condition:

    predicate &= (user.income > 100000)   # |= also works

```

Once you have your predicate, you can pass it into a query's `where` method:

```python

query = store.select().where(

    (user.age <= 20) | (user.is_member == True)

)

```

### Order By

Query results can be sorted by one or more values using the `order_by` query

method. For example:

```python

# sort results by age (in ascending order) first

# created_at date (in descending order) second.

query = store.select().order_by(

    user.age.asc,

    user.created_at.desc

)

```

#### Ordering By Non-scalar Values

Unlike SQL, the store can sort non-scalar datatypes, like dicts, lists, and sets

-- in addition to plain ints and strings. This means that you can do things like

-- this:

```python

store.create_many([

    {'owner': 'Mohammed', 'dog': {'age': 10}},

    {'owner': 'Kang Bo', 'dog': {'age': 6}},

])

get_users = store.select().order_by(store.row.dog.asc)

users = get_users(dtype=list)

for u1, u2 in zip(users, users[1:]):

    assert u1.dog['age'] <= u2.dog['age']

```

Note that, when sorting a dict, the dict's items are sorted and compared in the

resulting order.

### Limit & Offset

Queries support pagination via limit and offset parameters. The `limit`

parameter is an `int` that determines the maximum number of records returned by

the query while the `offset` parameter determines the starting index of the

returned slice. When using limit and offset, it is important to specify an order, using

`order_by`.

```python

query = store.select(

    user.email

).order_by(

    user.age.desc

).offset(

    20

).limit(

    10

)

```

## Transactions

Stores support transactions as well. If, for some reason you don't already know,

a database transaction is a mechanism that allows you to perform multiple

operations as if they were all performed int a single step. This way, if one

operation fails, then they all fail, and the state of the store remains intact.

The syntax for creating transactions is straight forward:

```python

with user_store.transaction() as user_trans:

    # update the name of one user and delete another

    users = user_trans.get_many([1, 2])

    users[1]['name'] = 'Updated Name'

    users[2].delete()

```

At the end of the `with` block, the transaction commits; otherwise, if an

exception is raised, the transaction rolls back, clearing its internal state.

Alternate to using the `with` statement, `commit` and `rollback` methods can be

called explicitly.

```python

user_trans = user_store.transaction()

try:

    users = user_trans.get_many([1, 2])

    users[1]['name'] = 'Updated Name'

    users[2].delete()

    user_trans.commit()

except Exception:

    user_trans.rollback()

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/gigaquads/store

Awesome Lists containing this project

README