https://github.com/KadekM/orientdb-scala-stream

OrientDB scala reactive-streams (akka streams) library for non-blocking and live queries.

Host: GitHub
URL: https://github.com/KadekM/orientdb-scala-stream
Owner: KadekM
License: apache-2.0
Created: 2015-06-30T01:36:56.000Z (over 9 years ago)
Default Branch: master
Last Pushed: 2016-02-15T01:14:41.000Z (over 8 years ago)
Last Synced: 2024-07-01T17:10:19.211Z (3 months ago)
Topics: akka, orientdb, reactive-streams, scala
Language: Scala
Homepage:
Size: 166 KB
Stars: 35
Watchers: 3
Forks: 4
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

_This library serves as a proof of concept! It has not been battle-tested in production._

_If you are missing functionality, or something doesn't work, please either raise an issue or open a PR. [See info](#info) at the bottom._

## OrientDB Scala Stream

A library that lets you execute non-blocking queries and live queries against an OrientDB database and treat the results as a reactive stream.

__If your OrientDB version is < 2.2, you need to enable these capabilities in the server config! Check the OrientDB docs.__


Supported features:

- [Live queries](#live-queries) (experimental - [OrientDB documentation](http://orientdb.com/docs/last/Live-Query.html#whats-next))

- [Nonblocking queries](#non-blocking-queries)

- [Named query execution](#non-blocking-queries)

- [Parametrized query execution](#non-blocking-queries)

If you're using SBT, add the following to `build.sbt`:

`libraryDependencies += "com.marekkadek" %% "orientdb-scala-stream" % "0.5.4"`

## Live queries

[(OrientDB documentation)](http://orientdb.com/docs/last/Live-Query.html)

Example:

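```scala
import akka.stream.scaladsl.Source
import com.orientechnologies.orient.core.db.document.ODatabaseDocumentTx
import orientdb.streams._

// An implicit Akka Streams materializer must also be in scope to run the stream.
implicit val db: ODatabaseDocumentTx = ???
implicit val loader = OrientLoaderDeserializing()

// Both argument lists stay on one line; a line starting with `(` would begin a new statement.
val query = LiveQuery(bufferSize = 1000, OverflowStrategy.DropHead)("LIVE SELECT FROM Person")

Source.fromPublisher(query.execute())
    .collect { case Created(data) => data }
    // Give field an explicit type; otherwise Scala infers Nothing for the Java generic result.
    .takeWhile(_.field[String]("name").contains("Pet"))
    .runForeach(println)
```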

Once you execute a live query, you get notifications for the following events as they are streamed from OrientDB:

- Loaded

- Updated

- Deleted

- Created
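
As a rough illustration of working with these four event kinds, the sketch below models them as a small sealed hierarchy and selects only some of them by pattern matching, much like the `collect` call in the example above. The types here are hypothetical stand-ins defined for this sketch; the library's own event classes may be shaped differently.

```scala
object LiveEventsSketch {
  // Hypothetical stand-ins for the library's event types.
  sealed trait Event[+A] extends Product with Serializable
  final case class Loaded[A](data: A) extends Event[A]
  final case class Created[A](data: A) extends Event[A]
  final case class Updated[A](data: A) extends Event[A]
  final case class Deleted[A](data: A) extends Event[A]

  // Keep only the payloads of Created and Updated events, in arrival order.
  def upserts[A](events: Seq[Event[A]]): Seq[A] =
    events.collect {
      case Created(data) => data
      case Updated(data) => data
    }
}
```

On a stream, the same partial function can be passed to `collect`, so events of other kinds are simply skipped.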

The listener is automatically unsubscribed from OrientDB once the subscription is cancelled (by executing the command `live unsubscribe TOKEN_VALUE`).

Live queries have an internal buffer, which you configure when creating the query. The DB fills it when, for example, there is no downstream demand but events keep happening on the DB. [Read here on overflow strategies and how to handle the case where you don't need or want any internal buffer](#overflow-strategies).

## Non blocking queries

[(OrientDB documentation)](http://orientdb.com/docs/last/Document-Database.html#non-blocking-query-since-v21)

There are two types of queries, each with its own advantages and disadvantages:

- Backpressuring (DB doesn't fetch more data than is demanded downstream)

- Buffering (DB fetches data and fills buffer, even without demand, but sends it downstream accordingly)

They share the same interface, so you can swap one for the other as needed.

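```scala
import akka.stream.scaladsl.Source
import com.orientechnologies.orient.core.db.document.ODatabaseDocumentTx
import com.orientechnologies.orient.core.record.impl.ODocument
import orientdb.streams._

// An implicit Akka Streams materializer must also be in scope to run the stream.
implicit val db: ODatabaseDocumentTx = ???
implicit val loader = OrientLoaderDeserializing()

val query = NonBlockingQueryBackpressuring[ODocument]("SELECT * FROM Person")

val src = Source.fromPublisher(query.execute())
          .runForeach(println)
```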

This will backpressure the database: if there is no demand from downstream, the database won't perform the fetch, so backpressuring queries need no internal buffer. Cancelling the subscription stops the database from fetching further rows.

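```scala
import akka.stream.scaladsl.Source
import com.orientechnologies.orient.core.db.document.ODatabaseDocumentTx
import com.orientechnologies.orient.core.record.impl.ODocument
import orientdb.streams._

implicit val db: ODatabaseDocumentTx = ???
implicit val loader = OrientLoaderDeserializing()

// Keep the argument lists in one expression; a line starting with `(` would begin a new statement.
val query = NonBlockingQueryBuffering[ODocument](bufferSize = 1000, OverflowStrategy.DropHead)(
        "SELECT * FROM Person WHERE name = :lookingFor")

// myMethod and myFilter are placeholders for your own functions.
val src = Source.fromPublisher(query.executeNamed("lookingFor" -> "Peter"))
          .map(myMethod)
          .filter(myFilter)
          .runFold(...)
```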

This will start the query on the database, and results will be aggregated as the database provides them. They will be pushed downstream according to the Reactive Streams specification (that is, based on demand). Cancelling the subscription will not stop the DB from finishing the query, but elements will no longer be buffered.

Buffering queries have an internal buffer, which you configure when creating the query. [More here](#overflow-strategies).
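
To make the difference between the two modes concrete, here is a minimal pure-Scala model. It does not use the library or OrientDB: `CountingSource` is a hypothetical stand-in for the database that counts how many rows were fetched, and the buffering variant is simplified by assuming all rows arrive before any downstream demand and that overflow is handled by dropping the oldest element.

```scala
import scala.collection.mutable

object FetchModesSketch {
  // Hypothetical stand-in for the database: counts how many rows were fetched.
  final class CountingSource(rows: Vector[String]) {
    var fetched = 0
    private var next = 0
    def hasNext: Boolean = next < rows.length
    def fetchOne(): String = {
      val row = rows(next)
      next += 1
      fetched += 1
      row
    }
  }

  // Backpressuring: fetch at most as many rows as downstream demands.
  def pullOnDemand(source: CountingSource, demand: Int): Vector[String] = {
    val out = Vector.newBuilder[String]
    var remaining = demand
    while (remaining > 0 && source.hasNext) {
      out += source.fetchOne()
      remaining -= 1
    }
    out.result()
  }

  // Buffering: fetch every row into a bounded FIFO buffer regardless of demand
  // (dropping the oldest element when full), then serve demand from the buffer.
  def bufferThenServe(source: CountingSource, bufferSize: Int, demand: Int): Vector[String] = {
    require(bufferSize > 0, "this sketch assumes a non-empty buffer")
    val buffer = mutable.Queue.empty[String]
    while (source.hasNext) {
      if (buffer.size == bufferSize) buffer.dequeue()
      buffer.enqueue(source.fetchOne())
    }
    Vector.fill(math.min(demand, buffer.size))(buffer.dequeue())
  }
}
```

In this model the backpressuring variant never touches rows nobody asked for, while the buffering variant reads the whole result set and may lose elements once the buffer is full.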

## Overflow strategies

Overflow strategies are almost identical to the akka-streams strategies. Overflow happens when the buffer is full and a new message arrives; the overflow strategy decides what happens next. You can think of the buffer as a FIFO collection. Here are illustrations of the supported strategies (the oldest element is on the left, so it is the element that will be emitted on the next demand). Suppose the buffer size is 6:

| Strategy   | Description              | Buffer visualisation                                |
| :--------- |:------------------------ | :-------------------------------------------------- |
| DropHead   | drops oldest in buffer   | `x ~> [b u f f e r]` becomes `[u f f e r x]`        |
| DropTail   | drops youngest in buffer | `x ~> [b u f f e r]` becomes `[b u f f e x]`        |
| DropBuffer | drops current buffer     | `x ~> [b u f f e r]` becomes `[x]`                  |
| DropNew    | drops incoming element   | `x ~> [b u f f e r]` becomes `[b u f f e r]`        |
| Fail       | emits error to stream    | `x ~> [b u f f e r]` emits `BufferOverflowException`|
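
The table can be reproduced with a small pure-Scala model of a bounded FIFO buffer. This is only a sketch of the semantics described above, not the library's implementation: the `Strategy` objects are local stand-ins, and `Fail` is modelled as a `Left` value rather than a `BufferOverflowException` emitted to the stream.

```scala
object OverflowSketch {
  // Local stand-ins mirroring the strategy names in the table.
  sealed trait Strategy
  case object DropHead extends Strategy
  case object DropTail extends Strategy
  case object DropBuffer extends Strategy
  case object DropNew extends Strategy
  case object Fail extends Strategy

  // Offer `x` to a FIFO buffer whose head (index 0) is the oldest element.
  // While there is room the element is appended; once the buffer is full the
  // strategy decides. The dropping strategies assume capacity >= 1.
  def offer[A](buffer: Vector[A], capacity: Int, x: A, strategy: Strategy): Either[String, Vector[A]] =
    if (buffer.size < capacity) Right(buffer :+ x)
    else strategy match {
      case DropHead   => Right(buffer.drop(1) :+ x)      // oldest goes
      case DropTail   => Right(buffer.dropRight(1) :+ x) // youngest goes
      case DropBuffer => Right(Vector(x))                // everything buffered goes
      case DropNew    => Right(buffer)                   // incoming element goes
      case Fail       => Left("buffer overflow")         // stands in for the stream error
    }
}
```

For example, `OverflowSketch.offer("buffer".toVector, 6, 'x', OverflowSketch.DropHead)` yields the characters of `ufferx`, matching the first row of the table.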

You may decide you don't want any internal buffer, for example in live queries when you are interested only in changes that happen AFTER there is active demand from downstream:

```scala
// Both argument lists stay on one line; a line starting with `(` would begin a new statement.
val query = LiveQuery(bufferSize = 0, OverflowStrategy.DropNew)("LIVE SELECT FROM Person")
```

Since it's curried, you can easily define your own `LiveQuery` without a buffer, such as `LiveQueryInstant`. It is currently not provided as standard (I tried to keep the number of interfaces as small as possible).

```scala

val query = LiveQueryInstant("LIVE SELECT FROM Person")

```
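
One way such a `LiveQueryInstant` could be defined is by fixing the first argument list of the curried factory. The sketch below uses hypothetical stand-in types, because the real signatures of `LiveQuery` and `OverflowStrategy` are not shown here; only the curried shape `(bufferSize, strategy)(query)` is taken from the examples above.

```scala
object CurrySketch {
  // Hypothetical stand-ins for the library's types.
  sealed trait OverflowStrategy
  case object DropNew extends OverflowStrategy
  final case class Query(bufferSize: Int, strategy: OverflowStrategy, sql: String)

  // Curried factory with the same shape as LiveQuery(bufferSize, strategy)(query).
  def LiveQuery(bufferSize: Int, strategy: OverflowStrategy)(sql: String): Query =
    Query(bufferSize, strategy, sql)

  // Fix the first argument list once; callers only supply the query string.
  def LiveQueryInstant(sql: String): Query =
    LiveQuery(bufferSize = 0, DropNew)(sql)
}
```

With the real library, the body of `LiveQueryInstant` would presumably be the `LiveQuery(bufferSize = 0, OverflowStrategy.DropNew)` call shown earlier, with whatever implicits it requires in scope.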

## Loader

To execute the queries you need the following implicit (besides the regular ones):

```scala

implicit val loader = OrientLoaderDeserializing()

```

The loader specifies what to do with your entity before it is sent to the source of the stream (and thus possibly emitted downstream). **It runs on a thread that has the database set up as a ThreadLocal, so you can call methods that require it.** In stream operations, there is no such guarantee.

You can implement your own `OrientLoader`.

**The way this works may change in the future.**
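
The thread-affinity issue can be illustrated without OrientDB. In the sketch below, a plain `ThreadLocal[String]` is a hypothetical stand-in for the database handle that is bound to a thread: a value set on one thread is simply not visible from another, which is why work that needs the database belongs in the loader rather than in later stream stages.

```scala
object ThreadLocalSketch {
  // Hypothetical stand-in for a database handle bound to the current thread.
  private val currentDb = new ThreadLocal[String]

  def activate(name: String): Unit = currentDb.set(name)

  // None when no database has been activated on the calling thread.
  def current: Option[String] = Option(currentDb.get())

  // Evaluate `body` on a fresh thread and return its result.
  def onNewThread[A](body: => A): A = {
    var result: Option[A] = None
    val thread = new Thread(new Runnable {
      def run(): Unit = { result = Some(body) }
    })
    thread.start()
    thread.join()
    result.get
  }
}
```

After `ThreadLocalSketch.activate("people-db")`, calling `ThreadLocalSketch.current` on the same thread sees the value, while `ThreadLocalSketch.onNewThread(ThreadLocalSketch.current)` does not.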

## FAQ

Why does `Source.fromPublisher(query.execute()).runForeach(println)` produce inconsistent strings? Sometimes with the Person prefix, sometimes without?

* It depends on which thread runs the execution. It can be the thread that has the database set in ThreadLocals, which makes `ODocument.toString()` also fetch the schema. Please read the _[Loader](#loader)_ section.

## Info

This library is still at a very early stage. There are lots of TODOs, and you may often discover a better way to implement something. Feel free to raise an issue or submit a PR; I will very much welcome it.

### Current TODOs

- finish some tests and move them from playground to real tests

- clean up tests, better setup and teardown logic, remote/local traits

- in-code TODOs