https://github.com/hydreio/disk

Store and query your GraphQL types in Redis (and redisearch)
https://github.com/hydreio/disk

graphql redis redisearch

Last synced: 3 months ago
JSON representation

Store and query your GraphQL types in Redis (and redisearch)

Host: GitHub
URL: https://github.com/hydreio/disk
Owner: HydreIO
License: unlicense
Created: 2020-06-16T17:53:16.000Z (almost 6 years ago)
Default Branch: edge
Last Pushed: 2023-01-06T09:06:05.000Z (over 3 years ago)
Last Synced: 2024-08-11T10:04:37.410Z (almost 2 years ago)
Topics: graphql, redis, redisearch
Language: JavaScript
Homepage:
Size: 171 KB
Stars: 4
Watchers: 3
Forks: 0
Open Issues: 8
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          
@hydre/disk



  

  

  

    

  

  

  



Store and query your GraphQL types in Redis




  



- [Requirements](#requirements)

- [Install](#install)

- [Usage](#usage)

  - [Schema](#schema)

  - [With Node](#with-node)

    - [CREATE](#create)

    - [DELETE](#delete)

    - [KEYS](#keys)

    - [GET](#get)

    - [SET](#set)

## Requirements

Hydre/disk leverage Graphql AST to provide an ORM kind tool

which map your schema to Redisearch indexes and allows you

to easily query your datas.

- Redis

- Redisearch module enabled

- Node > 14.4

- Advanced GraphQL knowledge

## Install

```sh

npm install @hydre/disk

```

## Usage

`hydre-disk` cli synchronize your GraphQL schema to Redisearch indexes

and reindex all hashes uppon changes. Use it manually or through your CI

`@hydre/disk` module provide you a query interface for usage in node

### Schema

> It is advised to use this tool in an advanced environement where

> you use schema generation and manage the AST yourself which would

> allow you to remove any trace of configuration mapping.

```sh

hydre-disk --file ./schema.gql

```

| flag | default value | description |

| --- | ---|---|

| `--file` | Error (mandatory) | direct path to a graphql schema file which contain your types |

| `--redis` | `'redis://localhost:6379'` | redis url |

| `--sync` |  | if an index exist it will be dropped and recreated to allow reindexing

Synchronization steps (current version)

1. The provided schema is parsed into a graphql ast

2. The graphql ast is parsed into a hydre/disk ast

3. Each graphql `type` definition represent a redisearch index

4. `FT.INFO` is used to check for index existence

5. If the index exist we skip it (unless --sync is passed) >> _go to 3_

   - when `--sync` is passed, any found index will be deleted `FT.DROP idx KEEPDOCS`

6. If the index doesn't exist, the ast is serialized into a `FT.CREATE` command

   and executed for the `type` (see examples below)

7. After creating the index, the hashes are reindexed serialy with `FT.ADDHASH`

   with a `SCAN  MATCH `

   - `namespace` is the graphql `type` name, which is also the index name

> WARN: Reindexed hashes will loose their document `WEIGHTS`, the scope of this tool

> has no interest in using this redisearch feature so if you need documents

> weight, you'll need to reindex them manually

hydre/disk is a 1-1 mapping with [available schema options](https://oss.redislabs.com/redisearch/Commands/#ftcreate)

```gql

type {index}

  @MAXTEXTFIELDS

  @TEMPORARY(seconds: Int!)

  @NOOFFSETS

  @NOHL

  @NOFIELDS

  @NOFREQS

  @STOPWORDS(words: [String!]!)

    {

      {field}: ... @TEXT @NOSTEM @WEIGHT(weight: Int! | Float!) @PHONETIC(matcher: String!)

      {field}: ... @NUMERIC

      {field}: ... @GEO

      {field}: ... @TAG @SEPARATOR(sep: String!)

      {field}: ... @{type} @SORTABLE @NOINDEX

}

```

The tool detect fields through their **primary positionned** directives.

If none of them is detected the `type` will simply be ignored.

```js

const FIELD_TYPES = new Set(['TEXT', 'NUMERIC', 'GEO', 'TAG'])

```

Redisearch is an interface on top of redis hashes which are independants,

that mean you can use a real graphql schema and specify redisearch directives

only on field which needs to be queried by secondary indexes (more on the

example below)

About shipping the schema it is cleaner if you can `visit` the ast

to remove directives before production to keep them internally,

the second best option is to extract

the types from your API schema and keep a graphql file dedicated only to redis,

the third option is to declare all directives at the top of your schema to

make them valid.

> I myself only write types and generate multiple things from it, including

> the production schema which contains no directives at all and is used remotly.

**Real usecase :**

```gql

type User @MAXTEXTFIELDS {

  uuid    : ID!

  name    : String! @TEXT @NOSTEM

  address : String!

  posts   : [Post!]!

  cities  : [String!]! @TAG @SEPARATOR(sep: ",")

}

type Post @STOPWORDS(words: ["i", "know", "right"]) {

  uuid   : ID!

  date   : Int! @NUMERIC

  text   : String @TEXT @WEIGHT(weight: 12.3)

}

```

Will yield

```sh

FT.CREATE User MAXTEXTFIELDS SCHEMA name TEXT NOSTEM cities TAG SEPARATOR ,

FT.CREATE Post STOPWORDS "i" "know" "right" SCHEMA date NUMERIC text TEXT WEIGHT 12.3

```

> I assume you're an adult and know what you are doing, no validation is done.

> That mean if you specify an invalid set of directives or wrong types

> you'll simply get a raw redis error.

### With Node

The master client will be used for write operations,

and the slave client will be used to read.

```js

import Redis  from 'ioredis'

import Mount  from '@hydre/disk'

import events from 'events'

const master = new Redis()

const Disk   = Mount({

  master_client : master,

  slave_client  : master,

  events_enabled: true,

  events_name   : '__disk__',

})

await events.once(master, 'ready')

```

If events are enabled Disk will publish to redis every CREATE, SET, and DELETE

operations in a similar way as keyevent-notifications

`PUBLISH "::" `

ex: `PUBLISH __disk__:CREATE:User User:xxxx-xxxx-xxxx-xxxx`

which you can listen all through `PSUBSCRIBE __disk__:*:*`

here is a example of connexion retry backoff client

```js

const client = new Redis({

  host         : 'redis://localhost:6379',

  retryStrategy: attempt => {

    if (attempt > 10)

      return new Error(`Can't connect to redis after ${ attempt } tries..`)

    return 250 * 2 ** attempt

  },

})

```

When using the disk you must follow this pattern

```js

Disk[][](query)

```

You may often only need to retrieve documents by ids, to allow

for better performance Disk is only using redisearch if you include

the `search` option. When it is not specified it only use regular

`HSET` and `HMGET` commands.

- **operation** is one of

  - `CREATE`

  - `DELETE`

  - `KEYS`

  - `GET`

  - `SET`

- **type** is the namespace, which is the graphql type name.

  It is used for the collection name (index) and the node namespace.

  A `type User` would yield an `User` redisearch index and `User:`

  documents. Basically `FT.ADD User User:xxxx 1 FIELDS ...`

- **query** is an option object containing the following

  ```js

  {

    // array of uuids to use instead of the whole database (INKEYS)

    // if not specified it will use the whole index

    keys     : [''],

    // array of field names to return

    // if not specified it will return all fields

    fields   : [''],

    limit    : Infinity, // limit results

    offset   : 0, // paginate

    search   : '', // redisearch query string (https://oss.redislabs.com/redisearch/Query_Syntax/)

    // object to use for CREATE and SET operations

    // it use `REPLACE PARTIAL` for a SET operation

    document: {}

  }

  ```

#### CREATE

> Creating a document execute a regular HSET and then try to index

> it in redisearch, because we may not want an index for every types.

Create will generate an uuid v4 and use it with the type as a namespace

```js

const uuid = await Disk.CREATE.User({ document: pepeg })

// User:xxxx-xxxx-xxx..

```

#### DELETE

Delete matching documents from redis (DD)

```js

const result = await Disk.DELETE.User({ search: '*' })

// result = raw result with an array of 'OK'

```

#### KEYS

Return an array of matching keys (NOCONTENT),

can be used to count element or check for existence

```js

const [key1, key2, ...keys] = await Disk.KEYS.User({ search: '*' })

```

#### GET

Return an array of javascript object

> note that redis only returns strings, Disk will convert back

> so any `'1'` will be converted to a Number, `'true'` to a boolean.

> Big numbers will be converted to native BigInts

```js

const Paul = await Disk.GET.User({

    fields: ['name', 'age'],

    search: '@name:Paul',

    limit : 1

  })

```

#### SET

> Setting a field to `undefined` remove it (HDEL)

Merge a document into all matching results

```js

const result = await Disk.SET.User({

  search  : '*',

  document: { group : 'communism' }

})

// result = raw redis array result (FT.ADD)

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/hydreio/disk

Awesome Lists containing this project

README

@hydre/disk

Store and query your GraphQL types in Redis