https://github.com/sudodevstudio/genkitx-supabase

Last synced: 21 days ago
JSON representation
Host: GitHub
URL: https://github.com/sudodevstudio/genkitx-supabase
Owner: SudoDevStudio
License: mit
Created: 2026-04-01T03:06:49.000Z (29 days ago)
Default Branch: main
Last Pushed: 2026-04-07T03:54:57.000Z (23 days ago)
Last Synced: 2026-04-07T05:24:21.282Z (23 days ago)
Language: TypeScript
Size: 99.6 KB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project

README

          # @sudodevstudio/genkitx-supabase

`@sudodevstudio/genkitx-supabase` is a Genkit community plugin that turns a Supabase Postgres + `pgvector` table into a Genkit indexer and retriever pair for RAG workflows.

If you want the short version first, see [`HOW_IT_WORKS.md`](./HOW_IT_WORKS.md).

It lets you keep embeddings, content, and JSONB metadata in Supabase while using the familiar Genkit flow:

- configure a plugin with an `indexName` and embedder

- index documents with `ai.index()`

- retrieve relevant documents with `ai.retrieve()`

- filter retrievals with JSONB metadata

## Why Supabase + Genkit

Supabase gives you managed Postgres, `pgvector`, and JSONB in one place. Genkit gives you a clean retrieval/indexing abstraction inside AI flows. Together, they make it easy to build RAG pipelines that stay close to your application data and use standard Postgres tooling.

## Features

- Genkit plugin UX modeled after existing vector-store integrations

- `supabaseVectorStore(configs)`

- `supabaseIndexerRef(indexName)`

- `supabaseRetrieverRef(indexName)`

- batch embedding and upsert indexing

- delete by id through `ai.index()`

- top-k semantic retrieval through a Supabase RPC function

- JSONB metadata filtering

- configurable schema, table, RPC, and column names

- useful validation and runtime errors

## Install

```bash

npm install @sudodevstudio/genkitx-supabase @supabase/supabase-js genkit

```

If you want to use the Google AI quickstart below:

```bash

npm install @genkit-ai/google-genai

```

## Quickstart

```ts

import { googleAI } from '@genkit-ai/google-genai';

import { Document, genkit } from 'genkit';

import {

  supabaseIndexerRef,

  supabaseRetrieverRef,

  supabaseVectorStore,

} from '@sudodevstudio/genkitx-supabase';

const ai = genkit({

  plugins: [

    googleAI(),

    supabaseVectorStore([

      {

        indexName: 'docs',

        embedder: googleAI.embedder('gemini-embedding-001'),

        connection: {

          url: process.env.SUPABASE_URL!,

          key: process.env.SUPABASE_SERVICE_ROLE_KEY!,

        },

        embeddingDimension: 3072,

      },

    ]),

  ],

});

const docsIndexer = supabaseIndexerRef('docs');

const docsRetriever = supabaseRetrieverRef('docs');

await ai.index({

  indexer: docsIndexer,

  documents: [

    Document.fromText('Supabase stores embeddings in Postgres.', {

      id: 'doc-1',

      topic: 'supabase',

    }),

  ],

});

const docs = await ai.retrieve({

  retriever: docsRetriever,

  query: 'Where are the embeddings stored?',

  options: { k: 3 },

});

```

## SQL Setup

The package expects:

- a pgvector-enabled table for documents

- an RPC function that accepts `query_embedding`, `match_count`, and `filter`

- rows containing your configured `id`, `content`, and `metadata` fields

SQL examples are included in:

- [`sql/001_create_rag_documents.sql`](./sql/001_create_rag_documents.sql)

- [`sql/002_match_rag_documents.sql`](./sql/002_match_rag_documents.sql)

Default schema:

- `id text primary key`

- `content text not null`

- `metadata jsonb not null default '{}'::jsonb`

- `embedding extensions.vector(3072) not null`

- `created_at timestamptz`

- `updated_at timestamptz`

If your embedder uses a different dimension, update both the table definition and the RPC function signature to match.

## Usage

### Configure the plugin

```ts

import { googleAI } from '@genkit-ai/google-genai';

import { genkit } from 'genkit';

import { supabaseVectorStore } from '@sudodevstudio/genkitx-supabase';

const ai = genkit({

  plugins: [

    googleAI(),

    supabaseVectorStore([

      {

        indexName: 'products',

        embedder: googleAI.embedder('gemini-embedding-001'),

        connection: {

          url: process.env.SUPABASE_URL!,

          key: process.env.SUPABASE_SERVICE_ROLE_KEY!,

        },

        schema: 'public',

        table: 'rag_documents',

        queryRpcName: 'match_rag_documents',

        idColumn: 'id',

        contentColumn: 'content',

        metadataColumn: 'metadata',

        embeddingColumn: 'embedding',

        defaultK: 5,

        embeddingDimension: 3072,

      },

    ]),

  ],

});

```

### Index documents

Document ids are read from `document.metadata.id`. If a document already exists with the same id, it is updated.

```ts

import { Document } from 'genkit';

import { supabaseIndexerRef } from '@sudodevstudio/genkitx-supabase';

const productsIndexer = supabaseIndexerRef('products');

await ai.index({

  indexer: productsIndexer,

  documents: [

    Document.fromText('The red backpack fits a 16-inch laptop.', {

      id: 'sku-red-backpack',

      category: 'bags',

      inventoryStatus: 'in_stock',

    }),

    Document.fromText('The trail bottle keeps drinks cold for 18 hours.', {

      id: 'sku-trail-bottle',

      category: 'drinkware',

      inventoryStatus: 'in_stock',

    }),

  ],

});

```

### Delete documents by id

Use the same `ai.index()` call with delete options:

```ts

await ai.index({

  indexer: productsIndexer,

  documents: [],

  options: {

    operation: 'delete',

    ids: ['sku-trail-bottle'],

  },

});

```

You can also omit `options.ids` and pass documents that contain `metadata.id`.

### Retrieve documents

```ts

import { supabaseRetrieverRef } from '@sudodevstudio/genkitx-supabase';

const productsRetriever = supabaseRetrieverRef('products');

const docs = await ai.retrieve({

  retriever: productsRetriever,

  query: 'Which bag fits a laptop?',

  options: { k: 3 },

});

```

### Metadata filter example

Metadata filters are passed to the RPC function as JSONB and work well with `metadata @> filter`.

```ts

const docs = await ai.retrieve({

  retriever: productsRetriever,

  query: 'Show me in-stock bags',

  options: {

    k: 5,

    filter: {

      category: 'bags',

      inventoryStatus: 'in_stock',

    },

  },

});

```

## Public API

### `supabaseVectorStore(configs)`

Registers one or more Supabase-backed vector stores by `indexName`.

Config fields:

- `indexName`

- `embedder`

- `connection: { url, key }`

- `table?`

- `queryRpcName?`

- `idColumn?`

- `contentColumn?`

- `metadataColumn?`

- `embeddingColumn?`

- `schema?`

- `defaultK?`

- `embeddingDimension?`

- `embedderOptions?`

Defaults:

- `schema: 'public'`

- `table: 'rag_documents'`

- `queryRpcName: 'match_rag_documents'`

- `idColumn: 'id'`

- `contentColumn: 'content'`

- `metadataColumn: 'metadata'`

- `embeddingColumn: 'embedding'`

- `defaultK: 3`

### `supabaseIndexerRef(indexName)`

Returns the Genkit indexer reference for `ai.index()`.

Supported indexer options:

- `operation?: 'upsert' | 'delete'`

- `ids?: Array`

- `batchSize?: number`

### `supabaseRetrieverRef(indexName)`

Returns the Genkit retriever reference for `ai.retrieve()`.

Supported retriever options:

- `k?: number`

- `filter?: Record`

## Production Notes

- Use a service role key on trusted servers only. Do not expose it in browser bundles.

- Keep retrieval and indexing on the server side, especially when your table or RPC requires elevated database permissions.

- Make sure the RPC function lives in an exposed schema and that Row Level Security policies allow the operations you need.

- Keep your `embeddingDimension` aligned with the actual vector column dimension to catch mistakes before they hit Postgres.

## Repository Contents

- source: [`src/index.ts`](./src/index.ts)

- SQL: [`sql/001_create_rag_documents.sql`](./sql/001_create_rag_documents.sql), [`sql/002_match_rag_documents.sql`](./sql/002_match_rag_documents.sql)

- tests: [`test/config.test.ts`](./test/config.test.ts), [`test/mapping.test.ts`](./test/mapping.test.ts), [`test/filter.test.ts`](./test/filter.test.ts), [`test/store.test.ts`](./test/store.test.ts)

## License

See [`LICENSE`](./LICENSE) for the full terms.
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/sudodevstudio/genkitx-supabase

Awesome Lists containing this project

README