ABAP Vector Search Library
- Host: GitHub
- URL: https://github.com/oisee/zvdb
- Owner: oisee
- License: mit
- Created: 2023-11-19T21:22:14.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-04-28T11:25:16.000Z (8 months ago)
- Last Synced: 2024-09-27T13:20:14.483Z (3 months ago)
- Topics: abap, vectordb
- Language: ABAP
- Homepage:
- Size: 2.1 MB
- Stars: 26
- Watchers: 3
- Forks: 4
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
# ABAP Vector Search Library
The ABAP Vector Search Library is a vector database implemented entirely in ABAP, designed as an independent solution that does not rely on external vector databases. It is tailored to the "chat with your documents" scenario: documents are sliced into manageable chunks, and the semantically relevant chunks are injected into user prompts, giving an interactive and efficient way to handle and answer user queries over document-based content.
The library is built around RAG (Retrieval Augmented Generation), the technique that powers the "chat with your documents" scenario:
- Slice documents into chunks
- Inject semantically relevant chunks into the Prompt:
```code
Considering only information below:
###
{chunk01}
{chunk02}
{chunk03}
###
please provide a response to the following request:
{query from user}
```

## How it works? Why it works? O_O
Explained here:
- as a LinkedIn article: [Vector DB in pure ABAP (any DB)](https://www.linkedin.com/pulse/vector-db-pure-abap-any-alice-vinogradova-znjbe)
- as a draft of the above article: [implementation](implementation.md)

## Stats
Optimization: initially a single unquantized floating-point comparison took ~3000 μs; with 1-bit quantization and bitwise operations it was brought down to ~20 μs, which allows a brute-force dot-product search over roughly 50,000 vectors per second.
For RAG with a context window of 32k this is more than enough ^_^
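To see why the bitwise variant is so much cheaper, here is a rough sketch (variable names assumed; this is not the actual zvdb code) of comparing two 1-bit-quantized vectors: XOR them and count the differing bits.

```abap
" Rough sketch, not the actual zvdb code: compare two 1-bit-quantized
" 1536-dimensional vectors with a single XOR plus a bit count.
DATA lv_a TYPE x LENGTH 192.    " quantized vector A (1536 bits)
DATA lv_b TYPE x LENGTH 192.    " quantized vector B (1536 bits)
DATA lv_xor TYPE x LENGTH 192.
DATA lv_bit TYPE i.
DATA lv_diff TYPE i.

lv_xor = lv_a BIT-XOR lv_b.     " differing bits become 1

DO 1536 TIMES.
  GET BIT sy-index OF lv_xor INTO lv_bit.
  lv_diff = lv_diff + lv_bit.
ENDDO.

DATA(lv_matches) = 1536 - lv_diff.  " more matching bits = more similar vectors
```

Brute-force search then simply repeats this comparison for every stored vector and keeps the best-ranked matches.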
### Memory footprint
A floating-point variable (type f) takes 8 bytes in memory, so one floating-point vector takes 1536 * 8 = **12288 bytes** (~12 KB).
A quantized vector takes **192 bytes**. 1 GB therefore holds about 87 thousand fp-vectors or 5.5 million quantized vectors.
The memory footprint is 64 times smaller. Nice! =)
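As a simplified sketch of where the 192 bytes come from (sign-based quantization is assumed here purely for illustration; the library's actual hyperplane-based scheme is described in the article linked above), each dimension contributes exactly one bit:

```abap
" Simplified sketch, not the actual zvdb quantizer: one bit per dimension,
" so a 1536-dimensional vector packs into 1536 / 8 = 192 bytes.
DATA lt_vec TYPE STANDARD TABLE OF f WITH EMPTY KEY.  " 1536 float components, filled elsewhere
DATA lv_q1b TYPE x LENGTH 192.                        " packed 1-bit representation

DATA(lv_pos) = 0.
LOOP AT lt_vec INTO DATA(lv_comp).
  lv_pos = lv_pos + 1.
  IF lv_comp >= 0.
    SET BIT lv_pos OF lv_q1b.   " non-negative component -> bit set to 1
  ENDIF.
ENDLOOP.
```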
## Sample Usage
### Query DB:
show code...
```ABAP
DATA(lo_lib) = zcl_vdb_002_lib=>new( ).   "instantiate library
DATA(ls_v)   = lo_lib->read_vector( id ). "read vector from DB by its GUID
DATA(lt_q)   = lo_lib->query( ls_v-q1b ). "query DB with the quantized 1536-dimensional vector
cl_demo_output=>display( lt_q ).          "observe ranked result
```

### Answer with RAG:
show code...
```abap
DATA(lo_e) = zcl_vdb_002_embedding_full=>new( ). "using https://github.com/microsoft/aisdkforsapabap
DATA(lv_text) = 'Answer to the Ultimate Question of Life, the Universe, and Everything'.
* Embed and save text, retrieve ID
DATA(lv_id_q) = lo_e->embed_and_save( lv_text )-id.
COMMIT WORK AND WAIT.

* Generate an answer as vector
DATA(lx_a) = lo_e->answer( lv_text ).
*--------------------------------------------------------------------*
DATA(lo_lib) = zcl_vdb_002_lib=>new( ).
DATA(lt_q) = lo_lib->query_by_id( lv_id_q ). "get chunks semantically close to the question
* Query for answers
DATA(lt_a) = lo_lib->query( lx_a ). "get chunks semantically close to the "modelled answer"

* Combine, sort, and remove duplicates from query results
DATA(lt_all) = lt_q.
APPEND LINES OF lt_a TO lt_all.
SORT lt_all BY id.
DELETE ADJACENT DUPLICATES FROM lt_all.

* Prepare for the RAG (Retrieval Augmented Generation) method
DATA: lv_prompt TYPE string.
DATA: lt_used TYPE zcl_vdb_002_lib=>tt_vector.

* Execute RAG with text and combined results, get prompt and used vectors
DATA(lv_rag_answer) = lo_e->rag(
EXPORTING iv_ = lv_text
it_ = lt_all
IMPORTING ev_prompt = lv_prompt
et_used = lt_used
).

cl_demo_output=>write( lv_rag_answer ). "observe RAG answer
cl_demo_output=>write( lv_prompt ). "check out the resulting prompt
cl_demo_output=>display( ).
```

## Dependencies
The library is self-contained for the core functionality of the Vector Search.
However, if you want to generate embeddings, you might want to install the [OpenAI ABAP SDK](https://github.com/microsoft/aisdkforsapabap). To instantiate the SDK you might need to use your own private keys; you can store them locally in a simple key-value .env file, something like:
```.env
#https://<resource>.openai.azure.com/openai/deployments/<deployment>/chat/completions?api-version=2023-07-01-preview
#https://<resource>.openai.azure.com/openai/deployments/<deployment>/chat/completions?api-version=<api-version>
API_URL=https://<resource>.openai.azure.com/
API_VER=2023-07-01-preview
API_KEY=
API_DEP=
API_DEP_EMBED=
```

show code...
```ABAP
DATA(lo_) = zcl_vdb_000_dotenv=>new( lv_path ).
DATA(lv_api_key) = lo_->v( 'API_KEY' ).

DATA(ls_) = VALUE ts_env(
api_url = lo_->v( k = 'API_URL' )
api_ver = lo_->v( k = 'API_VER' )
api_key = lo_->v( k = 'API_KEY' )
api_dep = lo_->v( k = 'API_DEP' )
api_dep_embed = lo_->v( k = 'API_DEP_EMBED' )
).
```
## Installation
Install this repository using [abapGit](https://github.com/abapGit/abapGit#abapgit).
- Run the report **zvdb_002_demo_01_upload** to upload sample vector embeddings from **sample_embeddings.tsv**.
- Now you can use **zvdb_002_demo_02_query** to query the DB.
- Run **zvdb_002_demo_03_reindex** to recalibrate the heuristic hyperplane hash and reindex.

© 2023 Alice Vinogradova. For more information, see the [LICENSE](LICENSE) file in this repository.