https://github.com/redis/redis-vl-java

Redis Vector Library (RedisVL) -- the AI-native Java client for Redis.
https://github.com/redis/redis-vl-java
agentic-ai ai embeddings generative-ai java llm llm-cache rag rag-chatbot redis semantic-cache semantic-routing vector-database vector-search vectors
Last synced: 5 months ago
JSON representation
Redis Vector Library (RedisVL) -- the AI-native Java client for Redis.
Host: GitHub
URL: https://github.com/redis/redis-vl-java
Owner: redis
Created: 2025-10-02T19:06:54.000Z (10 months ago)
Default Branch: main
Last Pushed: 2025-12-19T00:39:18.000Z (7 months ago)
Last Synced: 2026-02-07T17:50:24.156Z (6 months ago)
Topics: agentic-ai, ai, embeddings, generative-ai, java, llm, llm-cache, rag, rag-chatbot, redis, semantic-cache, semantic-routing, vector-database, vector-search, vectors
Language: Java
Homepage: https://redis.github.io/redis-vl-java/
Size: 2.16 MB
Stars: 15
Watchers: 0
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project

README

          


    

    Vector Library 





    The AI-native Redis Java client

    


[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

![Language](https://img.shields.io/badge/language-Java-orange)

![Java Version](https://img.shields.io/badge/Java-17%2B-blue)

[![Maven Central](https://img.shields.io/maven-central/v/com.redis/redisvl)](https://central.sonatype.com/artifact/com.redis/redisvl)

[![Snapshots](https://img.shields.io/nexus/s/https/s01.oss.sonatype.org/com.redis/redisvl.svg)](https://s01.oss.sonatype.org/content/repositories/snapshots/com/redis/redisvl/)

![GitHub last commit](https://img.shields.io/github/last-commit/redis/redis-vl-java)

[![GitHub stars](https://img.shields.io/github/stars/redis/redis-vl-java)](https://github.com/redis/redis-vl-java/stargazers)







    Home   

    Documentation   

    Recipes   

  

    




# Introduction

Redis Vector Library (RedisVL) is the ultimate Java client designed for AI-native applications harnessing the power of [Redis](https://redis.io).

RedisVL is your go-to client for:

- Lightning-fast information retrieval & vector similarity search

- Real-time RAG pipelines

- LLM semantic caching

- Hybrid search combining vectors and metadata

# 💪 Getting Started

## Installation

Add RedisVL to your Java (17+) project using Maven or Gradle:

**Maven:**

```xml

    com.redis

    redisvl

    0.12.2

```

**Gradle:**

```gradle

implementation 'com.redis:redisvl:0.12.2'

```

## Setting up Redis

Choose from multiple Redis deployment options:

1. [Redis Cloud](https://redis.io/try-free): Managed cloud database (free tier available)

2. [Redis on Docker](https://hub.docker.com/_/redis): Docker image for development

    ```bash

    docker run -d --name redis -p 6379:6379 -p 8001:8001 redis:latest

    ```

3. [Redis Enterprise](https://redis.io/enterprise/): Commercial, self-hosted database

4. [Azure Managed Redis](https://azure.microsoft.com/en-us/products/managed-redis): Fully managed Redis Enterprise on Azure

> Enhance your experience and observability with the free [Redis Insight GUI](https://redis.io/insight/).

# Overview

## 🗃️ Redis Index Management

1. **Design a schema** for your use case that models your dataset with built-in Redis and indexable fields (*e.g. text, tags, numerics, geo, and vectors*). Load a schema from a YAML file:

    ```yaml

    index:

      name: user-idx

      prefix: user

      storage_type: json

    fields:

      - name: user

        type: tag

      - name: credit_score

        type: tag

      - name: embedding

        type: vector

        attrs:

          algorithm: flat

          dims: 4

          distance_metric: cosine

          datatype: float32

    ```

    ```java

    import com.redis.vl.schema.IndexSchema;

    IndexSchema schema = IndexSchema.fromYaml("schemas/schema.yaml");

    ```

    Or load directly from a Java Map:

    ```java

    Map schemaMap = Map.of(

        "index", Map.of(

            "name", "user-idx",

            "prefix", "user",

            "storage_type", "json"

        ),

        "fields", List.of(

            Map.of("name", "user", "type", "tag"),

            Map.of("name", "credit_score", "type", "tag"),

            Map.of(

                "name", "embedding",

                "type", "vector",

                "attrs", Map.of(

                    "algorithm", "flat",

                    "datatype", "float32",

                    "dims", 4,

                    "distance_metric", "cosine"

                )

            )

        )

    );

    IndexSchema schema = IndexSchema.fromMap(schemaMap);

    ```

2. **Create a SearchIndex** class with an input schema to perform admin and search operations on your index in Redis:

    ```java

    import redis.clients.jedis.UnifiedJedis;

    import com.redis.vl.index.SearchIndex;

    // Define the index

    UnifiedJedis jedis = new UnifiedJedis("redis://localhost:6379");

    SearchIndex index = new SearchIndex(schema, jedis);

    // Create the index in Redis

    index.create(true); // overwrite if exists

    ```

3. **Load and fetch** data to/from your Redis instance:

    ```java

    Map data = Map.of(

        "user", "john",

        "credit_score", "high",

        "embedding", new float[]{0.23f, 0.49f, -0.18f, 0.95f}

    );

    // load list of maps, specify the "id" field

    index.load(List.of(data), "user");

    // fetch by "id"

    Map john = index.fetch("john");

    ```

## 🔍 Retrieval

Define queries and perform advanced searches over your indices, including the combination of vectors, metadata filters, and more.

- **VectorQuery** - Flexible vector queries with customizable filters enabling semantic search:

    ```java

    import com.redis.vl.query.VectorQuery;

    VectorQuery query = VectorQuery.builder()

        .vector(new float[]{0.16f, -0.34f, 0.98f, 0.23f})

        .field("embedding")

        .numResults(3)

        .build();

    // run the vector search query against the embedding field

    List> results = index.query(query);

    ```

    Incorporate complex metadata filters on your queries:

    ```java

    import com.redis.vl.query.filter.FilterQuery;

    // define a tag match filter

    FilterQuery tagFilter = FilterQuery.tag("user", "john");

    // update query definition

    VectorQuery filteredQuery = query.toBuilder()

        .withPreFilter(tagFilter.build())

        .build();

    // execute query

    List> results = index.query(filteredQuery);

    ```

- **HybridQuery** - Native hybrid search combining text and vector similarity using Redis 8.4+ `FT.HYBRID` command with built-in score fusion:

    ```java

    import com.redis.vl.query.HybridQuery;

    import com.redis.vl.query.Filter;

    // Native hybrid search with LINEAR combination (Redis 8.4+)

    HybridQuery hybridQuery = HybridQuery.builder()

        .text("machine learning algorithms")

        .textFieldName("description")

        .vector(queryVector)

        .vectorFieldName("embedding")

        .filterExpression(Filter.tag("category", "AI"))

        .combinationMethod(HybridQuery.CombinationMethod.LINEAR)

        .linearAlpha(0.3f)  // 30% text, 70% vector

        .numResults(10)

        .build();

    List> results = index.query(hybridQuery);

    // Automatically falls back to AggregateHybridQuery on older Redis versions

    ```

- **AggregateHybridQuery** - Backward-compatible hybrid search using `FT.AGGREGATE` for Redis versions before 8.4:

    ```java

    import com.redis.vl.query.AggregateHybridQuery;

    AggregateHybridQuery aggQuery = AggregateHybridQuery.builder()

        .text("machine learning algorithms")

        .textFieldName("description")

        .vector(queryVector)

        .vectorFieldName("embedding")

        .alpha(0.7f)  // 70% vector, 30% text

        .numResults(10)

        .build();

    List> results = index.query(aggQuery);

    ```

- **VectorRangeQuery** - Vector search within a defined range paired with customizable filters

- **FilterQuery** - Standard search using filters and the full-text search

- **CountQuery** - Count the number of indexed records given attributes

> Read more about building [advanced Redis queries](https://redis.github.io/redis-vl-java/redisvl/current/hybrid-queries.html).

## 🔧 Utilities

### Vectorizers

Integrate with popular embedding providers to greatly simplify the process of vectorizing unstructured data for your index and queries:

- **LangChain4J** - Integration with LangChain4J embedding models

  - OpenAI

  - Cohere

  - HuggingFace

  - Ollama

  - Vertex AI

  - Azure OpenAI

  - Mistral AI

  - Voyage AI

- **ONNX Models** - Local embedding models via ONNX Runtime

  - Sentence Transformers (all-MiniLM-L6-v2, etc.)

  - Custom ONNX models

- **DJL (Deep Java Library)** - Face recognition and embeddings

```java

import com.redis.vl.utils.vectorize.LangChain4JVectorizer;

import dev.langchain4j.model.cohere.CoherEmbeddingModel;

// Using LangChain4J with Cohere

CoherEmbeddingModel cohereModel = CoherEmbeddingModel.withApiKey(apiKey);

LangChain4JVectorizer vectorizer = new LangChain4JVectorizer(cohereModel);

float[] embedding = vectorizer.embed("What is the capital city of France?");

List embeddings = vectorizer.embedBatch(

    List.of(

        "my document chunk content",

        "my other document chunk content"

    )

);

```

```java

import com.redis.vl.utils.vectorize.SentenceTransformersVectorizer;

// Using local ONNX model

SentenceTransformersVectorizer vectorizer = new SentenceTransformersVectorizer("sentence-transformers/all-mpnet-base-v2");

float[] embedding = vectorizer.embed("What is the capital city of France?");

```

> Learn more about using [vectorizers](https://redis.github.io/redis-vl-java/redisvl/current/vectorizers.html) in your embedding workflows.

### Rerankers

Integrate with popular reranking providers to improve the relevancy of the initial search results from Redis:

- **Cohere Reranker** - Using Cohere's reranking API

- **HuggingFace Cross-Encoder** - Local reranking with cross-encoder models

```java

import com.redis.vl.utils.rerank.CohereReranker;

CohereReranker reranker = new CohereReranker(apiKey);

List> reranked = reranker.rerank(

    "What is the capital of France?",

    results,

    topK

);

```

## 💫 Extensions

RedisVL supports **Extensions** that implement interfaces exposing best practices and design patterns for working with LLM memory and agents.

### LLM Semantic Caching

Increase application throughput and reduce the cost of using LLM models in production by leveraging previously generated knowledge with the `SemanticCache`.

```java

import com.redis.vl.extensions.cache.llm.SemanticCache;

import com.redis.vl.extensions.cache.llm.CacheConfiguration;

// init cache with TTL and semantic distance threshold

CacheConfiguration config = CacheConfiguration.builder()

    .name("llmcache")

    .ttl(360L)

    .distanceThreshold(0.1)

    .build();

SemanticCache cache = new SemanticCache(config, jedis);

// store user queries and LLM responses in the semantic cache

cache.store(

    "What is the capital city of France?",

    "Paris"

);

// quickly check the cache with a slightly different prompt (before invoking an LLM)

CacheResult result = cache.check("What is France's capital city?");

if (result.isHit()) {

    System.out.println(result.getResponse());

    // Output: Paris

}

```

> Learn more about [semantic caching](https://redis.github.io/redis-vl-java/redisvl/current/llmcache.html) for LLMs.

### LLM Semantic Routing

Build fast decision models that run directly in Redis and route user queries to the nearest "route" or "topic".

```java

import com.redis.vl.extensions.router.SemanticRouter;

import com.redis.vl.extensions.router.Route;

import com.redis.vl.utils.vectorize.SentenceTransformersVectorizer

SentenceTransformersVectorizer vectorizer = new SentenceTransformersVectorizer("sentence-transformers/all-mpnet-base-v2")

List routes = List.of(

    Route.builder()

        .name("greeting")

        .references(List.of("hello", "hi"))

        .metadata(Map.of("type", "greeting"))

        .distanceThreshold(0.3)

        .build(),

    Route.builder()

        .name("farewell")

        .references(List.of("bye", "goodbye"))

        .metadata(Map.of("type", "farewell"))

        .distanceThreshold(0.3)

        .build()

);

// build semantic router from routes

SemanticRouter router = SemanticRouter.builder()

    .name("topic-router")

    .vectorizer(vectorizer)

    .routes(routes)

    .jedis(jedis)

    .build();

RouteMatch match = router.route("Hi, good morning");

System.out.println(match.getName()); // Output: greeting

System.out.println(match.getDistance()); // Output: 0.273891836405

```

> Learn more about [semantic routing](https://redis.github.io/redis-vl-java/redisvl/current/semantic-router.html).

## 🧪 Experimental: VCR Test System

RedisVL includes an experimental VCR (Video Cassette Recorder) test system for recording and replaying LLM/embedding API calls. This enables:

- **Deterministic tests** - Replay recorded responses for consistent results

- **Cost reduction** - Avoid repeated API calls during test runs

- **Speed improvement** - Local Redis playback is faster than API calls

- **Offline testing** - Run tests without network access or API keys

### Quick Start with JUnit 5

The simplest way to use VCR is with the declarative annotations:

```java

import com.redis.vl.test.vcr.VCRMode;

import com.redis.vl.test.vcr.VCRModel;

import com.redis.vl.test.vcr.VCRTest;

@VCRTest(mode = VCRMode.PLAYBACK_OR_RECORD)

class MyLLMTest {

    // Models are automatically wrapped by VCR

    @VCRModel(modelName = "text-embedding-3-small")

    private EmbeddingModel embeddingModel = createEmbeddingModel();

    @VCRModel

    private ChatLanguageModel chatModel = createChatModel();

    @Test

    void testEmbedding() {

        // First run: Records API response to Redis

        // Subsequent runs: Replays from Redis cassette

        Response response = embeddingModel.embed("What is Redis?");

        assertNotNull(response.content());

    }

    @Test

    void testChat() {

        String response = chatModel.generate("Explain Redis in one sentence.");

        assertNotNull(response);

    }

}

```

### VCR Modes

| Mode | Description | API Key Required |

|------|-------------|------------------|

| `PLAYBACK` | Only use recorded cassettes. Fails if missing. | No |

| `PLAYBACK_OR_RECORD` | Use cassette if available, record if not. | Only for first run |

| `RECORD` | Always call real API and record response. | Yes |

| `OFF` | Bypass VCR, always call real API. | Yes |

### Environment Variable Override

Override the VCR mode at runtime without changing code:

```bash

# Record new cassettes

VCR_MODE=RECORD OPENAI_API_KEY=your-key ./gradlew test

# Playback only (CI/CD, no API key needed)

VCR_MODE=PLAYBACK ./gradlew test

```

### LangChain4J Integration

```java

import com.redis.vl.test.vcr.VCREmbeddingModel;

import com.redis.vl.test.vcr.VCRChatModel;

import com.redis.vl.test.vcr.VCRMode;

// Wrap any LangChain4J EmbeddingModel

VCREmbeddingModel vcrEmbedding = new VCREmbeddingModel(openAiEmbeddingModel);

vcrEmbedding.setMode(VCRMode.PLAYBACK_OR_RECORD);

Response response = vcrEmbedding.embed("What is Redis?");

// Wrap any LangChain4J ChatLanguageModel

VCRChatModel vcrChat = new VCRChatModel(openAiChatModel);

vcrChat.setMode(VCRMode.PLAYBACK_OR_RECORD);

String response = vcrChat.generate("What is Redis?");

```

### Spring AI Integration

```java

import com.redis.vl.test.vcr.VCRSpringAIEmbeddingModel;

import com.redis.vl.test.vcr.VCRSpringAIChatModel;

import com.redis.vl.test.vcr.VCRMode;

// Wrap any Spring AI EmbeddingModel

VCRSpringAIEmbeddingModel vcrEmbedding = new VCRSpringAIEmbeddingModel(openAiEmbeddingModel);

vcrEmbedding.setMode(VCRMode.PLAYBACK_OR_RECORD);

EmbeddingResponse response = vcrEmbedding.embedForResponse(List.of("What is Redis?"));

// Wrap any Spring AI ChatModel

VCRSpringAIChatModel vcrChat = new VCRSpringAIChatModel(openAiChatModel);

vcrChat.setMode(VCRMode.PLAYBACK_OR_RECORD);

String response = vcrChat.call("What is Redis?");

```

### How It Works

1. **Container Management**: VCR starts a Redis container with persistence

2. **Model Wrapping**: `@VCRModel` fields are wrapped with VCR proxies

3. **Cassette Storage**: Responses stored as Redis JSON documents

4. **Persistence**: Data saved to `src/test/resources/vcr-data/` via Redis AOF/RDB

5. **Playback**: Subsequent runs load cassettes from persistent storage

### Demo Projects

Complete working examples are available:

- **[LangChain4J VCR Demo](demos/langchain4j-vcr/)** - LangChain4J embedding and chat models

- **[Spring AI VCR Demo](demos/spring-ai-vcr/)** - Spring AI embedding and chat models

Run the demos without an API key (uses pre-recorded cassettes):

```bash

./gradlew :demos:langchain4j-vcr:test

./gradlew :demos:spring-ai-vcr:test

```

> Learn more about [VCR testing](https://redis.github.io/redis-vl-java/redisvl/current/vcr-testing.html).

## 🚀 Why RedisVL?

In the age of GenAI, **vector databases** and **LLMs** are transforming information retrieval systems. With emerging and popular frameworks like [LangChain4J](https://github.com/langchain4j/langchain4j) and [Spring AI](https://spring.io/projects/spring-ai), innovation is rapid. Yet, many organizations face the challenge of delivering AI solutions **quickly** and at **scale**.

Enter [Redis](https://redis.io) – a cornerstone of the NoSQL world, renowned for its versatile [data structures](https://redis.io/docs/data-types/) and [processing engines](https://redis.io/docs/interact/). Redis excels in real-time workloads like caching, session management, and search. It's also a powerhouse as a vector database for RAG, an LLM cache, and a chat session memory store for conversational AI.

The Redis Vector Library bridges the gap between the AI-native developer ecosystem and Redis's robust capabilities. With a lightweight, elegant, and intuitive interface, RedisVL makes it easy to leverage Redis's power. Built on the [Jedis](https://github.com/redis/jedis) client, RedisVL transforms Redis's features into a grammar perfectly aligned with the needs of today's AI/ML Engineers and Data Scientists.

## 📚 Examples and Notebooks

Check out the [notebooks](notebooks/) directory for interactive Jupyter notebook examples:

- [Getting Started](notebooks/01_getting_started.ipynb) - Introduction to RedisVL basics

- [Hybrid Queries](notebooks/02_hybrid_queries.ipynb) - Combining vector and metadata search

- [Advanced Queries](notebooks/11_advanced_queries.ipynb) - TextQuery, HybridQuery, and MultiVectorQuery

- [LLM Cache](notebooks/03_llmcache.ipynb) - Semantic caching for LLMs

- [Hash vs JSON Storage](notebooks/05_hash_vs_json.ipynb) - Storage type comparison

- [Vectorizers](notebooks/06_vectorizers.ipynb) - Working with embedding models

- [Rerankers](notebooks/07_rerankers.ipynb) - Improving search relevance

### Running the Notebooks

The notebooks use Java kernel support via JJava. To run them:

1. **Start the notebook environment:**

   ```bash

   cd notebooks

   docker compose up -d

   ```

2. **Access Jupyter Lab:**

   Navigate to [http://localhost:8888/](http://localhost:8888/)

3. **Stop when done:**

   ```bash

   docker compose down

   ```

> See the [notebooks README](notebooks/README.md) for more details.

## 😁 Helpful Links

For additional help, check out the following resources:

- [Getting Started Guide](https://redis.github.io/redis-vl-java/redisvl/current/getting-started.html)

- [API Reference (Javadoc)](https://redis.github.io/redis-vl-java/redisvl/current/_attachments/javadoc/aggregate/index.html)

- [Example Notebooks](notebooks/)

- [Redis AI Recipes](https://github.com/redis-developer/redis-ai-resources)

- [Official Redis Vector API Docs](https://redis.io/docs/interact/search-and-query/advanced-concepts/vectors/)

## 🫱🏼‍🫲🏽 Contributing

Please help us by contributing PRs, opening GitHub issues for bugs or new feature ideas, improving documentation, or increasing test coverage. [Read more about how to contribute!](CONTRIBUTING.md)

## Requirements

- Java 17+

- Redis 8.0+ (includes built-in search and vector capabilities)

- For native `FT.HYBRID` support: Redis 8.4+

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## 🚧 Maintenance

This project is supported by [Redis, Inc](https://redis.io) on a good faith effort basis. To report bugs, request features, or receive assistance, please [file an issue](https://github.com/redis/redis-vl-java/issues).
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/redis/redis-vl-java

Awesome Lists containing this project

README

Vector Library