https://github.com/deepset-ai/rag-with-nvidia-nims

🚀 Use NVIDIA NIMs with Haystack pipelines
https://github.com/deepset-ai/rag-with-nvidia-nims

ai deployment haystack llm

Last synced: 3 months ago
JSON representation

🚀 Use NVIDIA NIMs with Haystack pipelines

Host: GitHub
URL: https://github.com/deepset-ai/rag-with-nvidia-nims
Owner: deepset-ai
Created: 2024-05-08T09:30:23.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-09-04T12:01:26.000Z (10 months ago)
Last Synced: 2025-03-22T13:22:17.390Z (4 months ago)
Topics: ai, deployment, haystack, llm
Language: Python
Homepage: https://haystack.deepset.ai/blog/haystack-nvidia-nim-rag-guide
Size: 632 KB
Stars: 30
Watchers: 0
Forks: 8
Open Issues: 2
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Build Air-Gapped RAG with Nvidia NIMs and Haystack

📚 This repository is accompanied by our article ["Building RAG Applications with NVIDIA NIM and Haystack on K8s"](https://haystack.deepset.ai/blog/haystack-nvidia-nim-rag-guide)

> Info: This repo is set up to use models hosted and accessible via https://build.nvidia.com/
>
> These models are already available and you can use them by creating yourself API keys through the platform.
> The project is set up so that you can change these models to NIM deployments by setting the `model` name and `api_url` in the `NvidiaGenerator`, `NvidiaDocumentEmbedder` and `NvidiaTextEmbedder` components.
>
> 👩🏻‍🍳 We also provide a notebook on Haystack Cookbooks that provide the same code and setup, only expecting self-hosted NIMs
>
>

## Run with Docker

1. `pip install -r requirements.txt`
2. Create a `.env` file and add `NVIDIA_API_KEY` (if you're using hosted models via https://build.nvidia.com/)
3. `docker-compose up`
6. `hayhooks deploy rag.yaml`
7. Go to `localhost:1416/docs` to interact with your RAG pipeline

## File Structure

- `indexing.py`: This script preproecesses, embeds and writes ChipNemo.pdf into a Qdrant database
- `rag.py`: This scripts runs a RAG pipeline with a NIM LLM and retrieval model.
- `Dockerfile`: This is used by the docker-compose file to install dependencies
- `docker-compose.yml`: This is the docker compose file we use to spin up a container for hayhooks (Haystack pipeline deployment) and Qdrant
- `rag.yaml`: This is the serialized RAG pipeline which is the same as `rag.py` in YAML. We use this to deploy our pipeline with hayhooks
- : This notebook shows you how you can set up your components to use self-hosted NIMs.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/deepset-ai/rag-with-nvidia-nims

Awesome Lists containing this project

README