https://github.com/leokwsw/local-rag

A local rag demo
https://github.com/leokwsw/local-rag

embedding-models huggingface llm openai python rag unstructured weaviate

Last synced: about 1 month ago
JSON representation

A local rag demo

Host: GitHub
URL: https://github.com/leokwsw/local-rag
Owner: leokwsw
Created: 2023-12-12T11:25:00.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2025-05-09T11:12:41.000Z (about 1 year ago)
Last Synced: 2025-12-26T19:17:37.816Z (6 months ago)
Topics: embedding-models, huggingface, llm, openai, python, rag, unstructured, weaviate
Language: Python
Homepage:
Size: 19.5 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Demo - Local RAG Application Using Unstructured

This repository features a simple notebook which demonstrates how to use [Unstructured](https://unstructured.io/) to
ingest and pre-process documents for a local Retrieval-Augmented-Generation (RAG) application

The goal of this repo is not use any cloud services or external APIs and to run everything locally. This demonstrates
RAG applications can be built with siloed infrastructure.

## Setup Steps

1. Install [Python](https://www.python.org/downloads/). Please use version 3.9 or later.

2. Install Docker, Docker-Compose and Docker Desktop and make sure Docker Desktop is
running. [Installation instructions are here](https://docs.docker.com/compose/install/)

3. Clone this repository by running the following command

```bash
git clone git@github.com:leokwsw/local-rag.git
```

4. CD into this repository locally, create a virtual environment, install the requirements

```bash
cd local-RAG #enter local-RAG directory
python3.10 -m venv env #create venv called env
source env/bin/activate #activate environment
pip install -r requirements.txt #install required packages
```

5. Install and Download LLama2 CCP Model. This is slightly different for every OS, but here is a link
with [download instructions](https://github.com/ggerganov/llama.cpp#obtaining-and-using-the-facebook-llama-2-model).
Below is how to install + download on MAC.

```bash
mkdir model_files #make model files folder to store Llama 2 model files
CMAKE_ARGS="-DLLAMA_METAL=on" FORCE_CMAKE=1 pip install llama-cpp-python
#install llama-cpp-python package made for MAC Silicon chips
huggingface-cli download TheBloke/Llama-2-7b-Chat-GGUF --local-dir model_files --local-dir-use-symlinks False --include='*Q4_K*gguf' #download model
huggingface-cli download TheBloke/Mistral-7B-Instruct-v0.2-GGUF --local-dir model_files --local-dir-use-symlinks False --include='*Q4_K*gguf'

```

6. Start Docker Container to Spin up Weaviate VectorDB

```bash
docker-compose up -d
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/leokwsw/local-rag

Awesome Lists containing this project

README