https://github.com/seeed-projects/rag_based_on_jetson
This project implements RAG (retrieval-augmented generation) on Jetson and supports TXT and PDF document formats. It uses MLC for 4-bit quantization of the Llama2-7b model, ChromaDB as the vector database, and LlamaIndex to connect these components. I hope you like this project.
- Host: GitHub
- URL: https://github.com/seeed-projects/rag_based_on_jetson
- Owner: Seeed-Projects
- License: MIT
- Created: 2024-04-19T01:33:17.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-05-16T08:36:02.000Z (about 1 year ago)
- Last Synced: 2024-11-08T08:51:06.635Z (6 months ago)
- Topics: chromadb, jetson, llama-index, llama2-7b, mlc
- Language: Python
- Homepage:
- Size: 12.7 MB
- Stars: 7
- Watchers: 0
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# RAG_based_on_Jetson
This project implements RAG on Jetson and supports TXT and PDF document formats. It uses MLC for 4-bit quantization of the Llama2-7b model, ChromaDB as the vector database, and LlamaIndex to connect these components. I hope you like this project.
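For orientation, here is a minimal sketch of how these pieces fit together. It assumes the llama-index 0.9-era API; the `./docs` folder and the `jetson_rag` collection name are illustrative, and the project's own `RAG.py` may differ:

```python
# Minimal sketch of the RAG pipeline (assumes the llama-index 0.9-era API;
# "./docs" and "jetson_rag" are illustrative names, not from the project).
import chromadb
from llama_index import SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.vector_stores import ChromaVectorStore

# Load TXT/PDF documents from a local folder.
documents = SimpleDirectoryReader("./docs").load_data()

# Use ChromaDB as the vector database.
chroma_client = chromadb.Client()
collection = chroma_client.create_collection("jetson_rag")
vector_store = ChromaVectorStore(chroma_collection=collection)
storage_context = StorageContext.from_defaults(vector_store=vector_store)

# Build the index and query it; in this project the query engine's LLM
# would be the MLC 4-bit quantized Llama2-7b rather than the default.
index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)
query_engine = index.as_query_engine()
print(query_engine.query("What is the reComputer J4012?"))
```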
# Hardware Preparation

Here I use the reComputer J4012, powered by the NVIDIA [Jetson Orin NX 16GB](https://www.seeedstudio.com/reComputer-J4012-p-5586.html); this project's peak RAM usage is 11.7 GB.
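Before launching, you may want to confirm the board has enough free memory for that peak. A quick, illustrative check with psutil (not a project dependency):

```python
# Illustrative pre-flight check against the README's reported 11.7 GB peak.
# psutil is not a project dependency; install it with `pip install psutil`.
import psutil

PEAK_GB = 11.7  # peak RAM usage reported by this README
available_gb = psutil.virtual_memory().available / 1024**3
print(f"Available RAM: {available_gb:.1f} GB")
if available_gb < PEAK_GB:
    print("Warning: less free RAM than the reported peak; close apps or add swap.")
```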
# Run this project

## Step 1: Prepare the environment

```
# Install jetson-containers and its requirements
git clone --depth=1 https://github.com/dusty-nv/jetson-containers
cd jetson-containers
pip install -r requirements.txt
cd data
```

```
# Install the RAG project and the 4-bit quantized llama2-7b model
git clone https://github.com/Seeed-Projects/RAG_based_on_Jetson.git
sudo apt-get install git-lfs
cd RAG_based_on_Jetson
git clone https://huggingface.co/JiahaoLi/llama2-7b-MLC-q4f16-jetson-containers
cd ..
```

## Step 2: Run and enter the Docker container
```
cd .. && ./run.sh $(./autotag mlc)
```

```
# These commands run inside the Docker container
cd data/RAG_based_on_Jetson && pip install -r requirements.txt
pip install chromadb==0.3.29
```

> Note: If an error appears at this step, you can ignore it.
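To confirm the pinned ChromaDB version installed correctly inside the container, a quick check:

```python
# Quick check that the pinned ChromaDB version is installed
# (run inside the container).
from importlib.metadata import version

print(version("chromadb"))  # expected: 0.3.29
```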

## Step 3: Run the project
```
# Command run in docker
python3 RAG.py
```
# Result
Below is the live demo; the blue text is the context retrieved from ChromaDB, which serves as the context for the question.

[Demo video](https://youtu.be/v1SDRko5cNM)
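For reference, terminal output can be colored blue with ANSI escape codes like this (an illustrative snippet, not necessarily the project's exact code):

```python
# Illustrative: print retrieved context in blue with ANSI escape codes
# (not necessarily how RAG.py does it).
BLUE, RESET = "\033[94m", "\033[0m"

def show_context(chunks):
    """Print each retrieved chunk in blue, as in the demo."""
    for chunk in chunks:
        print(f"{BLUE}{chunk}{RESET}")

show_context(["The reComputer J4012 is powered by the Jetson Orin NX 16GB."])
```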