https://github.com/metaskills/unremarkable-vh-rag

Host: GitHub
URL: https://github.com/metaskills/unremarkable-vh-rag
Owner: metaskills
License: mit
Created: 2024-03-05T12:48:04.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-04-03T19:15:32.000Z (over 1 year ago)
Last Synced: 2025-04-01T17:13:18.688Z (7 months ago)
Language: JavaScript
Size: 4.68 MB
Stars: 0
Watchers: 2
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.md

# unRemarkable.ai GPT vs. Assistants Demo

Exploring a panel-of-experts model for assistants.

* Mixing Vertical and Horizontal RAG Architectures
* Faceted Search, Semantic Retrieval, & Data Analysis

## Setup

This project uses [Dev Containers](https://containers.dev/), so you can open it in any supporting IDE and get started right away. The recommended approach is [VS Code with Dev Containers](https://www.youtube.com/watch?v=b1RavPr_878).

Once opened in your development container, create a `.env.development.local` file with your OpenAI API key:

```
OPENAI_API_KEY=sk-...
```

Now you can run the following commands:

```bash
npm install
npm run db:create
npm run db:index
```

## Demo Commands

These commands run the OpenAI Assistants API demos. The `demo:gpt` command mimics a Custom GPT's knowledge retrieval. The `demo:rag` command runs the same Q&A sequence with a panel of experts using the Assistants API.

```bash
npm run demo:gpt
npm run demo:rag
```

Images created by any run step, such as charts created by code interpreter, are saved to the `./files` directory, which is ignored by Git. Use the following environment variables to customize the scripts' behavior.

- `DEBUG`: Set to any value to enable debug 🪲 logging.
- `KNOWLEDGE_FORMAT`: Selects the knowledge format to process. Options: `csv`, `json`, `md`. Defaults to `csv`. Only used by the `demo:gpt` script.
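
As an illustration of how these two variables could be interpreted, here is a minimal sketch. The helper names (`resolveKnowledgeFormat`, `isDebugEnabled`, `debug`) are hypothetical and are not taken from this repository; the actual scripts may read the variables differently.

```javascript
// Hypothetical helpers: a sketch of the documented behavior, not the repo's code.
const KNOWLEDGE_FORMATS = ["csv", "json", "md"];

// Returns the requested knowledge format, falling back to "csv" when unset.
function resolveKnowledgeFormat(env = process.env) {
  const value = env.KNOWLEDGE_FORMAT || "csv";
  if (!KNOWLEDGE_FORMATS.includes(value)) {
    throw new Error(
      `Unsupported KNOWLEDGE_FORMAT "${value}". Expected one of: ${KNOWLEDGE_FORMATS.join(", ")}`
    );
  }
  return value;
}

// "Set to any value" is read here as: any non-empty string enables debugging.
function isDebugEnabled(env = process.env) {
  return Boolean(env.DEBUG);
}

// Logs only when DEBUG is set.
function debug(message, env = process.env) {
  if (isDebugEnabled(env)) console.log(`🪲 ${message}`);
}
```

With npm scripts, the variables can be set inline, for example `DEBUG=1 KNOWLEDGE_FORMAT=json npm run demo:gpt`.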

## Access OpenSearch

To access the OpenSearch Dashboards hosted in the dev container, use the following URL. Use the `admin` username and `admin` password to log in.

```
http://localhost:5601
```

## Luxury Apparel Data

The data used for this demonstration is from Kaggle.

https://www.kaggle.com/datasets/chitwanmanchanda/luxury-apparel-data

Here are the categories in the dataset. Use these to test your own pre-filtered queries.

* Accessories
* Activewear
* Jackets/Coats
* Jewelry
* Pants
* Shirts
* Shoes
* Suits
* Sweaters
* Underwear and Nightwear
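
To make "pre-filtered" concrete, the sketch below builds an OpenSearch query body that restricts results to one category before matching on text. The field names `category` and `description` and the function `buildPrefilteredQuery` are assumptions for illustration; the index created by `npm run db:index` may use different field names or a vector query instead of a plain `match`.

```javascript
// Categories listed in this README.
const CATEGORIES = [
  "Accessories", "Activewear", "Jackets/Coats", "Jewelry", "Pants",
  "Shirts", "Shoes", "Suits", "Sweaters", "Underwear and Nightwear",
];

// Hypothetical helper: builds a query body with a category pre-filter.
// `category` and `description` are assumed field names, not confirmed by the repo.
function buildPrefilteredQuery(category, text, size = 5) {
  if (!CATEGORIES.includes(category)) {
    throw new Error(`Unknown category: ${category}`);
  }
  return {
    size,
    query: {
      bool: {
        // The filter clause narrows the candidate set without affecting scoring.
        filter: [{ term: { category } }],
        // The must clause scores the remaining documents against the text.
        must: [{ match: { description: text } }],
      },
    },
  };
}

const body = buildPrefilteredQuery("Shoes", "leather loafers");
console.log(JSON.stringify(body, null, 2));
```

The resulting body can be pasted into the Dev Tools console of OpenSearch Dashboards after adjusting the field names to match the index.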

## Notes

Consider first-tier pass-through tool data. For example, if `products` responds with `Yes.`, is there any guarantee about whether the orchestrator echoes that message or answers in its own words? **Should `products` return raw tool data only, or an LLM response to that data?**
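
The two options in this question can be sketched as follows. This is a hypothetical illustration, not code from this repository: `toolOutput` and its `mode` argument are invented names, and the only API detail assumed is that the Assistants API accepts each tool output as a string paired with a `tool_call_id`.

```javascript
// Hypothetical sketch: two ways an expert such as `products` could answer the orchestrator.
// mode "raw" passes the tool data through untouched (serialized as JSON);
// mode "llm" passes the expert's own natural-language answer instead.
function toolOutput(toolCallId, { rawData, llmAnswer }, mode = "raw") {
  if (mode !== "raw" && mode !== "llm") {
    throw new Error(`Unknown mode: ${mode}`);
  }
  const output = mode === "raw" ? JSON.stringify(rawData) : llmAnswer;
  return { tool_call_id: toolCallId, output };
}

const result = { rawData: { inStock: true, count: 12 }, llmAnswer: "Yes." };
console.log(toolOutput("call_1", result, "raw"));
console.log(toolOutput("call_1", result, "llm"));
```

In the raw mode the orchestrator has to phrase the answer itself, so nothing is echoed verbatim; in the LLM mode the orchestrator may echo or rephrase the expert's wording, which is the ambiguity raised above.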
