https://github.com/qdrant/search-feedback-loop

Using discovery API for improving agentic RAG
https://github.com/qdrant/search-feedback-loop

Last synced: 2 months ago
JSON representation

Using discovery API for improving agentic RAG

Host: GitHub
URL: https://github.com/qdrant/search-feedback-loop
Owner: qdrant
Created: 2025-02-19T12:53:59.000Z (about 1 year ago)
Default Branch: master
Last Pushed: 2025-02-19T13:49:18.000Z (about 1 year ago)
Last Synced: 2025-06-29T10:07:08.608Z (11 months ago)
Language: Python
Size: 9.77 KB
Stars: 3
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Search Feedback Loop
## Idea:
Teach agents to use Discovery API instead of a bad query reformulation.
## Goal:
Better Agentic RAG results.

## Basic Experiment:
- `all-MiniLM-L6-v2` as main model;
- `mxbai-embed-large-v1` as agent;
- BEIR datasets for eval;
- `precision@1` metric

### Expensive agent scenario
top-10 results of `all-MiniLM-L6-v2` reranked with `mxbai-embed-large-v1`

### Discovery-aware agent scenario
1. top-3 results of `all-MiniLM-L6-v2` reranked with `mxbai-embed-large-v1`
If any results in top-3 changed their order, we have feedback from the agent -- context for discovery
2. Discovery with `positive context` (top-1 reranked) and `negative context` (top-3 reranked) using `all-MiniLM-L6-v2`, results from 1. excluded
3. Reranking discovered top-3 with `mxbai-embed-large-v1`
Selecting the best top-1 result from 1 and 3 based on the `mxbai-embed-large-v1` score.

## How to run
BEIR datasets folders should be downloaded and put on the same level as scripts;
In the current set-up, `Qdrant Cloud` is used, and credentials are taken from `config.ini`.
So, to use it also with Cloud, `config.ini` should be changed with your credentials.

This is the example for running scripts on `FiQa-2018.`
1. `indexing.py`

```bash
python indexing.py --dataset_path nfcorpus/corpus.jsonl --total-points-in-dataset 3600 --collection-name "discovery_agents"
```
2. `evaluating.py`

```bash
python evaluating.py --input-path-queries nfcorpus/queries.jsonl --input-path-qrels nfcorpus/qrels/test.tsv --collection-name "discovery_agents" --total-queries-in-dataset 323
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/qdrant/search-feedback-loop

Awesome Lists containing this project

README