Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/egorsmkv/qdrant-ukrainian

Some experiments with Qdrant for Ukrainian search using embeddings
https://github.com/egorsmkv/qdrant-ukrainian

qdrant ukrainian vector-search

Last synced: 25 days ago
JSON representation

Some experiments with Qdrant for Ukrainian search using embeddings

Awesome Lists containing this project

README

        

# Test Qdrant vector similarity search engine for Ukrainian

Dataset:

Search result:

---

Tested with Qdrant 1.10.0

## Install deps

### Create an env

```
uv venv --python 3.12
```

### Activate the env

```
source .venv/bin/activate
```

### Install libraries

```
uv pip install qdrant-client sentence-transformers polars sentencepiece
```

optional:

```
uv pip install ruff
```

## Run

### Start qdrant server

```
docker run -p 6333:6333 qdrant/qdrant
```

### Download dataset and convert from Parquet to JSON

```
python download_dataset.py
```

### Embedd docs

```
python embedd_docs.py
```

Also, you can see them in the UI: http://localhost:6333/dashboard

### Search docs

```
python search_docs.py
```

## Lint and format the code

```
ruff check
ruff format
```