https://github.com/tensorchord/vectordb-benchmark
https://github.com/tensorchord/vectordb-benchmark
Last synced: about 1 year ago
JSON representation
- Host: GitHub
- URL: https://github.com/tensorchord/vectordb-benchmark
- Owner: tensorchord
- Created: 2024-01-04T06:47:31.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-02-01T11:00:43.000Z (over 2 years ago)
- Last Synced: 2024-09-21T12:52:18.439Z (almost 2 years ago)
- Language: Python
- Size: 19.5 KB
- Stars: 3
- Watchers: 7
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Vector DB Benchmark
Supported databases/extensions:
- [x] [`pgvecto.rs`](https://github.com/tensorchord/pgvecto.rs)
- [x] [`pgvector`](https://github.com/pgvector/pgvector)
- [x] [`qdrant`](https://github.com/qdrant/qdrant/)
Supported datasets:
- [x] random generated
- [x] GIST 960
- [x] GLOVE
- [x] Deep Image
- [x] LAION
For more information, check the [source.py](./vector_bench/dataset/source.py).
## Installation
```bash
pip install vector_bench
```
## Run
### Server
Run the docker compose file under [`server`](server/) folder.
```base
docker compose -f docker/compose.${DB_NAME}.yaml up -d
```
### Client
```bash
# help
vector_bench --help
# only insert the data
vector_bench --insert --url postgresql://postgres:password@127.0.0.1:5432/postgres -s gist_960_l2
# only query the data (make sure the data is already inserted)
vector_bench --query --url postgresql://postgres:password@localhost:5432/postgres -s gist_960_l2
# insert and query the data
vector_bench --insert --query --url postgresql://postgres:password@localhost:5432/postgres -s gist_960_l2
```
## How to contribute
```bash
# install all the necessary dependencies:
make dev
# format code
make format
# lint
make lint
```
### Add more datasets
- Add new `DatasetConfig` to `vector_bench/dataset/source.py`
### Add more clients
- Inherit and implement the `BaseClient` class in `vector_bench/client/base.py`