https://github.com/weaviate/contextionary

Weaviate's own language vectorizer, which allows for semantic context-based searches in Weaviate

# Weaviate Contextionary

> The contextionary powers the semantic, context-based searches in Weaviate.

Not intended for stand-alone use. Used by [Weaviate - the ML-first vector
search engine](https://github.com/weaviate/weaviate).

## Versioning

The version tag has the form `<language><contextionary-version>-v<app-version>`.
For example, the app version `0.1.0` deployed with the [contextionary vector db
version](https://c11y.semi.technology/contextionary.json) `0.6.0` of the
English language will have the version `en0.6.0-v0.1.0`. This also
corresponds to the Docker tag.
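
As a rough illustration of the tag scheme described above, the following sketch composes a tag from its three parts. The function name `c11y_tag` and its argument order are hypothetical and not part of this repository's scripts.

```bash
#!/usr/bin/env bash
# Sketch only: compose a contextionary version tag from its parts.
# `c11y_tag` is a hypothetical helper, not a script shipped in this repo.
c11y_tag() {
  local language="$1"       # language code, e.g. en
  local model_version="$2"  # contextionary vector db version, e.g. 0.6.0
  local app_version="$3"    # app version, e.g. 0.1.0
  printf '%s%s-v%s\n' "$language" "$model_version" "$app_version"
}

c11y_tag en 0.6.0 0.1.0   # prints en0.6.0-v0.1.0
```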

## Languages

Currently available languages include:
* `en`
* `de`
* `nl`
* `cs`
* `it`

Other languages coming soon.
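
If you script around the build, it can help to reject unknown language codes early. The sketch below assumes the list above is complete. The helper name `is_supported_language` is hypothetical, and the hard-coded list may lag behind actual releases.

```bash
#!/usr/bin/env bash
# Sketch only: `is_supported_language` is a hypothetical helper. The codes
# are copied from the list in this README and may be out of date.
is_supported_language() {
  case "$1" in
    en|de|nl|cs|it) return 0 ;;
    *) return 1 ;;
  esac
}

# Default to English when LANGUAGE is not set (an assumption of this sketch).
LANGUAGE="${LANGUAGE:-en}"
if is_supported_language "$LANGUAGE"; then
  echo "language '$LANGUAGE' is listed"
else
  echo "language '$LANGUAGE' is not listed in the README" >&2
fi
```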

## Docker Requirements

The build pipeline makes use of Docker's `buildx` for multi-arch builds. Make
sure you run a Docker version which supports `buildx` and have run `docker
buildx create --use` at least once.
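
A quick preflight check along these lines can confirm that `buildx` is reachable before starting a build. This is a minimal sketch: the helper name `buildx_available` and the `DOCKER` override variable are assumptions made for illustration, not part of the build pipeline.

```bash
#!/usr/bin/env bash
# Sketch only: `buildx_available` is a hypothetical helper. DOCKER is an
# assumed override so the check can be pointed at a different docker binary.
buildx_available() {
  "${DOCKER:-docker}" buildx version >/dev/null 2>&1
}

if buildx_available; then
  echo "docker buildx is available"
else
  echo "docker buildx not found; install a Docker version with buildx," \
       "then run: docker buildx create --use" >&2
fi
```

The check only reports whether the `buildx` subcommand responds. It does not verify that a builder has already been created with `docker buildx create --use`.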

## How to build and test the project

1. Regenerate schema:

```bash
./gen_proto_code.sh
```

2. Build image:

```bash
LANGUAGE=en MODEL_VERSION=0.16.0 ./build.sh
```

3. Run journey tests:

```bash
LANGUAGE=en MODEL_VERSION=0.16.0 ./build.sh && DIMENSIONS=300 ./test/journey.sh
```