https://github.com/qdrant/page-search
Neural search for web-sites, docs, articles - online!
https://github.com/qdrant/page-search
Last synced: 11 months ago
JSON representation
Neural search for web-sites, docs, articles - online!
- Host: GitHub
- URL: https://github.com/qdrant/page-search
- Owner: qdrant
- License: apache-2.0
- Created: 2022-08-22T14:40:56.000Z (almost 4 years ago)
- Default Branch: master
- Last Pushed: 2024-10-22T08:58:23.000Z (over 1 year ago)
- Last Synced: 2024-10-23T12:13:28.125Z (over 1 year ago)
- Language: Rust
- Size: 1.71 MB
- Stars: 126
- Watchers: 4
- Forks: 7
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Speedy Semantic Search
This project has two implementations, one in Python and one in Rust. See [dev-docs.md](./dev-docs.md) for an overview over the Python version and tools, and the [rust_search README](./rust_search) for one on the Rust version and helpers.
Both the python and Rust version contain a service that is able to use a Qdrant vector search engine to do a semantic search in the matter of milliseconds. The python version does a text search below 4 characters, while the Rust version always does a full semantic search.
Both versions use the [all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) model made available by Hugging Face for embedding, and [Qdrant](https://qdrant.tech) for searching.
The services both have the same `GET /api/search` endpoint with the following arguments:
* `q` contains the term that is going to be searched.
* `section` contains a section that is added by the python version's crawler that can be used to additionally filter the search to a particular section of the page. This argument is optional.
The code in this project powers the [Qdrant documentation](https://qdrant.tech/documentation)'s search box.