Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with similarity-search

A curated list of projects in awesome lists tagged with similarity-search .

https://github.com/typesense/typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

algolia datastore elasticsearch enterprise-search faceting full-text-search fuzzy-search geosearch in-memory instantsearch merchandising pinecone search search-engine semantic-search similarity-search site-search synonyms typo-tolerance vector-search

Last synced: 16 Dec 2024

https://github.com/weaviate/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

approximate-nearest-neighbor-search generative-search grpc hnsw hybrid-search image-search information-retrieval mlops nearest-neighbor-search neural-search recommender-system search-engine semantic-search semantic-search-engine similarity-search vector-database vector-search vector-search-engine vectors weaviate

Last synced: 16 Dec 2024

https://github.com/semi-technologies/weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

approximate-nearest-neighbor-search generative-search grpc hnsw hybrid-search image-search information-retrieval mlops nearest-neighbor-search neural-search recommender-system search-engine semantic-search semantic-search-engine similarity-search vector-database vector-search vector-search-engine vectors weaviate

Last synced: 10 Dec 2024

https://github.com/lancedb/lancedb

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

approximate-nearest-neighbor-search image-search nearest-neighbor-search recommender-system search-engine semantic-search similarity-search vector-database

Last synced: 16 Dec 2024

https://lancedb.github.io/lancedb/

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

approximate-nearest-neighbor-search image-search nearest-neighbor-search recommender-system search-engine semantic-search similarity-search vector-database

Last synced: 13 Nov 2024

https://github.com/unum-cloud/usearch

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

approximate-nearest-neighbor-search clustering database faiss full-text-search fuzzy-search image-search kann nearest-neighbor-search recommender-system search search-engine semantic-search simd similarity-search text-search vector-search webassembly

Last synced: 01 Nov 2024

https://github.com/jbellis/jvector

JVector: the most advanced embedded vector search engine

ann java knn machine-learning search-engine similarity-search vector-search

Last synced: 19 Dec 2024

https://github.com/sherlockchou86/videopipe

A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )

ai behaviour-analysis cv deep-learning deepstream face-recognition feature-extraction gstreamer image-classification image-enhancement image-segmentation license-plate-recognition object-detection opencv reid similarity-search video-analysis video-processing

Last synced: 19 Dec 2024

https://github.com/sherlockchou86/VideoPipe

A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )

ai behaviour-analysis cv deep-learning deepstream face-recognition feature-extraction gstreamer image-classification image-enhancement image-segmentation license-plate-recognition object-detection opencv reid similarity-search video-analysis video-processing

Last synced: 27 Oct 2024

https://github.com/ashvardanian/simsimd

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐

arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search

Last synced: 17 Dec 2024

https://github.com/tantaraio/voy

🕸️🦀 A WASM vector similarity search written in Rust

k-d-tree nearest-neighbor-search rust similarity-search vector-search wasm wasm-pack webassembly

Last synced: 02 Nov 2024

https://github.com/myscale/MyScaleDB

An open-source, high-performance SQL vector database built on ClickHouse.

ann big-data embedding image-search llm myscaledb rag search-engine similarity-search sql sql-vector unstructured-analytics vector-search vectordb

Last synced: 24 Oct 2024

https://github.com/shibing624/similarities

Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包,支持亿级数据文搜文、文搜图、图搜图,python3开发,开箱即用。

bm25 deep-learning faiss image-search image-similarity matching nlp pytorch search-engine similarity similarity-search text-matching

Last synced: 19 Dec 2024

https://github.com/ashvardanian/SimSIMD

Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, and C, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐

arm-neon arm-sve assembly avx2 avx512 blas blas-libraries distance-calculation distance-measures float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search

Last synced: 28 Oct 2024

https://github.com/ekzhu/setsimilaritysearch

All-pair set similarity search on millions of sets in Python and on a laptop

all-pairs set-similarity-search similarity-search

Last synced: 15 Dec 2024

https://github.com/hhblaze/dbreeze

C# .NET NOSQL ( key value store embedded ) ACID multi-paradigm database management system.

acid android c-sharp clustering database dotnet embedded key net netcore netstandard nosql search search-engine similarity-search text transaction value vector-database xamarin

Last synced: 14 Dec 2024

https://github.com/chunelfeng/caiss

一款简单好用的 跨平台/多语言的 相似向量/相似词/相似句 高性能检索引擎。欢迎star & fork。Build together! Power another !

ai ann chatbot deep-learning faiss hnsw mrpt nlp search-engine similarity-search

Last synced: 15 Dec 2024

https://github.com/hhblaze/DBreeze

C# .NET NOSQL ( key value store embedded ) ACID multi-paradigm database management system.

acid android c-sharp clustering database dotnet embedded key net netcore netstandard nosql search search-engine similarity-search text transaction value vector-database xamarin

Last synced: 26 Oct 2024

https://github.com/arcadedata/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 19 Dec 2024

https://github.com/ArcadeData/arcadedb

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

arcadedb database dbms distributed docker document embedded graph k8s key-value kubernetes multi-model orientdb search-engine similarity-search time-series vector-database vector-search

Last synced: 10 Nov 2024

https://github.com/cluebenchmark/kgclue

KgCLUE: 大规模中文开源知识图谱问答

kbqa knowledge-graph ner qa similarity-search

Last synced: 16 Dec 2024

https://github.com/CLUEbenchmark/KgCLUE

KgCLUE: 大规模中文开源知识图谱问答

kbqa knowledge-graph ner qa similarity-search

Last synced: 16 Nov 2024

https://github.com/alexklibisz/elastiknn

Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.

elasticsearch elasticsearch-plugin embeddings locality-sensitive-hashing lucene nearest-neighbor-search neural-search semantic-search similarity-search

Last synced: 19 Dec 2024

https://github.com/oasysai/oasysdb

An embedded vector database designed to run on edge devices. Lightweight and fast with HNSW indexing algorithm.

approximate-nearest-neighbors edge-ai edge-computing hnsw key-value-database open-source rest-api similarity-search vector-database vector-search

Last synced: 06 Nov 2024

https://github.com/vioshyvo/mrpt

Fast and lightweight header-only C++ library (with Python bindings) for approximate nearest neighbor search

approximate-nearest-neighbor-search k-nn knn-search mrpt nearest-neighbor-search random-projection similarity-search

Last synced: 02 Nov 2024

https://github.com/postgrespro/imgsmlr

Similar images search for PostgreSQL

gist image-processing postgres postgresql similarity-search

Last synced: 17 Nov 2024

https://github.com/fzliu/radient

Radient turns many data types (not just text) into vectors for similarity search, clustering, regression analysis, and more.

audio embeddings etl fraud-detection graphs image-search images milvus molecular-search molecules recommender-system retrieval-augmented-generation semantic-search similarity-search text unstructured-data-etl vector-database vectors

Last synced: 16 Dec 2024

https://github.com/dbaranchuk/ivf-hnsw

Code for ECCV2018 paper: Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors

knn-search similarity-search

Last synced: 20 Dec 2024

https://github.com/pinecone-io/pinecone-ts-client

The official TypeScript/Node client for the Pinecone vector database

llm pinecone semantic-search similarity-search vector-database

Last synced: 19 Dec 2024

https://github.com/antgroup/vsag

vsag is a vector indexing library used for similarity search.

ann indexing-library similarity-search vector vectordb

Last synced: 15 Dec 2024

https://github.com/guenthermi/postgres-word2vec

utils to use word embedding models like word2vec vectors in a PostgreSQL database

inverted-index knn-search postgresql product-quantization similarity-search word-embeddings word2vec

Last synced: 08 Nov 2024

https://github.com/chembl/FPSim2

Simple package for fast molecular similarity searches

cheminformatics chemistry gpu python similarity-search

Last synced: 16 Nov 2024

https://github.com/matrix-profile-foundation/mass-ts

MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.

algorithms data-mining datascience distance-matrix euclidean-distances similarity-search time-series time-series-analysis time-series-data-mining

Last synced: 09 Nov 2024

https://github.com/huichen/wordvector_be

Web服务:使用腾讯 800 万词向量模型和 spotify annoy 引擎得到相似关键词

annoy golang http-server nearest-neighbor-search similarity-search wordembeddings wordvectors

Last synced: 15 Nov 2024

https://github.com/scrubbbbs/cbird

Command-line program for managing a media collection, with focus on Content-Based Image Retrieval (Computer Vision) methods for finding duplicates.

command-line-interface computer-vision content-based-image-retrieval duplicate-detection duplicate-files duplicates ffmpeg opencv qt6 similarity-search

Last synced: 16 Dec 2024

https://github.com/jasonjmcghee/portable-hnsw

What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?

hnsw knn portable similarity-search

Last synced: 06 Dec 2024

https://github.com/brunoarine/org-similarity

Emacs package that helps org-mode users (re)discover similar documents

bm25 elisp emacs org-mode org-roam python semantic-similarity similarity-search tf-idf

Last synced: 16 Nov 2024

https://github.com/daac-tools/find-simdoc

Finding all pairs of similar documents time- and memory-efficiently

all-pairs document-search rust similarity-search

Last synced: 29 Nov 2024

https://github.com/qwertyforce/scenery

photo gallery with extended search capabilities

photo-gallery reverse-image-search similarity-search

Last synced: 06 Dec 2024

https://github.com/sankalp1999/semantweet-search

Vector search over tweets from the tweet archive using OpenAI embeddings and LanceDB

embeddings image-search lancedb openai python semantic-search similarity-search vector-search

Last synced: 02 Nov 2024

https://github.com/elmiraghorbani/chatgpt-long-term-memory

The ChatGPT Long Term Memory package is a powerful tool designed to empower your projects with the ability to handle a large number of simultaneous users and external sources.

chatbot chatgpt chatgpt-api context datastore embedding-similarity embeddings gpt-3 gpt-35-turbo llama-index long-term-memory memory openai python redis similarity-search text-retrieval text-summarization tiktoken vector

Last synced: 11 Nov 2024

https://github.com/rinx/alvd

alvd = A Lightweight Vald. A lightweight distributed vector search engine works without K8s.

approximate-nearest-neighbor-search nearest-neighbor-search ngt similarity-search vald vector-search vector-search-engine

Last synced: 23 Nov 2024

https://github.com/developermindset-com/faiss-mobile

FAISS library compiled for iOS, macOS, tvOS, watchOS

c cpp embeddings faiss ios knn macos neighbor-search search similarity-search tvos vector watchos

Last synced: 10 Nov 2024

https://github.com/vitrivr/cottontaildb

Cottontail DB is a column store vector database aimed at multimedia retrieval. It allows for classical boolean as well as vector-space retrieval (nearest neighbour search) used in similarity search using a unified data and query model.

cottontail-db cottontaildb database embedding-similarity knearest-neighbours-lookup multimedia multimedia-retrieval retrieval similarity-search vector-database vector-search-engine vector-space-retrieval

Last synced: 19 Dec 2024

https://github.com/barakmich/bbqvec

Scalable Embedded Vector Index for Go and Rust

aknn ann golang knn rust similarity-search vector vector-database vector-search

Last synced: 06 Nov 2024

https://github.com/ekzhu/go-set-similarity-search

Efficient set similarity search algorithms implemented in Go

all-pairs set-similarity-search similarity-search

Last synced: 28 Oct 2024

https://github.com/senior-sigan/kawaiisearch

An application to find similar pictures based on the VGG16 and kNN

fashion image-classification keras knn machine-learning picture similarity-search vgg16

Last synced: 16 Nov 2024

https://github.com/anush008/chromadb-rs

Rust client library for ChromaDB

chromadb rust-lang similarity-search vector-database

Last synced: 20 Dec 2024

https://github.com/AlbertSuarez/searchly

🎶 Song similarity search API based on lyrics

api flask lyrics nmslib python redoc similarity-search song word2vec

Last synced: 26 Oct 2024

https://github.com/code-kern-ai/refinery-sample-projects

Containing examples of projects you can use to test refinery. Please select the use case from the branches.

chatbot example exercise rasa sentiment-analysis similarity-search tutorial

Last synced: 10 Nov 2024

https://github.com/code-kern-ai/embedders

With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.

classification machine-learning named-entity-recognition natural-language-processing ner nlp python representation-learning similarity-search

Last synced: 10 Nov 2024

https://github.com/coderham/videosimilarity

Capstone Project for MS in Data Science @ University of Washington - Video Similarity Search

computer-vision deep-learning images similarity-search videos

Last synced: 19 Dec 2024

https://github.com/CoderHam/VideoSimilarity

Capstone Project for MS in Data Science @ University of Washington - Video Similarity Search

computer-vision deep-learning images similarity-search videos

Last synced: 07 Nov 2024

https://github.com/brunoarine/findlike

Command-line tool that finds lexically similar documents in relation to a reference text file or ad-hoc query

bm25 nlp similarity-search tfidf

Last synced: 25 Nov 2024

https://github.com/semasuka/talk-to-your-pdf

An application that enable the users to upload PDF files and ask questions regarding their content using Retrieval Augmented Generation (RAG)

ai cosine-similarity embeddings google-drive-api gpt-4 intent-classification llm openai openai-api pgvector postgresql python rag similarity-search sqlalchemy streamlit text-moderation vector-database vector-search

Last synced: 11 Nov 2024

https://github.com/AlbertSuarez/donework

📚 Text generator using ML and Search Similarity

flask gpt-2 latex machine-learning python similarity-search text-generation

Last synced: 26 Oct 2024

https://github.com/patrickfrank1/chesspos

Embedding based chess position search and embedding learning for chess positions

chess-database embeddings faiss metric-learning similarity-search tensorflow triplet-loss

Last synced: 14 Oct 2024

https://github.com/infinisil/soph

Efficiently import pictures while handling duplicates gracefully

blockhash deduplication haskell perceptual-hashing pictures-organizer similarity-search

Last synced: 28 Oct 2024

https://github.com/AlbertSuarez/casescan

🔍 Clinical cases search by similarity specialized in Covid-19

nlp python react similarity-search

Last synced: 26 Oct 2024

https://github.com/karolzak/images-vector-search

Simple implementation of search for visually similar images using deep learning and vector search. It's based on pretrained ImageNet weights so it doesnt require any additional training

cnn deep-learning deep-neural-networks embedding-models image image-processing images keras keras-tensorflow neural-networks resnet resnet-152 resnet-50 similarity-detection similarity-search transfer-learning vector-search vgg vgg19 visual-search

Last synced: 23 Oct 2024

https://github.com/kampersanda/dyft

C++17 Implementation of Dynamic Filter Trie

cpp17 data-structures hamming-distace index similarity-search

Last synced: 29 Nov 2024

https://github.com/longmaoteamtf/ant

Open-source vector database built to embedding similarity search

faiss k-nearest-neighbours similarity-search vector-database

Last synced: 28 Nov 2024

https://github.com/zazaho/simimg

Python/TkInter program to find and display "similar" images. This is the development site, to install use 'pip install simimg'.

gui imageviewer python3 similarity-search

Last synced: 12 Oct 2024

https://github.com/pinecone-io/pinecone-rust-client

The official Rust client for the Pinecone vector database

llm pinecone semantic-search similarity-search vector-database

Last synced: 12 Nov 2024

https://github.com/agrover112/siamesenet-search

MNIST digits image similarity search by Indexing with Annoy and using trained embeddings from a Siamese Net with Triplet Loss .

annoy image-similarity indexing mnist siamese-architecture siamese-network siamese-neural-network similarity-search tensorflow-examples tensorflow2

Last synced: 10 Oct 2024

https://github.com/mxmlnkn/cppbktree

Python BK-Tree module based on a C++ implementation

hamming-distance python python3 similarity-search tree

Last synced: 13 Oct 2024

https://github.com/DidierRLopes/similarstocks

This repository will hold similar stocks based on their description through NLP models

finance nlp similarity-search stocks

Last synced: 01 Nov 2024

https://github.com/didierrlopes/similarstocks

This repository will hold similar stocks based on their description through NLP models

finance nlp similarity-search stocks

Last synced: 13 Oct 2024

https://github.com/bhavnicksm/pokemon-card-explorer

Who's that Pokemon (card)? Search over more than 10K Pokemon cards to find out the coolest one yet! ✨

cohere openai openai-api pinecone pokemon python3 reranking retrieval similarity-search streamlit vectordb

Last synced: 03 Dec 2024

https://github.com/podgorskiy/hashranking

Fast procedures for hamming distance computation, ranking, mAP computation. Fore deep-learning research in hashing and retrieval.

deep-hashing deep-learning hashing neural-network ranking retrieval similarity-search

Last synced: 12 Nov 2024

https://github.com/georgesittas/similarity-search

Implementation and survey of similarity search methods that rely on dimensionality reduction (e.g. LSH), D-dimensional vector clustering

clustering k-means-clustering k-nearest-neighbours lsh randomized-projection similarity-search

Last synced: 13 Oct 2024

https://github.com/anirudhdagar/paperviz

AI conference papers search space visualized based on similarity.

bert d3js deep-learning machine-learning nlp papers pytorch similarity-search visualization

Last synced: 20 Dec 2024