Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-semantic-search
A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
https://github.com/Agrover112/awesome-semantic-search
-
Libraries and Tools
-
2023
- weaviate
- embeddinghub
- fastText
- SBERT
- Relevance AI - Vector Platform From Experimentation To Deployment
- Jina.AI
- pinecone
- BEIR: Benchmarking IR
- RELiC: Retrieving Evidence for Literary Claims Dataset
- Which Frame?
- lexica.art
- milvus
- NeuroNLP++
- same.energy
- ann benchmarks
- scaNN
- REALM
- opensemanticsearch.org
- GPT3 Semantic Search
- Universal Sentence Encoder
- LaBSE
- ELECTRA
- LASER
- SentEval Toolkit
- ranx
- matchzoo-py
- deep_text_matching
- emoji semantic search
- PySerini
- BERTSimilarity
- natural-language-youtube-search
- annoy
- pynndescent
- nsg
- FALCONN
- redis HNSW
- autofaiss
- DPR
- rank_BM25
- FlashRank
- semantic-search-through-wikipedia-with-weaviate
-
-
Papers
-
2010
-
2015
-
2016
- Bag of Tricks for Efficient Text Classification
- Enriching Word Vectors with Subword Information
- Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs
- On Approximately Searching for Similar Word Embeddings
- Learning Distributed Representations of Sentences from Unlabelled Data
- Approximate Nearest Neighbor Search on High Dimensional Data --- Experiments, Analyses, and Improvement
-
2017
-
2018
- Universal Sentence Encoder
- Learning Semantic Textual Similarity from Conversations
- Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
- Optimization of Indexing Based on k-Nearest Neighbor Graph for Proximity Search in High-dimensional Data
- The Case for Learned Index Structures
- Google AI Blog: Advances in Semantic Textual Similarity
-
2019
- LASER: Language Agnostic Sentence Representations
- Document Expansion by Query Prediction
- Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
- Multi-Stage Document Ranking with BERT
- Latent Retrieval for Weakly Supervised Open Domain Question Answering
- End-to-End Open-Domain Question Answering with BERTserini
- BioBERT: a pre-trained biomedical language representation model for biomedical text mining
- Analyzing and Improving Representations with the Soft Nearest Neighbor Loss
-
2020
- Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned
- Passage Re-ranking with BERT
- CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization
- LaBSE: Language-agnostic BERT Sentence Embedding
- Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset
- DeText: A deep NLP framework for intelligent text understanding
- Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation
- Pretrained Transformers for Text Ranking: BERT and Beyond
- REALM: Retrieval-Augmented Language Model Pre-Training
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
- Improving Deep Learning For Airbnb Search
- Managing Diversity in Airbnb Search
- Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
- Unsupervised Image Style Embeddings for Retrieval and Recognition Tasks
- DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations
-
2021
- Hybrid approach for semantic similarity calculation between Tamil words
- Augmented SBERT
- BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
- Compatibility-aware Heterogeneous Visual Search
- Learning Personal Style from Few Examples
- TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning
- A Survey of Transformers
- SPLADE: Sparse Lexical and Expansion Model for First Stage Ranking
- High Quality Related Search Query Suggestions using Deep Reinforcement Learning
- Embedding-based Product Retrieval in Taobao Search
- TPRM: A Topic-based Personalized Ranking Model for Web Search
- mMARCO: A Multilingual Version of MS MARCO Passage Ranking Dataset
- Database Reasoning Over Text
- How Does Adversarial Fine-Tuning Benefit BERT?
- Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
- Primer: Searching for Efficient Transformers for Language Modeling
- SimCSE: Simple Contrastive Learning of Sentence Embeddings
- Compositional Attention: Disentangling Search and Retrieval
- SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search
- GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval
- Generative Search Engines: Initial Experiments
- Rethinking Search: Making Domain Experts out of Dilettantes
- WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach
-
2022
- Text and Code Embeddings by Contrastive Pre-Training
- RELIC: Retrieving Evidence for Literary Claims
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations
- SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
- An Analysis of Fusion Functions for Hybrid Retrieval
- Out-of-distribution Detection with Deep Nearest Neighbors
- ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition
- Analyzing Acoustic Word Embeddings From Pre-Trained Self-Supervised Speech Models
- Rethinking with Retrieval: Faithful Large Language Model Inference
- Precise Zero-Shot Dense Retrieval without Relevance Labels
- Transformer Memory as a Differentiable Search Index
-
2023
-
-
Articles
-
2023
- Introducing the hybrid index to enable keyword-aware semantic search
- Argilla Semantic Search
- Simplify Search with Multilingual Embedding Models
- Tackling Semantic Search
- Semantic search in Azure Cognitive Search
- How we used semantic search to make our search 10x smarter
- Stanford AI Blog: Building Scalable, Explainable, and Adaptive NLP Models with Retrieval
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Some observations about similarity search thresholds
- Near Duplicate Image Search using Locality Sensitive Hashing
- Comprehensive Guide To Approximate Nearest Neighbors Algorithms
- Free Course on Vector Similarity Search and Faiss
- Co:here's Multilingual Text Understanding Model
-
-
Datasets
Categories
Keywords
- information-retrieval (5)
- approximate-nearest-neighbor-search (4)
- nearest-neighbor-search (4)
- deep-learning (3)
- nlp (3)
- python (3)
- tensorflow (2)
- pytorch (2)
- bert (2)
- locality-sensitive-hashing (2)
- hnsw (2)
- ranking (2)
- hybrid-search (2)
- semantic-search (2)
- vector-database (2)
- vector-search (2)
- natural-language-processing (1)
- neural-network (1)
- search-engine (1)
- text (1)
- text-matching (1)
- recommender-system (1)
- semantic (1)
- similarity (1)
- clip (1)
- computer-vision (1)
- machine-learning (1)
- search (1)
- youtube (1)
- neural-search (1)
- c-plus-plus (1)
- golang (1)
- mlops (1)
- lua (1)
- weaviate (1)
- vectors (1)
- vector-search-engine (1)
- comparison (1)
- data-fusion (1)
- evaluation (1)
- evaluation-metrics (1)
- similarity-search (1)
- information-retrieval-evaluation (1)
- information-retrieval-metrics (1)
- metasearch (1)
- numba (1)
- semantic-search-engine (1)
- rank-fusion (1)
- ranking-metrics (1)
- recommender-systems (1)