awesome-semantic-search
A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
https://github.com/Agrover112/awesome-semantic-search
Last synced: 15 minutes ago
JSON representation
-
Papers
-
2016
- Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs
- Bag of Tricks for Efficient Text Classification
- Enriching Word Vectors with Subword Information
- On Approximately Searching for Similar Word Embeddings
- Learning Distributed Representations of Sentences from Unlabelled Data
- Approximate Nearest Neighbor Search on High Dimensional Data --- Experiments, Analyses, and Improvement
- On Approximately Searching for Similar Word Embeddings
-
2010
-
2015
-
2017
- Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
- Semantic Textual Similarity For Hindi
- Efficient Natural Language Response Suggestion for Smart Reply
- Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
- Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
-
2018
- Universal Sentence Encoder
- Learning Semantic Textual Similarity from Conversations
- Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
- Optimization of Indexing Based on k-Nearest Neighbor Graph for Proximity Search in High-dimensional Data
- Google AI Blog: Advances in Semantic Textual Similarity
- Universal Sentence Encoder
- Learning Semantic Textual Similarity from Conversations
- The Case for Learned Index Structures
- Google AI Blog: Advances in Semantic Textual Similarity
-
2019
- LASER: Language Agnostic Sentence Representations
- Document Expansion by Query Prediction
- Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
- Multi-Stage Document Ranking with BERT
- Latent Retrieval for Weakly Supervised Open Domain Question Answering
- End-to-End Open-Domain Question Answering with BERTserini
- BioBERT: a pre-trained biomedical language representation model for biomedical text mining
- Analyzing and Improving Representations with the Soft Nearest Neighbor Loss
- Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
- Analyzing and Improving Representations with the Soft Nearest Neighbor Loss
- End-to-End Open-Domain Question Answering with BERTserini
-
2020
- Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned
- PASSAGE RE-RANKING WITH BERT
- CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization
- LaBSE:Language-agnostic BERT Sentence Embedding
- Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset
- DeText: A deep NLP framework for intelligent text understanding
- Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation
- Pretrained Transformers for Text Ranking: BERT and Beyond
- REALM: Retrieval-Augmented Language Model Pre-Training
- ELECTRA: PRE-TRAINING TEXT ENCODERS AS DISCRIMINATORS RATHER THAN GENERATORS
- Managing Diversity in Airbnb Search
- Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
- Unsupervised Image Style Embeddings for Retrieval and Recognition Tasks
- DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations
- Improving Deep Learning For Airbnb Search
- CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization
- Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation
- PASSAGE RE-RANKING WITH BERT
- DeText: A deep NLP framework for intelligent text understanding
-
2021
- Hybrid approach for semantic similarity calculation between Tamil words
- Augmented SBERT
- BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
- Compatibility-aware Heterogeneous Visual Search
- Learning Personal Style from Few Examples
- TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning
- A Survey of Transformers
- High Quality Related Search Query Suggestions using Deep Reinforcement Learning
- Embedding-based Product Retrieval in Taobao Search
- TPRM: A Topic-based Personalized Ranking Model for Web Search
- mMARCO: A Multilingual Version of MS MARCO Passage Ranking Dataset
- Database Reasoning Over Text
- How Does Adversarial Fine-Tuning Benefit BERT?
- Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
- Primer: Searching for Efficient Transformers for Language Modeling
- SimCSE: Simple Contrastive Learning of Sentence Embeddings
- Compositional Attention: Disentangling Search and Retrieval
- SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search
- GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval
- Generative Search Engines: Initial Experiments
- WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach
- Augmented SBERT
- Embedding-based Product Retrieval in Taobao Search
- SPLADE: Sparse Lexical and Expansion Model for First Stage Ranking
- Rethinking Search: Making Domain Experts out of Dilettantes
- Learning Personal Style from Few Examples
-
2022
- Text and Code Embeddings by Contrastive Pre-Training
- RELIC: Retrieving Evidence for Literary Claims
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations
- SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
- An Analysis of Fusion Functions for Hybrid Retrieval
- Out-of-distribution Detection with Deep Nearest Neighbors
- ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition
- Analyzing Acoustic Word Embeddings From Pre-Trained Self-Supervised Speech Models
- Rethinking with Retrieval: Faithful Large Language Model Inference
- Precise Zero-Shot Dense Retrieval without Relevance Labels
- Transformer Memory as a Differentiable Search Index
- Analyzing Acoustic Word Embeddings From Pre-Trained Self-Supervised Speech Models
- Precise Zero-Shot Dense Retrieval without Relevance Labels
-
2023
-
-
Libraries and Tools
-
2023
- lexica.art
- Jina.AI
- fastText
- SBERT
- Relevance AI - Vector Platform From Experimentation To Deployment
- pinecone
- RELiC: Retrieving Evidence for Literary Claims Dataset
- Which Frame?
- milvus
- NeuroNLP++
- same.energy
- scaNN
- REALM
- opensemanticsearch.org
- GPT3 Semantic Search
- Which Frame?
- Universal Sentence Encoder
- LaBSE
- txtai
- vespa
- annoy
- natural-language-youtube-search
- FALCONN
- LASER
- vearch
- autofaiss
- ranx
- pgANN
- redis HNSW
- PySerini
- pynndescent
- BERTSimilarity
- rank_BM25
- ELECTRA
- emoji semantic search
- vectorai
- DPR
- Tensorflow Similarity
- SentEval Toolkit
- matchzoo-py
- deep_text_matching
- nsg
- FlashRank
- HyperTag
- embeddinghub
- weaviate
- AquilaDb
- Which Frame?
- ann benchmarks
- nearPy
- Universal Sentence Encoder
- LaBSE
- Relevance AI - Vector Platform From Experimentation To Deployment
- Haystack
- BEIR :Benchmarking IR
- Which Frame?
- BERTSerini
- milvus
- NeuroNLP++
- semantic-search-through-wikipedia-with-weaviate
- searchy
- STripNet
-
-
Articles
-
2023
- Tackling Semantic Search
- Semantic search in Azure Cognitive Search
- How we used semantic search to make our search 10x smarter
- Stanford AI Blog : Building Scalable, Explainable, and Adaptive NLP Models with Retrieval
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Some observations about similarity search thresholds
- Near Duplicate Image Search using Locality Sensitive Hashing
- Comprehensive Guide To Approximate Nearest Neighbors Algorithms
- Introducing the hybrid index to enable keyword-aware semantic search
- Argilla Semantic Search
- Simplify Search woth Multilingual Embedding Models
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Free Course on Vector Similarity Search and Faiss
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Co:here's Multilingual Text Understanding Model
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Billion-scale semantic similarity search with FAISS+SBERT
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Building a semantic search engine with dual space word embeddings
- Tackling Semantic Search
- Semantic search in Azure Cognitive Search
- How we used semantic search to make our search 10x smarter
- Free Course on Vector Similarity Search and Faiss
- Comprehensive Guide To Approximate Nearest Neighbors Algorithms
- Introducing the hybrid index to enable keyword-aware semantic search
- Co:here's Multilingual Text Understanding Model
-
-
Datasets
Categories
Keywords
machine-learning
6
python
6
vector-search
5
nearest-neighbor-search
5
information-retrieval
5
tensorflow
5
deep-learning
5
nlp
5
approximate-nearest-neighbor-search
4
rag
4
semantic-search
4
search
4
search-engine
4
vector-database
3
retrieval-augmented-generation
3
embeddings
3
pytorch
3
bert
2
clustering
2
vectors
2
cosine-similarity
2
ranking
2
locality-sensitive-hashing
2
hybrid-search
2
transformers
2
knn
2
llm
2
nearest-neighbors
2
artificial-intelligence
2
ai
2
evaluation
1
evaluation-metrics
1
data-fusion
1
comparison
1
document-retrieval
1
cloud-native
1
information-retrieval-evaluation
1
information-retrieval-metrics
1
metasearch
1
numba
1
rank-fusion
1
ranking-metrics
1
recommender-systems
1
score-fusion
1
ann
1
language-model
1
large-language-models
1
sentence-embeddings
1
txtai
1
big-data
1