Projects in Awesome Lists tagged with sbert
A curated list of projects in awesome lists tagged with sbert .
https://github.com/embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
benchmark bitext-mining clustering information-retrieval low-resource-nlp mteb multilingual-nlp multimodal neural-search reranking retrieval sbert semantic-search sentence-transformers sts text-classification text-embedding
Last synced: 19 Apr 2026
https://github.com/beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
benchmark bert colbert dataset deep-learning dpr elasticsearch information-retrieval llm nlp passage-retrieval pytorch question-generation rag retrieval retrieval-models sbert sentence-transformers zero-shot-retrieval
Last synced: 14 May 2025
https://github.com/sudharsan13296/getting-started-with-google-bert
Build and train state-of-the-art natural language processing models using BERT
albert bart bert bertsum clinical-bert distilbert electra huggingface-transformers nlp pytorch roberta sbert sentence-bert spanbert tinybert transformer videobert
Last synced: 04 Oct 2025
https://github.com/yuanzhoulvpi2017/documentsearch
基于sentence transformers和chatglm实现的文档搜索工具
Last synced: 09 Jan 2026
https://github.com/thiswillbeyourgithub/anna_anki_neuronal_appendix
Using machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity
ai anki bert clustering doc2vec embedding flashcards kmeans latent machinelearning neighbourhood nlp pca sbert scheduler sementics sentence-embeddings umap
Last synced: 10 Apr 2025
https://github.com/wri-dssg-omdena/policy-data-analyzer
Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
active-learning bert data-science document-classification environmental huggingface incentives landscape-restoration lda machine-learning nlp policy sbert scraping scrapy sentence-transformers spyder text-classification topic transformers
Last synced: 27 Mar 2025
https://github.com/ukplab/useb
Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.
benchmark domain-adaptation information-retrieval nlp paraphrase-identification pytorch reranking sbert sentence-embeddings transformer unsupervised-learning
Last synced: 09 Aug 2025
https://github.com/IndexStorm/git-rec-back
Backend code for GitHub Recommendation Extension
ai faiss flask machine-learning sbert
Last synced: 14 Jul 2025
https://github.com/astrowonk/emoji_finder
emoji_finder
emoji python pytorch sbert semantic-search sentence-transformers sqlite
Last synced: 06 Mar 2026
https://github.com/ahmedshahriar/twittercelebritymatcher
Match celebrity users with their respective tweets by making use of Semantic Textual Similarity on over 900+ celebrity users' 2.5 million+ scraped tweets utilizing SBERT, streamlit, tweepy and FastAPI
fastapi multilingual-bert mypy pydantic python310 python39 pytorch pytorch-gpu rest-api sbert sentence-transformers streamlit streamlit-webapp tweepy twitter-scraping web-scraping
Last synced: 22 Apr 2025
https://github.com/dwyl/rag-elixir-doc
Livebook to run a Phoenix_LiveView documentation Retrieval-Augmented Generation (RAG) enhanced LLM
cross-encoder elixir embeddings livebook llm-inference rag retrieval-augmented-generation sbert
Last synced: 11 Oct 2025
https://github.com/susheel-1999/sentence_similarity
Package to calculate the similarity score between two sentences
bert natural-language-processing nlp sbert sentence-embeddings sentence-similarity sentence-transformer sentence-transformers transformers
Last synced: 27 Oct 2025
https://github.com/droyed/mansh
Linux manual search shell with natural language
console natural-language natural-language-processing sbert semantic-search semantic-similarity shell
Last synced: 17 Jan 2026
https://github.com/seungjaelim/k-hyunmoogpt
[K-Data Science Hackaton 3rd Award] Development and use of Korean large-scale generative language models
chromadb fine-tuning gradio koalpaca llm llm-inference rag sbert
Last synced: 13 Oct 2025
https://github.com/rid17pawar/semantic-search-model-experiments
Experiments in the field of Semantic Search using BM-25 Algorithm, Mean of Word Vectors, along with state of the art Transformer based models namely USE and SBERT.
bm25 fasttext fasttext-embeddings glove glove-embeddings information-retrieval sbert semantic-search universal-sentence-encoder word2vec word2vec-embeddinngs
Last synced: 17 Oct 2025
https://github.com/mymusise/sentence-transformers-tf
sentence-transformers with tensorflow
sbert sentence-embeddings sentence-transformers tensorflow transformer
Last synced: 29 Apr 2026
https://github.com/kmock930/natural-language-processing
This project contains codes and paperwork based on the course CSI5386 at University of Ottawa (delivered by Professor Dr. Diana Inkpen).
bert bigram-modeling corpus-linguistics distilbert fasttext-embeddings glove-embeddings hugging-face-transformers large-language-models lemmatizer logistic-regression macro-micro-f1 natural-language-processing paraphrase-minilm pos-tagging roberta-large sbert stopwords text-embedding-ada-002 universal-sentence-encoder word-tokenizer
Last synced: 12 Jul 2025
https://github.com/louisbrulenaudet/tax-retrieval-benchmark
An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.
benchmark droit embeddings fiscal fiscalite information-retrieval mteb rag retrieval retrieval-augmented-generation sbert semantic-search sentence-embeddings sentence-transformers stp tax taxation
Last synced: 05 Mar 2026
https://github.com/sartim/search-engine
Search engine helper using Sentence-BERT and elasticsearch to return results based on semantic similarity
Last synced: 07 May 2026
https://github.com/ulf1/simiscore-semantic
An ML API to compute semantic similarity scores between sentence examples.
ml-api sbert sentence-bert similarity-score
Last synced: 03 Apr 2025
https://github.com/maettuu/24hs-essentials-in-text-and-speech-processing
Repository for the course Essentials in Text and Speech Processing Fall 2024
amazon-books-reviews approximate-nearest-neighbors backend content-based-recommendation python recommendation-system sbert tf-idf-vectorization
Last synced: 20 Jul 2025
https://github.com/rahulvictor12/resume-scoring-using-nlp-and-sbert-transformer
ResumeScorer is an intelligent resume screening tool that leverages Natural Language Processing (NLP) techniques to assess the relevance of resumes against a given job description. The core idea is to reduce manual screening time and improve candidate-job matching through semantic similarity.
cosine-similarity embeddings-extraction lemmitization nlp-keywords-extraction nlp-machine-learning pdfplumber sbert
Last synced: 14 May 2025
https://github.com/jfmdev/simple_chatbots
Simple Chatbots implemented with Machine Learning
chatbot ia machine-learning python sbert tensorflow
Last synced: 27 Jul 2025
https://github.com/alikhalajii/text-classification-life-sciences
Text classification of Life Science apps
data-analysis data-science datasets feature-importance jupyter-notebook pandas sbert scikit-learn word2vec
Last synced: 05 May 2026
https://github.com/aleedm/sick-summarization
This repository explores enhancing dialogue summarization with commonsense knowledge through the SICK framework, evaluating models on dialogue datasets to assess commonsense's impact on summarization quality.
bart-model comet commonsense-knowledge dialogue-summarization natural-language-processing pegasus-model sbert t5-model
Last synced: 18 Mar 2025
https://github.com/anurima-saha/topic_modelling_lda_hdbscan
Using unsupervised learning to group reddit text and identify major conspiracy theories using NLP, LDA, spacy, SVD, SBert embedding and HDBSCAN.
hdbscan latent-dirichlet-allocation natural-language-processing sbert spacy topic-modeling unsupervised-learning
Last synced: 26 Feb 2025
https://github.com/kwokhing/demo-on-automated-fact-checking-using-s-bert
In this demo, we illustrate the the possibility of using Semantic Search + Recognising Textual Entailment with Gradio to build an automated fact checking tool
deberta gradio gradio-interface msmarco natural-language-inference recognizing-textual-entailment sbert sbert-implementation semantic-search semantic-similarity sentence-embeddings sentence-transformers
Last synced: 25 Mar 2025
https://github.com/abdeldjalilchafai/nasa-asrs-fault-assistant-pt
Semantic fault assistant using SBERT for aviation safety reports.
cosine-similarity neural-network nlp pretrained-models research-project sbert semantic-search semantic-segmentation sematicsearch sentence-similarity sentence-transformers transformer
Last synced: 12 Jun 2025
https://github.com/ambidextrous9/mtp-news-article-based-question-answering-system
MTP-FlanT5-SBERT-Model-for-NewsQA-and-Teacher-Student-Model
bm25 flan-t5 language-model newsqa nlp qa question-answering sbert transformer
Last synced: 10 Oct 2025
https://github.com/zzarif/ai-detector
Detect AI generated coding answers
cosine-similarity embeddings fine-tuning flask gpt4 huggingface openai regression sbert sentence-transformers
Last synced: 29 Apr 2026