Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with sentence-embeddings

A curated list of projects in awesome lists tagged with sentence-embeddings .

https://github.com/MaartenGr/BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

bert ldavis machine-learning nlp sentence-embeddings topic topic-modeling topic-modelling topic-models transformers

Last synced: 01 Aug 2024

https://github.com/maartengr/bertopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

bert ldavis machine-learning nlp sentence-embeddings topic topic-modeling topic-modelling topic-models transformers

Last synced: 29 Sep 2024

https://github.com/shibing624/text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

embeddings nlp sentence-embeddings similarity text-similarity text2vec word2vec

Last synced: 03 Oct 2024

https://github.com/princeton-nlp/simcse

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

nlp sentence-embeddings

Last synced: 30 Sep 2024

https://github.com/princeton-nlp/SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

nlp sentence-embeddings

Last synced: 01 Aug 2024

https://github.com/seanlee97/xmnlp

xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能

lexical-analysis ner nlp pinyin postagging radical segmentation sentence-embeddings sentence-similarity sentiment-analysis spell-checker

Last synced: 30 Sep 2024

https://github.com/SeanLee97/xmnlp

xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能

lexical-analysis ner nlp pinyin postagging radical segmentation sentence-embeddings sentence-similarity sentiment-analysis spell-checker

Last synced: 31 Jul 2024

https://github.com/JohnGiorgi/DeCLUTR

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!

allennlp contrastive-learning metric-learning natural-language-processing pytorch representation-learning self-supervised-learning semantic-search semantic-text-similarity sentence-embeddings sentence-similarity transformers

Last synced: 01 Aug 2024

https://github.com/voidism/DiffCSE

Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"

contrastive-learning representation-learning self-supervised-learning sentence-embeddings sentence-similarity sentence-transformers

Last synced: 03 Aug 2024

https://github.com/tharindudr/simple-sentence-similarity

Exploring the simple sentence similarity measurements using word embeddings

elmo fasttext glove ipynb python sentence-embeddings sentence-similarity wmd word-embeddings word2vec

Last synced: 27 Sep 2024

https://github.com/nikolamilosevic86/local-genai-search

Local-GenAI-Search is a generative search engine based on Llama 3, langchain and qdrant that answers questions based on your local files

generative-ai langchain large-language-models llama3 local msmarco python3 qdrant-client search-engine sentence-embeddings sentence-transformers

Last synced: 27 Sep 2024

https://github.com/hellojwilde/energetic-ai

EnergeticAI is TensorFlow.js, optimized for serverless environments, with fast cold-start, small module size, and pre-trained models.

ai artificial-intelligence embeddings embeddings-trained machine-learning sentence-embeddings tensorflow tensorflowjs

Last synced: 01 Aug 2024

https://github.com/luozhouyang/deepse

Sentence Embeddings using Deep Nerual Networks in PRODUCTION!

bert keras sentence-embeddings simcse

Last synced: 30 Sep 2024

https://github.com/talmago/simple-but-tough-to-beat-examples

Bunch of examples of a "Simple but tough to beat baseline for sentence embeddings" in classification tasks

fake-news-classification fasttext fasttext-python imdb-dataset machine-learning nlp sentence-embeddings sentence2vec w2v word-embeddings word2vec

Last synced: 02 Oct 2024

https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system

The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....

cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer

Last synced: 30 Sep 2024

https://github.com/ryandsilva/clef-2023-joker

Code Repository for AKRaNLU @ CLEF JOKER 2023: Using Sentence Embeddings and Multilingual Models to Detect and Interpret Wordplay

computational-humor puns sentence-embeddings sentence-transformers transformers wordplay xlm-roberta

Last synced: 02 Oct 2024

https://github.com/bilalhameed248/faq-finder-using-rag

A RAG (Retrieval augmented generation)-based FAQ Chat-Bot, designed to operate within an organization's internal domain. - Jul 2023 - Oct 2023

chat-application chatbot faqbot faqs llama3 perplexity perplexity-ai perplexity-api query-builder question-answering rag sentence-embeddings sentence-transformers t5-base t5-model

Last synced: 01 Oct 2024

https://github.com/salman-khan-mohammed/q-a-system

The "Codebasics Q&A" project is an end-to-end Question and Answer (Q&A) system developed for Codebasics, an e-learning company specializing in data-related courses and bootcamps. The system is designed to assist students who typically ask questions via Discord or email by providing instant, automated responses.

faiss googlepalm huggingface langchain sentence-embeddings streamlit

Last synced: 26 Sep 2024