Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with sentence-embeddings
A curated list of projects in awesome lists tagged with sentence-embeddings .
https://github.com/neuml/txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
embeddings information-retrieval language-model large-language-models llm machine-learning neural-search nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search vector-search-engine
Last synced: 29 Sep 2024
https://neuml.github.io/txtai/
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
embeddings information-retrieval language-model large-language-models llm machine-learning neural-search nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search vector-search-engine
Last synced: 24 Sep 2024
https://github.com/flagopen/flagembedding
Retrieval and Retrieval-augmented LLMs
embeddings information-retrieval llm retrieval-augmented-generation sentence-embeddings text-semantic-similarity
Last synced: 29 Sep 2024
https://github.com/MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
bert ldavis machine-learning nlp sentence-embeddings topic topic-modeling topic-modelling topic-models transformers
Last synced: 01 Aug 2024
https://github.com/maartengr/bertopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
bert ldavis machine-learning nlp sentence-embeddings topic topic-modeling topic-modelling topic-models transformers
Last synced: 29 Sep 2024
https://github.com/FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
embeddings information-retrieval llm retrieval-augmented-generation sentence-embeddings text-semantic-similarity
Last synced: 31 Jul 2024
https://github.com/shibing624/text2vec
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
embeddings nlp sentence-embeddings similarity text-similarity text2vec word2vec
Last synced: 03 Oct 2024
https://github.com/princeton-nlp/simcse
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Last synced: 30 Sep 2024
https://github.com/princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Last synced: 01 Aug 2024
https://github.com/seanlee97/xmnlp
xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能
lexical-analysis ner nlp pinyin postagging radical segmentation sentence-embeddings sentence-similarity sentiment-analysis spell-checker
Last synced: 30 Sep 2024
https://github.com/SeanLee97/xmnlp
xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能
lexical-analysis ner nlp pinyin postagging radical segmentation sentence-embeddings sentence-similarity sentiment-analysis spell-checker
Last synced: 31 Jul 2024
https://github.com/johnsnowlabs/nlu
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
bert-embedding dependency-parsing entity-resolution language-detection lemmatizer named-entity-recognition natural-language-understanding nlu pandas sentence-embeddings sentiment-analysis sentiment-classifier seq2seq spell-checker streamlit t5 text-classification text-summarization text-translation transformers
Last synced: 26 Sep 2024
https://github.com/JohnSnowLabs/nlu
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
bert-embedding dependency-parsing entity-resolution language-detection lemmatizer named-entity-recognition natural-language-understanding nlu pandas sentence-embeddings sentiment-analysis sentiment-classifier seq2seq spell-checker streamlit t5 text-classification text-summarization text-translation transformers
Last synced: 05 Aug 2024
https://github.com/Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
gpt information-retrieval language-model large-language-models neural-search retrieval semantic-search sentence-embeddings sgpt text-embedding
Last synced: 03 Aug 2024
https://github.com/muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
gpt information-retrieval language-model large-language-models neural-search retrieval semantic-search sentence-embeddings sgpt text-embedding
Last synced: 02 Aug 2024
https://github.com/wangyuxinwhy/uniem
unified embedding model
embeddings huggingface nlp sentence-embeddings sentence-transformers
Last synced: 03 Aug 2024
https://github.com/oborchers/Fast_Sentence_Embeddings
Compute Sentence Embeddings Fast!
cython document-similarity embeddings fasttext fse gensim gensim-model maxpooling sentence-embeddings sentence-representation sentence-similarity sif swem usif word2vec-model wordembedding
Last synced: 02 Aug 2024
https://github.com/jina-ai/vectordb
A Python vector database you just need - no more, no less.
embedding-similarity neural-search sentence-embeddings vector-database vector-database-embedding vector-search
Last synced: 01 Aug 2024
https://github.com/kaushalshetty/Structured-Self-Attention
A Structured Self-attentive Sentence Embedding
attention attention-mechanism attention-model attention-weights classification deep-learning python3 pytorch self-attention self-attentive-rnn sentence-embeddings visualization
Last synced: 05 Aug 2024
https://github.com/seanlee97/angle
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
dense-retrieval embeddings information-retrieval llama llama2 llm mteb rag retrieval-augmented-generation semantic-similarity semantic-textual-similarity sentence-embedding sentence-embeddings sentence-vector sts stsbenchmark text-embedding text-similarity text-vector text2vec
Last synced: 27 Sep 2024
https://github.com/JohnGiorgi/DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
allennlp contrastive-learning metric-learning natural-language-processing pytorch representation-learning self-supervised-learning semantic-search semantic-text-similarity sentence-embeddings sentence-similarity transformers
Last synced: 01 Aug 2024
https://github.com/voidism/DiffCSE
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
contrastive-learning representation-learning self-supervised-learning sentence-embeddings sentence-similarity sentence-transformers
Last synced: 03 Aug 2024
https://github.com/geeks-of-data/knowledge-gpt
Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.
context embedding embedding-vectors gpt gpt3-turbo gpt4 huggingface huggingface-transformers information-extraction language-model llama llm natural-language-processing openai python question-answering scraper sentence-embeddings sentence-similarity vector-search
Last synced: 01 Aug 2024
https://github.com/tharindudr/simple-sentence-similarity
Exploring the simple sentence similarity measurements using word embeddings
elmo fasttext glove ipynb python sentence-embeddings sentence-similarity wmd word-embeddings word2vec
Last synced: 27 Sep 2024
https://github.com/nikolamilosevic86/local-genai-search
Local-GenAI-Search is a generative search engine based on Llama 3, langchain and qdrant that answers questions based on your local files
generative-ai langchain large-language-models llama3 local msmarco python3 qdrant-client search-engine sentence-embeddings sentence-transformers
Last synced: 27 Sep 2024
https://github.com/hppRC/simple-simcse-ja
Exploring Japanese SimCSE
japanese pytorch sentence-embeddings sentence-transformers simcse transformers
Last synced: 03 Aug 2024
https://github.com/hellojwilde/energetic-ai
EnergeticAI is TensorFlow.js, optimized for serverless environments, with fast cold-start, small module size, and pre-trained models.
ai artificial-intelligence embeddings embeddings-trained machine-learning sentence-embeddings tensorflow tensorflowjs
Last synced: 01 Aug 2024
https://github.com/sdadas/polish-sentence-evaluation
Evaluation of Sentence Representations in Polish
natural-language-processing polish-language sentence-embeddings word-embeddings
Last synced: 31 Jul 2024
https://github.com/iarroyof/sentence_embedding
A sentence embedding method based on weighted series
natural-language-processing semantic-similarity sentence-embeddings sentence-representations word-embeddings
Last synced: 03 Aug 2024
https://github.com/luozhouyang/deepse
Sentence Embeddings using Deep Nerual Networks in PRODUCTION!
bert keras sentence-embeddings simcse
Last synced: 30 Sep 2024
https://github.com/talmago/simple-but-tough-to-beat-examples
Bunch of examples of a "Simple but tough to beat baseline for sentence embeddings" in classification tasks
fake-news-classification fasttext fasttext-python imdb-dataset machine-learning nlp sentence-embeddings sentence2vec w2v word-embeddings word2vec
Last synced: 02 Oct 2024
https://github.com/ozlerhakan/keywords_clustering
cluster text data using sentence bert
bert embeddings python3 search sentence-bert sentence-embeddings
Last synced: 02 Oct 2024
https://github.com/skywardai/kirin
APIs aggregator for inference, fine-tuning and build models.
ai api container conversational-ai fastapi fine-tuning llamacpp llm-inference llm-training rag sentence-embeddings vector-database
Last synced: 27 Sep 2024
https://github.com/galal-pic/talented-recruitment-and-skills-analysis-system
The project's goal is to help job seekers understand the basic qualifications for specific jobs and evaluate the suitability of their skills for those positions. Additionally, the program aims to assist recruiters in enhancing their resume selection processes by analyzing and understanding job advertisements ....
cvanalysis fine-tuning flask huggingface ner nlp python scraping sentence-embeddings sentence-transformers spacy sqlalchemy transformer
Last synced: 30 Sep 2024
https://github.com/ryandsilva/clef-2023-joker
Code Repository for AKRaNLU @ CLEF JOKER 2023: Using Sentence Embeddings and Multilingual Models to Detect and Interpret Wordplay
computational-humor puns sentence-embeddings sentence-transformers transformers wordplay xlm-roberta
Last synced: 02 Oct 2024
https://github.com/bilalhameed248/faq-finder-using-rag
A RAG (Retrieval augmented generation)-based FAQ Chat-Bot, designed to operate within an organization's internal domain. - Jul 2023 - Oct 2023
chat-application chatbot faqbot faqs llama3 perplexity perplexity-ai perplexity-api query-builder question-answering rag sentence-embeddings sentence-transformers t5-base t5-model
Last synced: 01 Oct 2024
https://github.com/salman-khan-mohammed/q-a-system
The "Codebasics Q&A" project is an end-to-end Question and Answer (Q&A) system developed for Codebasics, an e-learning company specializing in data-related courses and bootcamps. The system is designed to assist students who typically ask questions via Discord or email by providing instant, automated responses.
faiss googlepalm huggingface langchain sentence-embeddings streamlit
Last synced: 26 Sep 2024