Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with text-embedding
A curated list of projects in awesome lists tagged with text-embedding.
https://github.com/embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
benchmark bitext-mining clustering information-retrieval multilingual-nlp neural-search reranking retrieval sbert semantic-search sentence-transformers sgpt sts text-classification text-embedding
Last synced: 17 Dec 2024
https://github.com/xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
embeddings information-retrieval language-model prompt-retrieval text-classification text-clustering text-embedding text-evaluation text-reranking text-semantic-similarity
Last synced: 18 Dec 2024
https://github.com/muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
gpt information-retrieval language-model large-language-models neural-search retrieval semantic-search sentence-embeddings sgpt text-embedding
Last synced: 18 Dec 2024
https://github.com/seanlee97/angle
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
dense-retrieval embeddings information-retrieval llama llama2 llm mteb rag retrieval-augmented-generation semantic-similarity semantic-textual-similarity sentence-embedding sentence-embeddings sentence-vector sts stsbenchmark text-embedding text-similarity text-vector text2vec
Last synced: 20 Dec 2024
https://github.com/neonwatty/meme-search
The open source Meme Search Engine. Free and built to self-host locally with Python, Ruby, and Docker.
demo-app docker generative-ai large-language-models machine-learning python ruby ruby-on-rails self-hosted text-embedding vector-database vision-language-model
Last synced: 21 Dec 2024
https://github.com/neonwatty/meme_search
Index your memes by their content and text, making them easily retrievable for your meme warfare pleasures. Find funny fast.
demo-app generative-ai large-language-models machine-learning text-embedding vector-database vision-language-model
Last synced: 01 Dec 2024
https://github.com/milosgajdos/go-embeddings
Go module for fetching embeddings from embeddings providers
ai aws-bedrock bedrock cohere embeddings go golang llm ollama openai text-embedding text-embeddings vertex-ai
Last synced: 03 Dec 2024
https://github.com/alash3al/vecdb
a vector embedding database with multiple storage engines and AI embedding integrations
ai database gemini machine-learning text-embedding vector-database vector-embeddings
Last synced: 29 Nov 2024
https://github.com/lakeraai/canica
A text embedding viewer for the Jupyter environment
jupyter jupyter-notebook text-embedding text-embeddings visualization
Last synced: 10 Dec 2024
https://github.com/cloudera/cml_amp_few-shot_text_classification
Perform topic classification on news articles in several limited-labeled data regimes.
bert few-shot-learning nlp text-embedding zero-shot-classification
Last synced: 07 Nov 2024
https://github.com/astrabert/sentrev
Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs
embedders evaluation-framework python python-package qdrant semantic-search sentence-transformers text-embedding vector-database
Last synced: 12 Dec 2024
https://github.com/ubos-tech/node-red-contrib-openai-ubos
A Node-RED node that interacts with OpenAI machine learning models to generate text like ChatGPT
ada-002 ai chatgpt chatgpt-api code-generation dall-e embeddings embeddings-model flow gpt-3 gpt-35-turbo-0301 javascript machine-learning no-code node-red nodejs openai text-embedding text-generation
Last synced: 02 Nov 2024
https://github.com/izhx/uni-rep
Code for embedding and retrieval research.
embeddings infomation-retrieval sentence-embeddings text-embedding
Last synced: 12 Nov 2024
https://github.com/easonlai/product_recommendations_with_gpt
I have improved the demo by using Azure OpenAI's Embedding model (text-embedding-ada-002), which has a powerful word embedding capability. This model can also vectorize product key phrases and recommend products based on cosine similarity, but with better results. You can find the updated repo here.
azure azure-openai azure-openai-api cosine-similarity openai product-recommendation python python3 recommender-system text-embedding text-embeddings word-embedding word-embeddings
Last synced: 10 Nov 2024
https://github.com/amazon-science/text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion
diffusion-models lda machine-learning natural-language-processing nlp sentence-embeddings t5 text-embedding text-embeddings text-generation topic topic-modeling topic-models transformers
Last synced: 12 Nov 2024
https://github.com/rosette-api/csharp
Rosette API Client Library for C#
capi csharp entity-extraction language-identification machine-learning morphology name-translation natural-language-processing nlp nuget rosette text-analysis text-analytics text-embedding visual-studio
Last synced: 12 Nov 2024
https://github.com/rosette-api/php
Babel Street Analytics Client Library for PHP
entity-extraction language-identification lemma morphology named-entity-recognition natural-language-processing nlp php text-analytics text-embedding tokenization
Last synced: 19 Dec 2024
https://github.com/rosette-api-community/text-embeddings-sample
A little python code to show how to get similarity between word embeddings returned from the Rosette API's new /text-embedding endpoint.
machine-learning natural-language-processing nlp python text-embedding text-extraction text-similarity word-similarity
Last synced: 12 Nov 2024
https://github.com/rosette-api/ruby
Rosette API Client Library for Ruby
deduplication entity-extraction language-identification machine-learning morphology named-entity-recognition natural-language-processing nlp ruby sentiment-analysis text-analytics text-embedding tokenization
Last synced: 12 Nov 2024
https://github.com/easonlai/azure_openai_semantic_search_sample
This code repo demonstrates how to use the word embedding model from Azure OpenAI Service to perform a semantic search on a grocery store dataset.
azure azure-openai azure-openai-api python python3 semantic-search semantic-similarity text-embedding word-embeddings
Last synced: 10 Nov 2024
https://github.com/easonlai/product_semantic_search_streamlit
This code repo demonstrates how to use the word embedding model from Azure OpenAI Service to perform a semantic search on a grocery store dataset. This enhanced, completed version uses Streamlit to build a web interface for running semantic searches and displaying the most relevant items.
azure azure-openai azure-openai-api product-search python python3 semantic-search streamlit streamlit-webapp text-embedding word-embeddings
Last synced: 10 Nov 2024
https://github.com/deadbits/vector-embedding-api
Flask API for generating text embeddings using OpenAI or sentence_transformers
api-server cache-storage embedding huggingface nlp openai sentence-transformers text-embedding text-embeddings vector-embeddings
Last synced: 13 Dec 2024
https://github.com/aldenhovel/image-retrieval
An image retrieval engine.
image-captioning image-retrieval image-search-engine object-detection text-classification text-embedding
Last synced: 13 Nov 2024
https://github.com/rosette-api-community/visualize-embeddings
A simple Python script for transforming a corpus of documents into text vectors suitable for visualization
machine-learning natural-language-processing nlp python text-embedding text-vectorization tsv visualization
Last synced: 12 Nov 2024
https://github.com/turian/embeddingcache
Retrieve text embeddings, but cache them locally if we have already computed them.
ai caching embeddings language-model machine-learning nlp nlp-tool nlp-tools openai-api sentence-transformers text-embedding text-embeddings vector-database
Last synced: 20 Dec 2024
https://github.com/rosette-api/curl-examples
cURL examples for the Rosette API
categorization curl entity-extraction lemmatization morphology natural-language-processing nlp relation-extraction sentiment text-analytics text-embedding text-mining tokenization
Last synced: 12 Nov 2024
https://github.com/manzoorali29/re-llm-papers
[Paper List] Papers using LLMs for any type of relation extraction/classification
deep-learning few-shot-learning fine-tuning gpt-4 information-retrieval knowledge-graph language-model llama llms machine-learning ner nlp prompt-engineering relation-extraction text-embedding text-mining transfer-learning
Last synced: 13 Dec 2024
https://github.com/rootguillen/patent-search-system-with-gradio
Developed by Gyudong HAN, Counsellor, WIPO ([email protected]). I developed this system with reference to the general text retrieval system that was uploaded together with the video clip named "LangChain Retrieval QA Over Multiple Files with ChromaDB". I only added a Gradio implementation for its UI.
chromadb python3 text-embedding text-retrieval vector-database
Last synced: 10 Nov 2024
https://github.com/brianlesko/rag-text-search
This git repository hosts a user interface for a chat app, with integrated text similarity search for querying a document. Think of it as an upgraded Cmd+F search. It's written in pure Python. Created for learning purposes.
cosine-similarity gpt llm openai python search-engine streamlit text text-embedding text-processing ui
Last synced: 06 Nov 2024
https://github.com/simonpierreboucher/embedding
A robust Python tool for generating embeddings from text files using OpenAI's API. This tool processes text files, splits them into chunks while preserving context headers, and generates embeddings using OpenAI's models, saving both text and embeddings in structured formats.
api-rate-limiting automated-text-analysis context-preservation data-preprocessing embeddings-generation error-handling json-and-npy-formats machine-learning metadata-management natural-language-processing openai-api python-tool text-chunking text-embedding yaml-configuration
Last synced: 12 Dec 2024
https://github.com/simonpierreboucher/embedding-generator
A robust Python tool for generating embeddings from text files using OpenAI's API. This tool processes text files, splits them into chunks while preserving context headers, and generates embeddings using OpenAI's models, saving both text and embeddings in structured formats.
embeddings json npy openai semantic-search text-embedding
Last synced: 16 Nov 2024
https://github.com/bilalhameed248/faq-chat-bot-using-vertexai
A generative AI-based FAQ Chat-Bot with a Flask Back-End, designed to operate within an organization's internal domain. - Jul 2023 - Oct 2023
csv embeddings flask gecko html java jquery natural-language-processing nlp python python3 pytorch text-bison text-embedding text-preprocessing vertex-ai
Last synced: 15 Nov 2024
https://github.com/jibril14/openai_text_embedding
OpenAI Text Embedding. Clean, process, and create vectorized representations of text for indexing and semantic search
chat-application gpt-4 machine-learning natural-language-processing openai prompt-engineering text-embedding vector-database
Last synced: 22 Nov 2024
https://github.com/somenath203/four-in-one-ai-toolkit-powered-by-google-gemini-api
Click below to check out the website
chatbot gemini gemini-api image-caption-generator python question-answering-system streamlie-cloud streamlit text-embedding
Last synced: 19 Nov 2024