Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/IngestAI/embedditor

⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.

datapreprocessing datascience embedding-vectors embeddings genai laravel llm markup-language ml nlp nltk php vector-database vector-search vectorization veml

Last synced: 04 Jul 2024

https://github.com/supabase/vecs

Postgres/pgvector Python Client

ai embeddings pgvector postgres vectors

Last synced: 03 Jul 2024

https://github.com/m1guelpf/clippy-widget

An AI-powered assistant for your company's docs.

chatbot chatgpt docs documentation embeddings gpt-3 search

Last synced: 03 Jul 2024

https://github.com/malllabiisc/cesi

WWW 2018: CESI: Canonicalizing Open Knowledge Bases using Embeddings and Side Information

canonicalization cesi dataset embeddings knowledge-graph knowledge-graph-embeddings www

Last synced: 02 Jul 2024

https://github.com/MaxwellRebo/awesome-2vec

Curated list of 2vec-type embedding models

awesome embeddings list

Last synced: 02 Jul 2024

https://github.com/mop/bier

Cleaned up reference implementation of BIER: Boosting Independent Embeddings Robustly.

cnn computer-vision embeddings

Last synced: 02 Jul 2024

https://github.com/PKU-DAIR/Hetu

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

artificial-intelligence autograd data-science deep-learning deep-neural-networks distributed-systems distributed-training embeddings gpu high-dimensional machine-learning python state-of-the-art

Last synced: 01 Jul 2024

https://github.com/IITH-Compilers/IR2Vec

Implementation of IR2Vec, published in ACM TACO

embeddings llvm

Last synced: 01 Jul 2024

https://github.com/superagent-ai/super-rag

Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.

agents ai embeddings inference rag vector-database

Last synced: 30 Jun 2024

https://github.com/julien040/hn-recommendation-api

A recommendation system for Hacker News. Get the most similar posts for a given URL

embedding embeddings faiss hacker-news hnsw nextjs openai recommendation

Last synced: 29 Jun 2024

https://github.com/enmanuelmag/AnimeClassificator

Investigation of models for anime classification using representative anime data (frames, videos, audio)

anime classification-algorithm deep-learning embeddings image-classification learning machile manga

Last synced: 29 Jun 2024

https://github.com/hybrid-kg/clep

🤖 A Python Package for generating new patient representations driven by data and prior knowledge

bioinformatics embeddings hybrid-data knowledge-driven-framework knowledge-graph knowledge-graph-embeddings machine-learning patient-data

Last synced: 28 Jun 2024

https://kevinmusgrave.github.io/pytorch-metric-learning/

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

computer-vision contrastive-learning deep-learning deep-metric-learning embeddings image-retrieval machine-learning metric-learning pytorch self-supervised-learning

Last synced: 21 Jun 2024

https://github.com/eugeneyan/ml-surveys

📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, etc.

computer-vision deep-learning embeddings machine-learning nlp recommender-system reinforcement-learning survey

Last synced: 20 Jun 2024

https://github.com/warchildmd/game2vec

TensorFlow implementation of word2vec applied on https://www.kaggle.com/tamber/steam-video-games dataset, using both CBOW and Skip-gram.

cbow embeddings game2vec kaggle skipgram tensorflow word2vec

Last synced: 20 Jun 2024

https://github.com/danieldk/dpar

Neural network transition-based dependency parser (in Rust)

dependency-parser embeddings neural-networks parsing rust transition

Last synced: 19 Jun 2024

https://github.com/joisino/wordtour

Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)

embeddings machine-learning natural-language-processing word-embeddings word2vec

Last synced: 16 Jun 2024

https://github.com/pinecone-io/pinecone-datasets

An open-source dataset library for pre-embedded datasets: create your own data catalog, or use Pinecone's public datasets.

data database embeddings vector

Last synced: 16 Jun 2024

https://github.com/persiyanov/skip-thought-tf

An implementation of skip-thought vectors in TensorFlow

deep-learning deeplearning embeddings nlp skip-thought-vectors tensorflow text-summarization

Last synced: 15 Jun 2024

https://github.com/linjungz/chat-with-your-doc

Chat with your docs in PDF/PPTX/DOCX format, using LangChain and GPT4/ChatGPT from both Azure OpenAI Service and OpenAI

azure azure-openai-service chatgpt embeddings gpt-4 langchain openai vectorstore

Last synced: 14 Jun 2024

https://github.com/marcominerva/ChatGptNet

A ChatGPT integration library for .NET, supporting both OpenAI and Azure OpenAI Service

azure-openai azure-openai-api chatgpt csharp dotnet embedding embeddings embeddings-similarity hacktoberfest net openai openai-api

Last synced: 14 Jun 2024

https://github.com/yusufhilmi/client-vector-search

A client-side vector search library that can embed, store, search, and cache vectors. Works in the browser and in Node. It outperforms OpenAI's text-embedding-ada-002 and is way faster than Pinecone and other VectorDBs.

embedding-models embedding-vectors embeddings openai search text-embeddings transformers vector vector-search

Last synced: 12 Jun 2024

https://github.com/Atome-FE/llama-node

Believe in AI democratization. llama for Node.js, backed by llama-rs, llama.cpp and rwkv.cpp; works locally on your laptop CPU. Supports llama/alpaca/gpt4all/vicuna/rwkv models.

ai embeddings gpt langchain large-language-models llama llama-node llama-rs llamacpp llm napi napi-rs nodejs rwkv

Last synced: 11 Jun 2024

https://github.com/guangzhengli/vectorhub

Quickly and easily build an AI website or application using embeddings!

chatgpt chatpdf embedding embeddings gpt gpt-3 nextjs supabase vector vector-database

Last synced: 11 Jun 2024

https://github.com/shibing624/text2vec

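text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.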

embeddings nlp sentence-embeddings similarity text-similarity text2vec word2vec

Last synced: 11 Jun 2024

https://github.com/jxzhangjhu/Awesome-LLM-RAG

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

embeddings large-language-models llm rag rag-embeddings retrieval-augmented-generation retrieval-information

Last synced: 09 Jun 2024

https://github.com/amscotti/local-LLM-with-RAG

Running local Large Language Models to perform Retrieval-Augmented Generation

embeddings langchain llm mistral ollama python rag

Last synced: 09 Jun 2024

https://github.com/Praful932/Kitabe

Book Recommendation System built for Book Lovers📖. Simply Rate ⭐ some books and get immediate recommendations🤩

ajax book-recomendation book-recommender css deep-learning django embeddings funksvd goodbooks-10k goodreads heroku html javascript machine-learning python recommendation-engine recommendation-system surprise svd web-application

Last synced: 09 Jun 2024

https://github.com/grumpyp/aixplora

AIxplora is an open-source tool that lets you query all kinds of files, with no limit on length or format.

audio chat chatbot chatgpt embeddings embeddings-model generativeai llm llms nlp openai ownfiles pdf question-answering search second-brain vectorstore

Last synced: 08 Jun 2024

https://github.com/Zeeshanahmad4/chatgpt-knowledge-base-chatbot

An advanced chatbot that utilizes your own data to provide intelligent ChatGPT-style conversations using gpt-3.5-turbo and Ada for advanced embedding, as well as custom indexes and knowledgebase for a seamless user experience.

artificial-intelligence chatbot chatgpt embeddings gpt-3 gpt-3-5-turbo knowledge-base long-short-term-memory machine-learning openai openai-api openai-chatgpt pinecone qa qabot vectors

Last synced: 08 Jun 2024

https://github.com/Jordan-Gilliam/ai-template

Mercury - Train your own custom GPT. Chat with any file, or website.

ai chatgpt cheeriojs embeddings gpt-3 gpt-4 nextjs openai openai-template pdf pinecone radix-ui tailwindcss typescript vector-database

Last synced: 08 Jun 2024

https://github.com/Mintplex-Labs/vector-admin

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

ai ai-agents aitools chroma database-management document-retrieval embeddings flowise langchain langchain-js llms pinecone qdrant vector-data-management vector-database vector-database-embedding vector-search vectordatabase vectorspace weaviate

Last synced: 08 Jun 2024

https://github.com/epsilla-cloud/vectordb

Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/

ai chatgpt data data-science database embeddings embeddings-similarity infrastructure llms machine-learning neural-network neural-search rag retrieval search-engine vector-database vector-search

Last synced: 08 Jun 2024

https://github.com/NeumTry/NeumAI

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors

Last synced: 08 Jun 2024

https://github.com/MilaNLProc/contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

bert embeddings multilingual-models multilingual-topic-models neural-topic-models nlp nlp-library nlp-machine-learning text-as-data topic-coherence topic-modeling transformer

Last synced: 07 Jun 2024

https://github.com/intelligentnode/IntelliNode

Access the latest AI models like ChatGPT, LLaMA, Diffusion, Gemini, Hugging Face, and beyond through a unified prompt layer and performance evaluation

anthropic chatbot chatgpt claude dall-e embeddings gemini google-ai gpt-4 hugging-face image-generation language-model mistralai nodejs openai prompt-engineering semantic-search speech-synthesis vectors

Last synced: 07 Jun 2024

https://github.com/thinktecture-labs/semantic-kernel-semanticsearch

Example of how to implement a question & answer flow with semantic search and OpenAI, using C# & Semantic Kernel

dotnet embeddings generative-ai llms openai semantic-kernel semantic-search

Last synced: 07 Jun 2024

https://github.com/YC-wind/embedding_study

A study of generating character vectors with Chinese pretrained models, testing how BERT and ELMo perform on Chinese

bert chinese elmo elmo-tutorial embeddings z-w

Last synced: 06 Jun 2024

https://github.com/alexklibisz/elastiknn

Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.

elasticsearch elasticsearch-plugin embeddings locality-sensitive-hashing lucene nearest-neighbor-search neural-search semantic-search similarity-search

Last synced: 05 Jun 2024

https://github.com/supabase-community/langchain-chatbot-demo

Example of building a chatbot with LangChain and Supabase Vector.

ai chatbots embeddings langchain langchain-js openai supabase vector-database

Last synced: 02 Jun 2024

https://github.com/supabase-community/chatgpt-your-files

Production-ready MVP for securely chatting with your documents using pgvector

ai db embeddings ml rag supabase vector

Last synced: 02 Jun 2024

https://github.com/bramses/quoordinates

Use OpenAI Embeddings to visualize Kindle Highlights from Readwise!

embeddings future-of-reading kindle nomic openai readwise supabase

Last synced: 02 Jun 2024

https://github.com/lancedb/lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, and Pyarrow, with more integrations coming.

apache-arrow computer-vision data-analysis data-analytics data-centric data-format data-science dataops deep-learning duckdb embeddings llms machine-learning mlops python rust

Last synced: 02 Jun 2024

https://github.com/lilianweng/stock-rnn

Predict stock market prices using RNN model with multilayer LSTM cells + optional multi-stock embeddings.

embeddings lstm rnn-tensorflow stock-price-prediction

Last synced: 02 Jun 2024

https://github.com/hayabhay/frogbase

Transform audio-visual content into navigable knowledge.

embeddings package python search semantic-search speech-to-text streamlit ui

Last synced: 02 Jun 2024

https://github.com/lancedb/vectordb-recipes

High quality resources & applications for LLMs, multi-modal models and VectorDBs

agents ai deep-learning embeddings fine-tuning gpt gpt-4-vision langchain llama-index llms machine-learning multimodal openai rag vector-database

Last synced: 02 Jun 2024

https://github.com/Kav-K/GPTDiscord

A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI moderation, custom indexes/knowledgebase, YouTube summarizer, and more!

artificial-intelligence asyncio chatbot code-interpreter collaborate dalle2 digitalocean discord embeddings extractive-question-answering github gpt3 hacktoberfest help-wanted moderator-bot multi-modal openai openai-api pinecone python

Last synced: 31 May 2024

https://github.com/SamurAIGPT/EmbedAI

An app to interact with your documents using the power of GPT, 100% privately, with no data leaks

chatbot chatgpt embedai embeddings generative gpt gpt4 gpt4all langchain models openai privategpt vectorstore whisper

Last synced: 31 May 2024

https://github.com/danny-avila/rag_api

ID-based RAG FastAPI: Integration with LangChain and PostgreSQL/pgvector

api api-rest embeddings fastapi langchain pgvector postgresql psql python rag vector vector-database

Last synced: 30 May 2024

https://github.com/veekaybee/what_are_embeddings

A deep dive into embeddings starting from fundamentals

embeddings machine-learning machine-learning-algorithms nlp-machine-learning

Last synced: 30 May 2024

https://github.com/google/generative-ai-docs

Documentation for Google's Gen AI site - including the Gemini API and Gemma

ai chatbot documentation embeddings gemini gemini-api gemma llm machine-learning

Last synced: 30 May 2024

https://github.com/Azure-Samples/azure-sql-db-session-recommender

Build a recommender using OpenAI, Azure Functions, Azure Static Web Apps, Azure SQL DB, Data API builder and Text Embeddings

azure-functions azure-sql-db azure-static-web-apps data-api-builder embeddings event-driven fullstack jamstack open-ai vectors

Last synced: 26 May 2024

https://github.com/deepfates/silicon

Add some intelligence to your notes with Silicon AI for Obsidian

ai embeddings gpt-3 obsidian-md tools-for-thought

Last synced: 22 May 2024

https://github.com/wpydcr/LLM-Kit

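🚀 WebUI integrated platform for the latest LLMs | A full-workflow WebUI toolkit for major language models. Supports mainstream LLM API interfaces and open-source models. Supports full-workflow application tools including knowledge bases, databases, role-play, mj text-to-image, LoRA and full-parameter fine-tuning, dataset creation, and live2d.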

chatbot embeddings fine-tuning generative-agents llm player

Last synced: 22 May 2024

https://github.com/mims-harvard/decagon

Graph convolutional neural network for multirelational link prediction

deep-learning embeddings graph-convolutional-networks graph-neural-networks pharmacology representation-learning

Last synced: 20 May 2024

https://github.com/xgfs/verse

Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures

embeddings graph graph-algorithms machine-learning machine-learning-algorithms similarity-measures

Last synced: 20 May 2024

https://github.com/claws-lab/jodie

A PyTorch implementation of ACM SIGKDD 2019 paper "Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks"

dynamic-networks embedding-trajectories embeddings kdd2019 machine-learning network-embedding representation-learning temporal-network

Last synced: 20 May 2024

https://github.com/MI2DataLab/memr

R package for Multisource Embeddings for Medical Records

embeddings medical-records rstats

Last synced: 20 May 2024

https://github.com/wikipedia2vec/wikipedia2vec

A tool for learning vector representations of words and entities from Wikipedia

embeddings natural-language-processing nlp python text-classification wikipedia

Last synced: 19 May 2024

https://github.com/Hellisotherpeople/CX_DB8

A contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (BERT, Universal Sentence Encoder, Flair)

contextual-summarization cuda debate-evidence embeddings extractive-summarization flair python semantic-search semantic-summarization summarization summarizer token-level-summarization universal-sentence-encoder

Last synced: 19 May 2024

https://github.com/gustavz/DataChad

Ask questions about any data source by leveraging LangChain

activeloop chatbot chatgpt chatwithanything chatwithpdf embeddings knowledge-base langchain openai python streamlit

Last synced: 19 May 2024

https://github.com/Dicklesworthstone/swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

embedding-similarity embedding-vectors embeddings llama2 llamacpp semantic-search

Last synced: 19 May 2024

https://github.com/eifuentes/awesome-embeddings

🪁A curated list of awesome resources around entity embeddings

awesome awesome-list deep-learning embedding embeddings feature-engineering machine-learning

Last synced: 19 May 2024

https://github.com/nlpcloud/nlpcloud-js

NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, intent classification, product description and ad generation, chatbot, grammar and spelling correction, keywords and keyphrases extraction, text generation, image generation, code generation, and much more...

ad-generator chatbot code-generation conversational-ai embeddings intent-classification keywords-extraction language-detection machine-translation ner nlp paraphrasing question-answering semantic-similarity sentiment-analysis text-classification text-generation text-summarization tokenization

Last synced: 16 May 2024

https://github.com/dotvignesh/PDFChat

The PDFChat app allows you to chat with your PDF files in natural language.

embeddings gpt-3 langchain llm openai python streamlit

Last synced: 15 May 2024

https://github.com/jiatastic/GPTInterviewer

📚 GPT Interviewer - Practice interviews with an AI interviewer based on job descriptions and resumes

ai chatbot chatgpt embeddings langchain llm pdf vectorstores

Last synced: 15 May 2024

https://github.com/marcominerva/OpenAIEmbeddingSample

An example that shows how to use Semantic Kernel and Kernel Memory to work with embeddings in a .NET application using SQL Server as Vector Database.

azure-openai c-sharp chatgpt dotnet embeddings kernel-memory openai semantic-kernel sql-server vector-database visual-studio

Last synced: 15 May 2024

https://github.com/Hironsan/awesome-embedding-models

A curated list of awesome embedding models tutorials, projects and communities.

awesome embedding-models embeddings machine-learning natural-language-processing papers word2vec

Last synced: 14 May 2024

https://github.com/chroma-core/chroma

The AI-native open-source embedding database

document-retrieval embeddings llms

Last synced: 14 May 2024

https://github.com/ThoughtRiver/lmdb-embeddings

Fast word vectors with little memory usage in Python

embeddings fasttext gensim glove lmdb magnitude memory speed text vectors word word2vec

Last synced: 14 May 2024

https://github.com/BaseModelAI/cleora

Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.

ai cleora-embeddings datasets deepwalk embeddings entity graphs hypergraphs inductive-entity-embeddings machine-learning ml pytorch-biggraph synerise

Last synced: 13 May 2024