Projects in Awesome Lists tagged with rerank
A curated list of projects in awesome lists tagged with rerank .
https://github.com/mudler/localai
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
ai api audio-generation distributed gemma gpt4all image-generation kubernetes libp2p llama llama3 llm mamba mistral musicgen rerank rwkv stable-diffusion text-generation tts
Last synced: 14 May 2026
https://github.com/go-skynet/LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
ai api audio-generation distributed gemma gpt4all image-generation kubernetes libp2p llama llama3 llm mamba mistral musicgen rerank rwkv stable-diffusion text-generation tts
Last synced: 03 May 2025
https://github.com/mudler/LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference
ai api audio-generation distributed gemma gpt4all image-generation kubernetes llama llama3 llm mamba mistral musicgen p2p rerank rwkv stable-diffusion text-generation tts
Last synced: 14 Mar 2025
https://github.com/quantumnous/new-api
A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management. 🍥
ai-gateway claude deepseek gemini openai rerank
Last synced: 09 Apr 2026
https://github.com/QuantumNous/new-api
AI模型接口管理与分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude等格式,可供个人或者企业内部管理与分发渠道使用,本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.
ai-gateway claude deepseek gemini openai rerank
Last synced: 10 Apr 2025
https://github.com/calcium-ion/new-api
AI模型接口管理与分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude等格式,可供个人或者企业内部管理与分发渠道使用,本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.
ai-gateway claude deepseek gemini openai rerank
Last synced: 03 Apr 2025
https://github.com/Calcium-Ion/new-api
AI模型接口管理与分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude等格式,可供个人或者企业内部管理与分发渠道使用,本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.
ai-gateway claude deepseek gemini openai rerank
Last synced: 24 Mar 2025
https://github.com/shell-nlp/gpt_server
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。
asr embedding fastchat function-calling gpt infinity llama llm lmdeploy openai prompt-injection rerank sglang text-moderation tts vllm
Last synced: 28 Feb 2026
https://github.com/tensorlakeai/rerank-ts
rerank library for easy reranking of results
rerank reranking typescript-library
Last synced: 01 Mar 2026
https://github.com/lzjever/lexilux
Unified LLM API client library for Python. Simple API for Chat, Embedding, Rerank, and Tokenizer. OpenAI-compatible with streaming support and unified usage tracking.
api-client chat-api document-ranking embedding function-api llm openai-api openai-compatible python rerank reranker semantic-search streaming tokenizer
Last synced: 26 Jan 2026
https://github.com/eliaspereirah/searchaugmentedllm
SearchAugmentedLLM empowers LLMs with information from the web
google-search-api grounding-llms llm rag rerank reranker reranking retrieval-augmented-generation
Last synced: 18 Mar 2025
https://github.com/pashpashpash/python-rag-scaffold
A comprehensive RAG FastAPI service that handles document uploads and retrievals, built with Python. Uses PyMuPDF for document processing, turbopuffer for vector storage, OpenAI for models, and cohere for reranking.
boilerplate cohere cohere-ai cosine-similarity embeddings fastapi observability openai python rag rerank reranker retrieval-augmented-generation template turbopuffer uvicorn vault-ai vector vector-database vector-embeddings
Last synced: 03 Sep 2025
https://github.com/atopx/teiclient
go client for text-embedding-inference (https://github.com/huggingface/text-embeddings-inference)
rerank tei text-embedding text-embeddings-inference
Last synced: 01 Jul 2025
https://github.com/jiangnanboy/jiajia-search
Multilingual Lightweight & High-Performance Hybrid Search Engine | Built for RAG
embedding hybrid-search multilingual onnx qa rag rerank search
Last synced: 04 Apr 2026