An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with rerank

A curated list of projects in awesome lists tagged with rerank .

https://github.com/mudler/localai

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes libp2p llama llama3 llm mamba mistral musicgen rerank rwkv stable-diffusion text-generation tts

Last synced: 14 May 2026

https://github.com/go-skynet/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes libp2p llama llama3 llm mamba mistral musicgen rerank rwkv stable-diffusion text-generation tts

Last synced: 03 May 2025

https://github.com/mudler/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference

ai api audio-generation distributed gemma gpt4all image-generation kubernetes llama llama3 llm mamba mistral musicgen p2p rerank rwkv stable-diffusion text-generation tts

Last synced: 14 Mar 2025

https://github.com/quantumnous/new-api

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management. 🍥

ai-gateway claude deepseek gemini openai rerank

Last synced: 09 Apr 2026

https://github.com/QuantumNous/new-api

AI模型接口管理与分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude等格式,可供个人或者企业内部管理与分发渠道使用,本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.

ai-gateway claude deepseek gemini openai rerank

Last synced: 10 Apr 2025

https://github.com/calcium-ion/new-api

AI模型接口管理与分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude等格式,可供个人或者企业内部管理与分发渠道使用,本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.

ai-gateway claude deepseek gemini openai rerank

Last synced: 03 Apr 2025

https://github.com/Calcium-Ion/new-api

AI模型接口管理与分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude等格式,可供个人或者企业内部管理与分发渠道使用,本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.

ai-gateway claude deepseek gemini openai rerank

Last synced: 24 Mar 2025

https://github.com/shell-nlp/gpt_server

gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。

asr embedding fastchat function-calling gpt infinity llama llm lmdeploy openai prompt-injection rerank sglang text-moderation tts vllm

Last synced: 28 Feb 2026

https://github.com/mgonzs13/llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

cpp embeddings ggml gguf gpt langchain llama llamacpp llava llavacpp llm rerank reranking ros2 vlm

Last synced: 04 Apr 2025

https://github.com/tensorlakeai/rerank-ts

rerank library for easy reranking of results

rerank reranking typescript-library

Last synced: 01 Mar 2026

https://github.com/lzjever/lexilux

Unified LLM API client library for Python. Simple API for Chat, Embedding, Rerank, and Tokenizer. OpenAI-compatible with streaming support and unified usage tracking.

api-client chat-api document-ranking embedding function-api llm openai-api openai-compatible python rerank reranker semantic-search streaming tokenizer

Last synced: 26 Jan 2026

https://github.com/stephanj/bm25

A BM25 Java implementation using streams, stop words and stemming.

bm25 llm nlp rerank stemming

Last synced: 13 Oct 2025

https://github.com/eliaspereirah/searchaugmentedllm

SearchAugmentedLLM empowers LLMs with information from the web

google-search-api grounding-llms llm rag rerank reranker reranking retrieval-augmented-generation

Last synced: 18 Mar 2025

https://github.com/ittia-research/check

Automated fact-check

embedding factcheck llm rag rerank

Last synced: 29 Apr 2026

https://github.com/pashpashpash/python-rag-scaffold

A comprehensive RAG FastAPI service that handles document uploads and retrievals, built with Python. Uses PyMuPDF for document processing, turbopuffer for vector storage, OpenAI for models, and cohere for reranking.

boilerplate cohere cohere-ai cosine-similarity embeddings fastapi observability openai python rag rerank reranker retrieval-augmented-generation template turbopuffer uvicorn vault-ai vector vector-database vector-embeddings

Last synced: 03 Sep 2025

https://github.com/atopx/teiclient

go client for text-embedding-inference (https://github.com/huggingface/text-embeddings-inference)

rerank tei text-embedding text-embeddings-inference

Last synced: 01 Jul 2025

https://github.com/payamnajat/chatgpt-5-configuration-analysis

📊 Analyze and deobfuscate 3,099 configuration elements for ChatGPT, enhancing search optimization and system performance.

aeo ai algorithm chatgpt geo llm openai ranking rerank seo

Last synced: 31 Aug 2025

https://github.com/jiangnanboy/jiajia-search

Multilingual Lightweight & High-Performance Hybrid Search Engine | Built for RAG

embedding hybrid-search multilingual onnx qa rag rerank search

Last synced: 04 Apr 2026