An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with llm-proxy

A curated list of projects in awesome lists tagged with llm-proxy .

https://github.com/romgX/openrelay

几百个免费 AI 模型配额,一键接入本地项目。| Hundreds of free AI model quotas, one-click access to local projects.

ai ai-proxy aider cerebras claude claude-code copilot cursor developer-tools free-ai free-api groq kiro llm-proxy model-router openai openclaw proxy windsurf

Last synced: 26 Jun 2026

https://github.com/liaohch3/claude-tap

Intercept and inspect Coding Agent API traffic from Claude Code, Codex CLI, Gemini CLI, Cursor CLI, OpenCode, Kimi, Pi, and Hermes in a local trace viewer.

agent-debugging agent-observability ai-agents ai-tools api-debugging claude-code codex codex-cli cursor-cli developer-tools gemini-cli hermes-agent kimi llm llm-proxy opencode pi-coding-agent proxy trace trace-viewer

Last synced: 26 May 2026

https://github.com/toby-bridges/api-relay-audit

Local security audit for AI API relays and LLM proxies: detects prompt injection, model substitution, tool-call rewriting, SSE anomalies, error leakage, and Web3 wallet risks.

ai-agents ai-audit ai-security anthropic api-gateway claude cli llm-audit llm-proxy llm-security model-substitution openai-api prompt-injection python security-audit security-scanner supply-chain-security tool-call-rewriting web3-security web3-wallet

Last synced: 21 Jun 2026

https://github.com/starbaser/ccproxy

Build mods for Claude Code: Hook any request, modify any response, /model "with-your-custom-model", intelligent model routing using your logic or ours

ai ai-gateway ai-proxy ai-tools anthropic claude claude-ai claude-api claude-code claude-max claudecode gemini gemini-cli litellm llm llm-gateway llm-proxy llmops openai openrouter

Last synced: 09 Jun 2026

https://github.com/romgx/openrelay

几百个免费 AI 模型配额,一键接入本地项目。| Hundreds of free AI model quotas, one-click access to local projects.

ai ai-proxy aider cerebras claude claude-code copilot cursor developer-tools free-ai free-api groq kiro llm-proxy model-router openai openclaw proxy windsurf

Last synced: 09 May 2026

https://github.com/LeenHawk/gproxy

gproxy is a Rust-based multi-channel LLM proxy that exposes OpenAI / Claude / Gemini-style APIs through a unified gateway, with a built-in admin console, user/key management, and request/usage auditing.

claude gemini gpt llm-proxy

Last synced: 26 Jun 2026

https://github.com/guanxiaol/WindsurfPoolAPI

Multi-account pool proxy for Windsurf — 113+ models (Claude/GPT/Gemini/Grok/Kimi) via OpenAI & Anthropic APIs, image upload, Cursor & Claude Code native / 企业级 Windsurf 多账号池化 API 代理

ai-proxy anthropic anthropic-api api claude claude-code codeium cursor gemini gpt language-server llm-proxy multi-account multimodal nodejs openai openai-api pool proxy windsurf

Last synced: 26 Jun 2026

https://github.com/nayjest/lm-proxy

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.

ai anthropic api-proxy fastapi google-ai language-models llm llm-api llm-gateway llm-inference llm-proxy openai openai-api proxy proxy-server pyton

Last synced: 18 Jun 2026

https://github.com/guanxiaol/windsurfpoolapi

Multi-account pool proxy for Windsurf — 113+ models (Claude/GPT/Gemini/Grok/Kimi) via OpenAI & Anthropic APIs, image upload, Cursor & Claude Code native / 企业级 Windsurf 多账号池化 API 代理

ai-proxy anthropic anthropic-api api claude claude-code codeium cursor gemini gpt language-server llm-proxy multi-account multimodal nodejs openai openai-api pool proxy windsurf

Last synced: 27 Apr 2026

https://github.com/peva3/SmarterRouter

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

ai-cache ai-gateway docker fastapi gpu-monitoring llm llm-proxy llm-router local-llm model-serving ollama ollama-api openai-proxy self-hosted self-hosted-ai semantic-cache

Last synced: 26 Jun 2026

https://github.com/Nayjest/lm-proxy

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.

ai anthropic api-proxy fastapi google-ai language-models llm llm-api llm-gateway llm-inference llm-proxy openai openai-api proxy proxy-server pyton

Last synced: 09 Jun 2026

https://github.com/ferro-labs/ai-gateway

Unified AI Gateway for 30+ LLMs (OpenAI, Anthropic, Bedrock, Azure etc) with Caching, Guardrails, A/B test & cost controls. Go-native Fastest & Scalable AI Gateway LiteLLM & Kong AI Gateway alternative.

ai-gateway ai-infrastructure gateway guardrails kong litellm llm llm-cost llm-proxy llm-strategy llmops mcp pii-detection prompt-management semantic-cache

Last synced: 24 May 2026

https://github.com/leenhawk/gproxy

gproxy is a Rust-based multi-channel LLM proxy that exposes OpenAI / Claude / Gemini-style APIs through a unified gateway, with a built-in admin console, user/key management, and request/usage auditing.

claude gemini gpt llm-proxy

Last synced: 15 Apr 2026

https://github.com/Inebrio/Routerly

Self-hosted LLM gateway that routes requests across AI providers (OpenAI, Anthropic, Gemini, Mistral, Ollama) using intelligent multi-policy scoring — including an LLM-native routing policy. Drop-in compatible: just swap the base URL. No database required, built-in cost tracking, budget enforcement and multi-tenant isolation.

ai-gateway ai-router anthropic budget-enforcement cost-tracking llm-gateway llm-proxy llm-routing multi-tenant openai-proxy self-hosted

Last synced: 22 Jun 2026

https://github.com/soapbucket/sbproxy

AI Governance Engine. One self-hostable gateway for AI traffic, APIs, MCP, and AI crawlers.

ai-gateway ai-governance anthropic api-gateway governance-engine llm-proxy load-balancer mcp openai pingora rate-limiting reverse-proxy rust waf

Last synced: 26 Jun 2026

https://github.com/bluewave-labs/langroute

This is a robust and configurable LLM proxy server built with Node.js, Express, and PostgreSQL. It acts as an intermediary between your applications and various Large Language Model (LLM) providers

llm llm-gateway llm-proxy llmproxy proxy

Last synced: 20 Jan 2026

https://github.com/sunflower0305/claude-proxy

Claude Code / Claude Agent SDK proxy for DeepSeek, Qwen, GLM, MiniMax and Kimi via Anthropic Messages API

anthropic claude claude-agent-sdk claude-code llm-proxy

Last synced: 03 Jun 2026

https://github.com/azerozero/grob

LLM proxy with built-in DLP and regulatory compliance. Redacts secrets before they reach the API. EU AI Act, GDPR, HDS/PCI DSS ready. Multi-provider failover, live TUI, virtual keys, fan-out. 6 MB, zero deps. Rust.

ai-gateway air-gapped anthropic audit-log dlp eu-ai-act failover fan-out gdpr gemini llm-proxy multi-provider ollama openai opentelemetry rust secret-scanning sovereign streaming virtual-keys

Last synced: 14 Jun 2026

https://github.com/wa91h/local-ai-toolkit

A self-hosted AI toolkit running locally via Docker Compose, bundling an LLM gateway, workflow automation, and a chat UI — all backed by a shared PostgreSQL database.

ai ai-agent docker docker-compose litellm llm llm-gateway llm-proxy local-llm n8n ollama openwebui self-hosted workflow

Last synced: 18 May 2026

https://github.com/peva3/smarterrouter

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

ai-cache ai-gateway docker fastapi gpu-monitoring llm llm-proxy llm-router local-llm model-serving ollama ollama-api openai-proxy self-hosted self-hosted-ai semantic-cache

Last synced: 27 Feb 2026

https://github.com/tokligence/tokligence-gateway

Go LLM gateway — one interface for Claude Code, Codex, Gemini CLI, Anthropic, OpenAI, Qwen, and vLLM.

ai-gateway llm-proxy model-router openai-compatible-proxy-server token-tracking

Last synced: 01 Mar 2026

https://github.com/aikazu/kelola-router

Local-first API router for MiniMax + Kiro (AWS CodeWhisperer / Amazon Q) upstreams — OpenAI & Anthropic compatible, multi-account fallback, switchable Kiro IDE/CLI persona, RTK + Caveman compression, built-in dashboard (Hono + SQLite + Preact)

amazon-q anthropic api-router aws-codewhisperer hono kiro llm-proxy minimax openai preact sqlite typescript

Last synced: 03 Jul 2026

https://github.com/riipandi/radium

[WIP] Radium is an open-source LLM proxy gateway built for resource efficiency and high performance.

ai ai-gateway antrophic gateway llm llm-gateway llm-proxy llmops mcp-gateway openai openai-compatible openai-proxy

Last synced: 15 May 2026

https://github.com/xk1ko/aigloo

Self-hosted AI gateway. Route, translate and track requests across providers, with access keys, budgets, and a built-in dashboard.

ai-gateway anthropic claude claude-code cline codex cursor dashboard deepseek fallback gemini llm llm-gateway llm-proxy openai openai-proxy opencode qwen token-saver

Last synced: 01 Jul 2026

https://github.com/pmbstyle/openrouter-proxy

Nodejs OpenRouter proxy inference that provides all nessary endpoints for your LLM application.

llm-inference llm-proxy llm-webservice nodejs openrouter-api

Last synced: 16 Apr 2026

https://github.com/antkawam/claude-code-aws-gateway

Self-hosted API gateway for Claude Code on Amazon Bedrock. Team management, virtual API keys, per-user budgets, OIDC SSO, web search, and an admin portal.

amazon-bedrock anthropic api-gateway api-proxy aws-cdk bedrock-runtime budget-management claude claude-code developer-tools docker ecs-fargate graviton llm-proxy oidc rust self-hosted sso team-management web-search

Last synced: 01 Apr 2026

https://github.com/kianwoon/modelweaver

Multi-provider model orchestration proxy for Claude Code. Route agent roles (planning, coding, research) to different LLM providers with automatic fallback, daemon mode, desktop GUI, config hot-reload, and crash recovery.

ai-agents anthropic api-proxy claude claude-code desktop-gui developer-tools fallback hono hot-reload llm llm-proxy model-routing multi-provider openrouter proxy rate-limiting sse tauri typescript

Last synced: 15 Apr 2026

https://github.com/pysugar/oauth-llm-nexus

The universal headless bridge for OAuth-authenticated LLM services.

antigravity claude-code llm-proxy oauth openai-api reverse-proxy

Last synced: 11 Feb 2026

https://github.com/llimona-org/llimona

Llimona is an open and modular Python framework for building production-ready LLM gateways

asyncio llm llm-gateway llm-proxy llm-tools python

Last synced: 25 Apr 2026

https://github.com/kckempf/yallmap

An OpenTelemetry-instrumented gateway for Anthropic-compatible LLMs

ai-gateway anthropic claude claude-code langfuse llm-gateway llm-observability llm-proxy ollama opentelemetry otel typescript

Last synced: 09 Jun 2026

https://github.com/mostlydev/cllama

The blood-brain barrier for autonomous agents. A context-aware LLM governance proxy that enforces credential starvation — identity-verified, provider-routed, cost-tracked, and audit-logged.

ai-agents inference-api inference-gateway llm llm-inference llm-proxy

Last synced: 10 May 2026

https://github.com/syndicalt/provara

Intelligent multi-provider LLM gateway with adaptive routing, A/B testing, and cost optimization. Self-host it or use the managed SaaS.

ai-gateway automation cost-optimization llm llm-proxy observability self-hosted

Last synced: 11 Jun 2026

https://github.com/nullata/llamaman

A browser-based UI for launching, monitoring, and managing multiple llama.cpp server instances from inside a Docker container. Includes an Ollama-compatible API proxy

frontend llamacpp llm llm-inference llm-infrastructure llm-manager llm-proxy proxy rest-api

Last synced: 02 Apr 2026

https://github.com/1mb-dev/shim

HTTP proxy: run Claude Code against OpenAI-compatible providers (DeepSeek/OpenAI/OpenRouter/Ollama) or pass through to Anthropic, with built-in request measurement.

anthropic claude-code deepseek go llm-proxy ollama openai openrouter prometheus

Last synced: 10 Jun 2026

https://github.com/rawcontext/reflex

Episodic memory and semantic cache proxy for LLM APIs with ~40% token savings

agent-orchestration ai-agents context-graph developer-tools knowledge-graph llm-proxy semantic-cache token-optimization

Last synced: 11 Jan 2026

https://github.com/vivian254338489/tken-fastapi-ai-gateway-starter

FastAPI OpenAI-compatible AI gateway starter with custom base_url, Docker, mock mode, and portable provider examples.

ai-gateway api-proxy base-url chatgpt-api cheap-ai-api developer-tools docker fastapi llm-proxy model-routing openai-api openai-compatible python tken

Last synced: 27 Jun 2026

https://github.com/gitstq/ahg-ai-gateway

蓝鹰AI网关 BlueEagle - 全球顶尖大模型统一API网关 | 0.09x倍率 | 1:1充值 | GPT-4o/Claude-4/Gemini-2.5 | OpenAI兼容 | 免费测试额度

ai-gateway ai-models anthropic api-proxy chatgpt claude-4 claude-api deepseek gemini-api gpt-4 gpt-4o llm llm-proxy openai openai-api

Last synced: 21 Jun 2026

https://github.com/b-macker/naab-passage

🔒 Sovereign data gateway & PII protection - Zero leakage to LLMs and APIs with self-synthesizing architecture. HIPAA/GDPR compliant. Part of the NAAb Ecosystem.

anthropic api-gateway audit compliance data-privacy data-protection encryption gateway gdpr hipaa llm-proxy naab naab-ecosystem openai pii-protection polyglot privacy security soc2 zero-trust

Last synced: 06 Mar 2026

https://github.com/navneetlal/ai-gateway

Unified API gateway for LLM providers. Route requests to OpenAI, Anthropic, and more through a single OpenAI-compatible interface.

ai ai-gateway anthropic api-gateway fastify llm llm-gateway llm-proxy openai openai-api typescript

Last synced: 27 Feb 2026

https://github.com/chicogong/stream-relay-go

A lightweight Go streaming relay for LLM/TTS APIs with production-grade observability and policy controls

anthropic api-gateway docker gin golang grafana llm llm-proxy observability openai prometheus proxy rate-limiting siliconflow sse streaming tts

Last synced: 08 Feb 2026

https://github.com/study8677/llm-router

自托管 OpenAI-compatible AI Gateway:用 auto / auto-coding / auto-longtext 自动选择合适模型,支持流式、工具调用、多模态透传和 fallback。

ai-gateway ai-router auto-model developer-tools docker function-calling llm-gateway llm-proxy llm-router model-routing multimodal nodejs openai-api openai-compatible self-hosted streaming typescript

Last synced: 04 Jun 2026

https://github.com/lab34-es/llm-proxy

LLM proxy written in go with usage & guard rails support.

go llm llm-proxy

Last synced: 20 Apr 2026

https://github.com/tessera-llm/tessera-sdk

Drop-in LLM cost-optimization proxy. Auto-route + cache + compress + batch. Flat monthly pricing by token volume, keep 100% of savings. Free 60M tokens/mo.

ai-cost ai-proxy anthropic apache-2 claude cohere cost-optimization gemini gpt-4o groq llm llm-cost llm-proxy mistral openai python sdk tessera typescript

Last synced: 01 Jun 2026