Projects in Awesome Lists tagged with llm-proxy
A curated list of projects in awesome lists tagged with llm-proxy .
https://github.com/dwgx/WindsurfAPI
Windsurf OpenAI-compatible and Anthropic-compatible LLM API proxy
anthropic-compatible api-proxy claude-code cline cursor docker llm-gateway llm-proxy openai-compatible reverse-engineering sse windsurf
Last synced: 26 Jun 2026
https://github.com/romgX/openrelay
几百个免费 AI 模型配额,一键接入本地项目。| Hundreds of free AI model quotas, one-click access to local projects.
ai ai-proxy aider cerebras claude claude-code copilot cursor developer-tools free-ai free-api groq kiro llm-proxy model-router openai openclaw proxy windsurf
Last synced: 26 Jun 2026
https://github.com/liaohch3/claude-tap
Intercept and inspect Coding Agent API traffic from Claude Code, Codex CLI, Gemini CLI, Cursor CLI, OpenCode, Kimi, Pi, and Hermes in a local trace viewer.
agent-debugging agent-observability ai-agents ai-tools api-debugging claude-code codex codex-cli cursor-cli developer-tools gemini-cli hermes-agent kimi llm llm-proxy opencode pi-coding-agent proxy trace trace-viewer
Last synced: 26 May 2026
https://github.com/toby-bridges/api-relay-audit
Local security audit for AI API relays and LLM proxies: detects prompt injection, model substitution, tool-call rewriting, SSE anomalies, error leakage, and Web3 wallet risks.
ai-agents ai-audit ai-security anthropic api-gateway claude cli llm-audit llm-proxy llm-security model-substitution openai-api prompt-injection python security-audit security-scanner supply-chain-security tool-call-rewriting web3-security web3-wallet
Last synced: 21 Jun 2026
https://github.com/starbaser/ccproxy
Build mods for Claude Code: Hook any request, modify any response, /model "with-your-custom-model", intelligent model routing using your logic or ours
ai ai-gateway ai-proxy ai-tools anthropic claude claude-ai claude-api claude-code claude-max claudecode gemini gemini-cli litellm llm llm-gateway llm-proxy llmops openai openrouter
Last synced: 09 Jun 2026
https://github.com/romgx/openrelay
几百个免费 AI 模型配额,一键接入本地项目。| Hundreds of free AI model quotas, one-click access to local projects.
ai ai-proxy aider cerebras claude claude-code copilot cursor developer-tools free-ai free-api groq kiro llm-proxy model-router openai openclaw proxy windsurf
Last synced: 09 May 2026
https://github.com/LeenHawk/gproxy
gproxy is a Rust-based multi-channel LLM proxy that exposes OpenAI / Claude / Gemini-style APIs through a unified gateway, with a built-in admin console, user/key management, and request/usage auditing.
Last synced: 26 Jun 2026
https://github.com/guanxiaol/WindsurfPoolAPI
Multi-account pool proxy for Windsurf — 113+ models (Claude/GPT/Gemini/Grok/Kimi) via OpenAI & Anthropic APIs, image upload, Cursor & Claude Code native / 企业级 Windsurf 多账号池化 API 代理
ai-proxy anthropic anthropic-api api claude claude-code codeium cursor gemini gpt language-server llm-proxy multi-account multimodal nodejs openai openai-api pool proxy windsurf
Last synced: 26 Jun 2026
https://github.com/nayjest/lm-proxy
OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.
ai anthropic api-proxy fastapi google-ai language-models llm llm-api llm-gateway llm-inference llm-proxy openai openai-api proxy proxy-server pyton
Last synced: 18 Jun 2026
https://github.com/guanxiaol/windsurfpoolapi
Multi-account pool proxy for Windsurf — 113+ models (Claude/GPT/Gemini/Grok/Kimi) via OpenAI & Anthropic APIs, image upload, Cursor & Claude Code native / 企业级 Windsurf 多账号池化 API 代理
ai-proxy anthropic anthropic-api api claude claude-code codeium cursor gemini gpt language-server llm-proxy multi-account multimodal nodejs openai openai-api pool proxy windsurf
Last synced: 27 Apr 2026
https://github.com/peva3/SmarterRouter
SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
ai-cache ai-gateway docker fastapi gpu-monitoring llm llm-proxy llm-router local-llm model-serving ollama ollama-api openai-proxy self-hosted self-hosted-ai semantic-cache
Last synced: 26 Jun 2026
https://github.com/Nayjest/lm-proxy
OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.
ai anthropic api-proxy fastapi google-ai language-models llm llm-api llm-gateway llm-inference llm-proxy openai openai-api proxy proxy-server pyton
Last synced: 09 Jun 2026
https://github.com/open-bias/open-bias
Open Source Reliability Harness: Make your agents follow rules. One line of code to enforce, trace, and improve.
agentic-ai ai-audit ai-compliance ai-firewall ai-governance ai-guardrails ai-policy ai-safety ai-security content-safety guardrails llm-guardrails llm-monitoring llm-proxy llm-safety llm-security policy-engine prompt-injection responsible-ai rule-engine
Last synced: 26 Jun 2026
https://github.com/ferro-labs/ai-gateway
Unified AI Gateway for 30+ LLMs (OpenAI, Anthropic, Bedrock, Azure etc) with Caching, Guardrails, A/B test & cost controls. Go-native Fastest & Scalable AI Gateway LiteLLM & Kong AI Gateway alternative.
ai-gateway ai-infrastructure gateway guardrails kong litellm llm llm-cost llm-proxy llm-strategy llmops mcp pii-detection prompt-management semantic-cache
Last synced: 24 May 2026
https://github.com/leenhawk/gproxy
gproxy is a Rust-based multi-channel LLM proxy that exposes OpenAI / Claude / Gemini-style APIs through a unified gateway, with a built-in admin console, user/key management, and request/usage auditing.
Last synced: 15 Apr 2026
https://github.com/Inebrio/Routerly
Self-hosted LLM gateway that routes requests across AI providers (OpenAI, Anthropic, Gemini, Mistral, Ollama) using intelligent multi-policy scoring — including an LLM-native routing policy. Drop-in compatible: just swap the base URL. No database required, built-in cost tracking, budget enforcement and multi-tenant isolation.
ai-gateway ai-router anthropic budget-enforcement cost-tracking llm-gateway llm-proxy llm-routing multi-tenant openai-proxy self-hosted
Last synced: 22 Jun 2026
https://github.com/soapbucket/sbproxy
AI Governance Engine. One self-hostable gateway for AI traffic, APIs, MCP, and AI crawlers.
ai-gateway ai-governance anthropic api-gateway governance-engine llm-proxy load-balancer mcp openai pingora rate-limiting reverse-proxy rust waf
Last synced: 26 Jun 2026
https://github.com/omarluq/cc-relay
⚡️ Blazing fast LLMs API Gateway written in Go
anthropic bedrock claude claude-ai claude-api claude-code gemini gemini-api glm-4-7 kimi-k2 llm-api llm-gat llm-gateway-system llm-proxy mistral-ai ollama openai openai-api vertex-ai zai
Last synced: 15 Feb 2026
https://github.com/bluewave-labs/langroute
This is a robust and configurable LLM proxy server built with Node.js, Express, and PostgreSQL. It acts as an intermediary between your applications and various Large Language Model (LLM) providers
llm llm-gateway llm-proxy llmproxy proxy
Last synced: 20 Jan 2026
https://github.com/sunflower0305/claude-proxy
Claude Code / Claude Agent SDK proxy for DeepSeek, Qwen, GLM, MiniMax and Kimi via Anthropic Messages API
anthropic claude claude-agent-sdk claude-code llm-proxy
Last synced: 03 Jun 2026
https://github.com/azerozero/grob
LLM proxy with built-in DLP and regulatory compliance. Redacts secrets before they reach the API. EU AI Act, GDPR, HDS/PCI DSS ready. Multi-provider failover, live TUI, virtual keys, fan-out. 6 MB, zero deps. Rust.
ai-gateway air-gapped anthropic audit-log dlp eu-ai-act failover fan-out gdpr gemini llm-proxy multi-provider ollama openai opentelemetry rust secret-scanning sovereign streaming virtual-keys
Last synced: 14 Jun 2026
https://github.com/wa91h/local-ai-toolkit
A self-hosted AI toolkit running locally via Docker Compose, bundling an LLM gateway, workflow automation, and a chat UI — all backed by a shared PostgreSQL database.
ai ai-agent docker docker-compose litellm llm llm-gateway llm-proxy local-llm n8n ollama openwebui self-hosted workflow
Last synced: 18 May 2026
https://github.com/peva3/smarterrouter
SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
ai-cache ai-gateway docker fastapi gpu-monitoring llm llm-proxy llm-router local-llm model-serving ollama ollama-api openai-proxy self-hosted self-hosted-ai semantic-cache
Last synced: 27 Feb 2026
https://github.com/tokligence/tokligence-gateway
Go LLM gateway — one interface for Claude Code, Codex, Gemini CLI, Anthropic, OpenAI, Qwen, and vLLM.
ai-gateway llm-proxy model-router openai-compatible-proxy-server token-tracking
Last synced: 01 Mar 2026
https://github.com/aikazu/kelola-router
Local-first API router for MiniMax + Kiro (AWS CodeWhisperer / Amazon Q) upstreams — OpenAI & Anthropic compatible, multi-account fallback, switchable Kiro IDE/CLI persona, RTK + Caveman compression, built-in dashboard (Hono + SQLite + Preact)
amazon-q anthropic api-router aws-codewhisperer hono kiro llm-proxy minimax openai preact sqlite typescript
Last synced: 03 Jul 2026
https://github.com/riipandi/radium
[WIP] Radium is an open-source LLM proxy gateway built for resource efficiency and high performance.
ai ai-gateway antrophic gateway llm llm-gateway llm-proxy llmops mcp-gateway openai openai-compatible openai-proxy
Last synced: 15 May 2026
https://github.com/xk1ko/aigloo
Self-hosted AI gateway. Route, translate and track requests across providers, with access keys, budgets, and a built-in dashboard.
ai-gateway anthropic claude claude-code cline codex cursor dashboard deepseek fallback gemini llm llm-gateway llm-proxy openai openai-proxy opencode qwen token-saver
Last synced: 01 Jul 2026
https://github.com/pmbstyle/openrouter-proxy
Nodejs OpenRouter proxy inference that provides all nessary endpoints for your LLM application.
llm-inference llm-proxy llm-webservice nodejs openrouter-api
Last synced: 16 Apr 2026
https://github.com/antkawam/claude-code-aws-gateway
Self-hosted API gateway for Claude Code on Amazon Bedrock. Team management, virtual API keys, per-user budgets, OIDC SSO, web search, and an admin portal.
amazon-bedrock anthropic api-gateway api-proxy aws-cdk bedrock-runtime budget-management claude claude-code developer-tools docker ecs-fargate graviton llm-proxy oidc rust self-hosted sso team-management web-search
Last synced: 01 Apr 2026
https://github.com/kianwoon/modelweaver
Multi-provider model orchestration proxy for Claude Code. Route agent roles (planning, coding, research) to different LLM providers with automatic fallback, daemon mode, desktop GUI, config hot-reload, and crash recovery.
ai-agents anthropic api-proxy claude claude-code desktop-gui developer-tools fallback hono hot-reload llm llm-proxy model-routing multi-provider openrouter proxy rate-limiting sse tauri typescript
Last synced: 15 Apr 2026
https://github.com/pysugar/oauth-llm-nexus
The universal headless bridge for OAuth-authenticated LLM services.
antigravity claude-code llm-proxy oauth openai-api reverse-proxy
Last synced: 11 Feb 2026
https://github.com/xzxy-ai/ccg-router
Claude Code and Codex CLI local router for OpenAI-compatible and Anthropic-compatible APIs
ai-coding anthropic anthropic-api anthropic-compatible claude claude-code cli codex codex-cli developer-tools go llm-proxy llm-router local-first openai openai-api openai-compatible proxy router sqlite
Last synced: 23 May 2026
https://github.com/binbandit/claude-litellm-proxy
A proxy for claude code to use liteLLM
ai ai-development anthropic api-gateway claude claude-code docker function-calling genai hono litellm llm llm-proxy multi-provider openai-api proxy streaming tool-calling typescript
Last synced: 02 May 2026
https://github.com/llimona-org/llimona
Llimona is an open and modular Python framework for building production-ready LLM gateways
asyncio llm llm-gateway llm-proxy llm-tools python
Last synced: 25 Apr 2026
https://github.com/kckempf/yallmap
An OpenTelemetry-instrumented gateway for Anthropic-compatible LLMs
ai-gateway anthropic claude claude-code langfuse llm-gateway llm-observability llm-proxy ollama opentelemetry otel typescript
Last synced: 09 Jun 2026
https://github.com/mostlydev/cllama
The blood-brain barrier for autonomous agents. A context-aware LLM governance proxy that enforces credential starvation — identity-verified, provider-routed, cost-tracked, and audit-logged.
ai-agents inference-api inference-gateway llm llm-inference llm-proxy
Last synced: 10 May 2026
https://github.com/syndicalt/provara
Intelligent multi-provider LLM gateway with adaptive routing, A/B testing, and cost optimization. Self-host it or use the managed SaaS.
ai-gateway automation cost-optimization llm llm-proxy observability self-hosted
Last synced: 11 Jun 2026
https://github.com/nullata/llamaman
A browser-based UI for launching, monitoring, and managing multiple llama.cpp server instances from inside a Docker container. Includes an Ollama-compatible API proxy
frontend llamacpp llm llm-inference llm-infrastructure llm-manager llm-proxy proxy rest-api
Last synced: 02 Apr 2026
https://github.com/1mb-dev/shim
HTTP proxy: run Claude Code against OpenAI-compatible providers (DeepSeek/OpenAI/OpenRouter/Ollama) or pass through to Anthropic, with built-in request measurement.
anthropic claude-code deepseek go llm-proxy ollama openai openrouter prometheus
Last synced: 10 Jun 2026
https://github.com/rawcontext/reflex
Episodic memory and semantic cache proxy for LLM APIs with ~40% token savings
agent-orchestration ai-agents context-graph developer-tools knowledge-graph llm-proxy semantic-cache token-optimization
Last synced: 11 Jan 2026
https://github.com/vivian254338489/tken-fastapi-ai-gateway-starter
FastAPI OpenAI-compatible AI gateway starter with custom base_url, Docker, mock mode, and portable provider examples.
ai-gateway api-proxy base-url chatgpt-api cheap-ai-api developer-tools docker fastapi llm-proxy model-routing openai-api openai-compatible python tken
Last synced: 27 Jun 2026
https://github.com/gitstq/ahg-ai-gateway
蓝鹰AI网关 BlueEagle - 全球顶尖大模型统一API网关 | 0.09x倍率 | 1:1充值 | GPT-4o/Claude-4/Gemini-2.5 | OpenAI兼容 | 免费测试额度
ai-gateway ai-models anthropic api-proxy chatgpt claude-4 claude-api deepseek gemini-api gpt-4 gpt-4o llm llm-proxy openai openai-api
Last synced: 21 Jun 2026
https://github.com/b-macker/naab-passage
🔒 Sovereign data gateway & PII protection - Zero leakage to LLMs and APIs with self-synthesizing architecture. HIPAA/GDPR compliant. Part of the NAAb Ecosystem.
anthropic api-gateway audit compliance data-privacy data-protection encryption gateway gdpr hipaa llm-proxy naab naab-ecosystem openai pii-protection polyglot privacy security soc2 zero-trust
Last synced: 06 Mar 2026
https://github.com/navneetlal/ai-gateway
Unified API gateway for LLM providers. Route requests to OpenAI, Anthropic, and more through a single OpenAI-compatible interface.
ai ai-gateway anthropic api-gateway fastify llm llm-gateway llm-proxy openai openai-api typescript
Last synced: 27 Feb 2026
https://github.com/chicogong/stream-relay-go
A lightweight Go streaming relay for LLM/TTS APIs with production-grade observability and policy controls
anthropic api-gateway docker gin golang grafana llm llm-proxy observability openai prometheus proxy rate-limiting siliconflow sse streaming tts
Last synced: 08 Feb 2026
https://github.com/study8677/llm-router
自托管 OpenAI-compatible AI Gateway:用 auto / auto-coding / auto-longtext 自动选择合适模型,支持流式、工具调用、多模态透传和 fallback。
ai-gateway ai-router auto-model developer-tools docker function-calling llm-gateway llm-proxy llm-router model-routing multimodal nodejs openai-api openai-compatible self-hosted streaming typescript
Last synced: 04 Jun 2026
https://github.com/lab34-es/llm-proxy
LLM proxy written in go with usage & guard rails support.
Last synced: 20 Apr 2026
https://github.com/tessera-llm/tessera-sdk
Drop-in LLM cost-optimization proxy. Auto-route + cache + compress + batch. Flat monthly pricing by token volume, keep 100% of savings. Free 60M tokens/mo.
ai-cost ai-proxy anthropic apache-2 claude cohere cost-optimization gemini gpt-4o groq llm llm-cost llm-proxy mistral openai python sdk tessera typescript
Last synced: 01 Jun 2026