An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with token-optimization

A curated list of projects in awesome lists tagged with token-optimization.

https://github.com/rtk-ai/rtk

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

agentic-coding ai-coding anthropic claude-code cli command-line-tool cost-reduction developer-tools llm open-source productivity rust token-optimization

Last synced: 29 Apr 2026

https://github.com/open-compress/claw-compactor

🦞 LLM Token Compression & Reduction Tool - Cut AI agent token costs by up to 97%. 6-layer deterministic context compression for AI agent workspaces. No LLM required. Prompt compression, context window optimization & cost reduction for any LLM pipeline.

ai-agent-tools ai-cost-saving ai-infrastructure claw-compactor context-compression context-pruning context-window-optimization developer-tools llm-compression llm-context-compression llm-cost-reduction llm-token-compression llm-tools openclaw prompt-compression python-tools token-compression token-optimization token-reduction token-saving

Last synced: 01 Apr 2026

https://github.com/yvgude/lean-ctx

The Context OS for AI Development. Reduce token waste in Cursor, Claude Code, Copilot, Windsurf, Codex, Gemini & more by 60–95% (up to 99% on cached reads). Shell Hook + MCP Server · 49 tools · 10 read modes · 90+ patterns · Single Rust binary

agentic-coding ai ai-coding claude-code context-engineering copilot cursor developer-tools gemini-cli llm mcp mcp-server reduce-token-costs rust token-optimization

Last synced: 05 Jun 2026

https://github.com/GMaN1911/claude-cognitive

Working memory for Claude Code - persistent context and multi-instance coordination

claude-ai claude-code context-management developer-tools productivity token-optimization

Last synced: 15 Jan 2026

https://github.com/Lap-Platform/LAP

Your agents are guessing at APIs. Give them the actual agent-native spec. 1500+ APIs as ready-to-use skills. Compile any API spec into a lean, agent-native format. 10× smaller. OpenAPI, GraphQL, AsyncAPI, Protobuf, Postman.

agent-experience ai ai-agents api api-compression api-spec asyncapi claude cli developer-tools graphql llm mcp open-source openapi postman protobuf python sdk token-optimization

Last synced: 20 Apr 2026

https://github.com/clay-good/openlore

openlore provides persistent architectural memory for AI coding agents by turning codebases into queryable knowledge graphs featuring static analysis, living specs, automated drift detection, and graph-native MCP tools to eliminate context decay and drastically slash orientation token costs.

adr agentic-workflows ai-agents ai-coding call-graph codebase-analysis context-management developer-tools devtools drift-detection knowledge-graph living-documentation llm-tools mcp mcp-server model-context-protocol openspec software-architecture static-analysis token-optimization

Last synced: 10 Jun 2026

https://github.com/mpecan/tokf

Config-driven CLI tool that compresses command output before it reaches an LLM context

ai-tools claude-code cli command-line context-window developer-tools homebrew llm output-filter rust token-optimization toml

Last synced: 12 May 2026

https://github.com/mischasigtermans/laravel-toon

TOON encoding for Laravel. Encode data for AI/LLMs with 40-60% fewer tokens than JSON.

ai json-alternative laravel llm mcp token-optimization token-optimizer toon

Last synced: 13 Jan 2026

https://github.com/avilum/minrlm

Stop forcing LLMs to answer in one pass. Give them a runtime. Recursive Language Model that improves any LLM, while reducing token usage up to 4X.

agent ai-agents cost-optimization latency-optimization llm llm-inference llmops recursive-language-model rlm token-optimization

Last synced: 07 Apr 2026

https://github.com/sbsaga/toon

TOON - Laravel AI package for a compact, human-readable, token-efficient data format with JSON ⇄ TOON conversion for ChatGPT, OpenAI, and other LLM prompts.

ai ai-library api-optimization chatgpt compact-format data-compression developer-tool human-readable json laravel laravel-package large-language-models llm openai php php-library prompt-engineering serialization token-optimization toon

Last synced: 02 Mar 2026

https://github.com/hunhee98/pluck

MCP-native code retrieval for AI agents - 84-88% fewer read tokens, BM25F + semantic search, AST chunks, session dedup

ai-agents bm25 claude-code cli code-intelligence code-search codex context-window developer-tools embeddings llm mcp mcp-server rag ripgrep rust semantic-search tantivy token-optimization tree-sitter

Last synced: 01 Jun 2026

https://github.com/dean0x/skim

The most intelligent context optimization engine for coding agents. Code-aware AST parsing across 17 languages. Command rewriting. Test, build, and git output compression. Token budget cascading. Built in Rust.

ai claude-code cli code-reader developer-tools llm rust token-optimization tree-sitter

Last synced: 01 May 2026

https://github.com/rjkaes/trueline-mcp

Smarter reads, safer edits. An MCP plugin that cuts token usage and catches editing mistakes before they hit disk. Supports Claude Code, Gemini CLI, GitHub Copilot, and Codex.

agentic-coding ai-coding anthropic ast claude-code developer-tools gemini-cli github-copilot llm mcp mcp-server model-context-protocol multi-agent openai-codex token-optimization tree-sitter typescript

Last synced: 12 Apr 2026

https://github.com/coalesce-labs/catalyst

Token-efficient Claude Code workspace with parallel agents and persistent memory. Research → Plan → Implement → Validate workflow.

agent-memory agentic-coding ai-agents ai-coding claude-code-commands claude-code-plugin claude-code-plugins-marketplace claude-code-subagents context-engineering token-optimization

Last synced: 29 May 2026

https://github.com/tingjiainfuture/pixrep

Let LLMs see your codebase just like you do.

context-window llm multimodal pdf-generation token-optimization

Last synced: 07 Mar 2026

https://github.com/iagocavalcante/claude-turbo-search

Optimized file search and semantic indexing for large codebases in Claude Code

claude claude-code cli developer-tools fzf hooks qmd ripgrep semantic-search token-optimization

Last synced: 16 Feb 2026

https://github.com/agusrdz/chop

CLI output compressor for Claude Code. Reduces token consumption by 50–90% by compressing verbose command output before it enters the context window. Supports 52+ commands - git, docker, kubectl, npm, terraform, and more.

claude claude-ai claude-code cli context-window developer-tools golang llm productivity token-optimization

Last synced: 03 Apr 2026

https://github.com/dhanushkumarsivaji/kerf-cli

Cost intelligence for Claude Code. Real-time dashboards, pre-flight estimation, budgets, and ghost token auditing.

anthropic claude claude-code cli cost-intelligence developer-tools token-optimization typescript

Last synced: 02 Jun 2026

https://github.com/kalpeshgamit/codebase-pilot

AI context engine for Claude Code, Cursor, Windsurf - pack, compress, and optimize any codebase. Save 60-90% tokens. Web dashboard on port 7456.

agent-orchestration ai-context-engine claude-code cli-tool code-compression code-context-engine code-review codebase-packer cursor-ai developer-tools llm-tools mcp-server nodejs security-scanner sub-agents token-optimization typescript vibe-coding windsurf

Last synced: 13 Apr 2026

https://github.com/juninmd/tokenix

Local semantic index CLI that reduces LLM token usage 60-90% -- built in Rust, runs 100% offline

ai ai-tools claude-code cli developer-tools embeddings ollama rust semantic-search token-optimization

Last synced: 01 Jun 2026

https://github.com/philipjohnbasile/callsieve

Stop paying AI coding agents to grep your repo. CallSieve is a local-first, deterministic retrieval layer that feeds agents compact context packets to cut token spend - no cloud, no API key, 20+ agents and MCP clients.

ai cli code-search coding-agents developer-tools llm local-first mcp retrieval rust token-optimization

Last synced: 08 Jun 2026

https://github.com/jackccrawford/token-scout

For OpenClaw, Hermes and more. Find free and low-cost inference (LLM models). Use them directly. Provides both a CLI and MCP server that knows which free-tier LLM APIs exist, which ones you have keys for, and which one fits your task. Returns endpoints so you can call models directly. No proxy, no middleware, no latency tax.

agent agentic-ai ai-agents claude-code free free-inference llm mcp mcp-server model-discovery model-routing ollama openclaw openrouter token-optimization

Last synced: 29 May 2026

https://github.com/marjoballabani/hypergrep

A better grep for AI agents. Structural search, call graphs, impact analysis, semantic compression. 87% fewer tokens. 16 languages. Built in Rust.

ai-agents ast call-graph claude-code cli code-intelligence code-search copilot cursor developer-tools grep llm ripgrep rust static-analysis token-optimization tree-sitter

Last synced: 05 Apr 2026

https://github.com/kbwen/agentic-os

Governance-first OS for AI coding agents - structured workflows, delivery gates, engineering guardrails, and 17 professional skills for Claude Code, Cursor, Copilot, Antigravity & Codex.

agent-framework agentic-development agents-md ai-agent ai-governance ai-guardrails ai-workflow claude-code coding-agent cursor-rules developer-tools google-antigravity llm-tools multi-agent token-optimization

Last synced: 08 Jun 2026

https://github.com/ncmonx/icemage

Token-efficient context/memory/graph CLI for AI coding agents. 70-98% cheaper. 1187/1187 tests, 40 MCP tools, Apache-2.0.

ai-agents ai-coding anthropic claude-code cli cline context-engineering cpp17 cursor developer-tools knowledge-graph llm-tools local-first mcp mcp-server prompt-engineering semantic-search sqlite token-optimization

Last synced: 06 Jun 2026

https://github.com/iamgerwin/toon-php

A lightweight, fast TOON (Token-Oriented Object Notation) library for PHP. Optimized for LLM contexts. PHP 7.0-8.0 [legacy] and 8.1 and up [modern] support.

data-format json-alternative legacy legacy-php llm php php-serialize php-toon php8 serialization token-optimization toon toon-php

Last synced: 25 Jan 2026

https://github.com/tverney/llm-proxy-babylon

Multilingual LLM proxy that optimizes non-English prompts for better quality, lower token costs, and stronger safety alignment

ai-safety aws-bedrock cost-optimization language-detection llm low-resource-languages multilingual openai-api prompt-optimization proxy token-optimization translation

Last synced: 20 Apr 2026

https://github.com/selcukgural/toonnet

High-performance .NET serialization library for TOON format - 40% fewer tokens for AI/LLM, expression tree-based (10-100x faster)

ai csharp dotnet json llm serialization source-generators token-optimization toon toon-format yaml

Last synced: 08 Feb 2026

https://github.com/castnettech/mnemosyne

LLM context compression and retrieval engine. Zero dependencies. Sub-100ms queries. 40-70% token reduction.

bm25 code-retrieval context-compression developer-tools llm open-source python tfidf token-optimization zero-dependencies

Last synced: 07 Apr 2026

https://github.com/saygex9965/-mcp-to-skill-converter

🔄 Convert MCP servers into Claude Skills with 90% context savings, optimizing token usage for efficient tool operation.

ai-tools automation claude-code claude-code-plugin developer-tools mcp mcp-converter model-context-protocol nodejs plugin-development skills token-optimization typescript

Last synced: 14 Apr 2026

https://github.com/ralphmoran/ticket-lens

Privacy-first CLI that transforms Jira tickets into AI-ready briefs. 60–80% fewer tokens. No relay. Pipe-friendly.

ai atlassian cli context-window developer-productivity developer-tools devtools jira jira-api jira-server llm nodejs prompt-engineering token-optimization

Last synced: 09 Jun 2026

https://github.com/lancekrogers/tcount

Count tokens of files and directories

ai-tools counter developer-tools llms token-optimization tokens

Last synced: 30 May 2026

https://github.com/never00miss/allan-mcp-memory-code

🧠 Knowledge Graph Memory for AI Coding Agents - Full offline mode with Docker. Integrates with Claude, Cline, Cursor, Windsurf, and more. Auto-extracts entities & relationships. No API keys required.

ai-agents claude cline coding cursor docker graphiti knowledge-graph llm mcp memory offline-first plan token token-optimization

Last synced: 26 May 2026

https://github.com/coderdayton/semantic-cache-mcp

MCP server that reduces LLM token usage by 80%+ through intelligent file caching, semantic diffs, and content-defined chunking.

caching claude embeddings llm mcp python semantic-search token-optimization

Last synced: 21 Apr 2026

https://github.com/soturine/soturail

Local-first context rails for AI coding agents: reversible terminal compression, progressive repo reading, SDD workflows, hooks, benchmarks, memory and cache-friendly payloads.

agent-hooks ai-agents cli coding-agents context-engineering developer-tools llm-tools local-first prompt-caching rust soturail spec-driven-development terminal-compression token-optimization typescript

Last synced: 26 May 2026

https://github.com/yuchen20/context-crumb

Save token usage on unstructured documents 😎. Let agents read docs, memories, and prompts in ultra-compressed mode through a tiny local model.

agent ai context-compaction context-compression skills token token-optimization

Last synced: 01 Jun 2026

https://github.com/ncmonx/icm-graph

Token-efficient context CLI for Claude Code, Cursor, Cline. Cuts AI coding costs 70-90% via context packs, output filters, local memory + receipts. 40 MCP tools, 122/122 tests, Apache-2.0.

ai-agents ai-coding anthropic claude-code cli cline context-engineering cpp17 cursor developer-tools knowledge-graph llm-tools local-first mcp mcp-server prompt-engineering semantic-search sqlite token-optimization

Last synced: 29 May 2026

https://github.com/mikkoparkkola/ultracos

Lossless, on-device token-cost reduction for Claude Code and LLM coding agents. Free plugin: compresses tool-result output, dedups context, compacts the system prompt - stacks on Anthropic prompt caching. Rust hot path, Python fallback, fail-open. PolyForm Noncommercial.

agentic ai-agents anthropic claude claude-code context-compression cost-optimization developer-tools llm llm-tools mcp prompt-compression rust token-compression token-optimization

Last synced: 02 Jun 2026

https://github.com/blackwell-systems/gcf

GCF: token-optimized wire format for LLM tool responses. 84% fewer tokens than JSON, 34% fewer than TOON, 100% comprehension accuracy at scale.

ai-agents code-intelligence context-window format gcf graph llm mcp model-context-protocol specification token-optimization wire-format

Last synced: 06 Jun 2026

https://github.com/keradd/crux

Compression Runtime for Universal eXecution - token-optimization runtime for AI coding agents (Rust, single binary, 10 layers, local-first)

ai-agents cli llm mcp rust sqlite token-optimization tree-sitter

Last synced: 03 May 2026

https://github.com/darkiceinteractive/mcp-conductor

97% fewer tokens. Parallel MCP execution through a sandboxed Deno runtime.

ai-tools claude deno mcp mcp-server token-optimization typescript

Last synced: 06 May 2026

https://github.com/fnrhombus/claude-code-pathfix

Claude Code hook that transparently converts Windows paths to POSIX in Bash commands - eliminating retry loops and saving tokens

ai-coding claude-code claude-code-plugin developer-tools git-bash hooks path-normalization token-optimization windows

Last synced: 06 May 2026

https://github.com/nasirus/pytk-ai

Lightweight, zero-dependency Python library that filters shell command output for LLM token efficiency

ai-agents ai-coding cli developer-tools devtools llm output-filtering python shell token-optimization

Last synced: 06 Apr 2026

https://github.com/lugondev/format-json-llm

Convert JSON ↔ TOON and generate JSON Schema + compact TOON schema for LLM prompts. Token-efficient, with live token comparison.

converter json json-schema llm prompt-engineering schema-generator token-optimization toon vanilla-js vite

Last synced: 10 Jun 2026

https://github.com/developerjillur/nexalance-claude-code-kit

AI Development Operating System for Claude Code - v4.4 LITE+: ~70% fewer tokens vs v4.3, lazy-loaded playbooks, tier-aware Phase 0, risk-tiered review, MemPalace reliability fixes, Graphify integration, hooks-based enforcement

ai-coding ai-development anthropic claude claude-code claudemd developer-tools mempalace plugins token-optimization

Last synced: 29 May 2026

https://github.com/nadimtuhin/claude-token-optimizer

Reusable setup prompts for optimizing Claude Code documentation. Achieve 90% token savings on any project in 5 minutes.

ai-assistant automation claude-code developer-tools documentation setup-template token-optimization

Last synced: 13 May 2026

https://github.com/CodeShuX/tokenwise

Cut Claude Code spend without sacrificing quality - and prove it. Haiku/Sonnet/Opus router with real $-saved numbers, not vibes.

ai-cost-optimization anthropic claude claude-code claude-skill cost-reduction developer-tools haiku llm-router model-routing opus productivity sonnet subagents token-optimization

Last synced: 29 May 2026

https://github.com/ilhan-monke/three-tier-ai-context

Hierarchical session tracking system for AI assistants that reduces token usage by 60-80%

ai ai-agents ai-context claude-code developer-tools documentation productivity session-tracking templates token-optimization

Last synced: 07 Jan 2026

https://github.com/sravan27/money-27-proof

Free AI agent cost-leak scanner + 48-hour private repo audit for Claude Code, Cursor & Codex teams (report, CI gate, fix plan). Method open-source & benchmarked. Plus AI automation rescue sprints.

agentic-coding ai-agents ai-automation ai-coding-agents automation claude-code cost-optimization developer-tools github-pages gohighlevel n8n polar repo-audit retell-ai token-optimization vapi voice-ai workflow-automation

Last synced: 31 May 2026

https://github.com/ignaciocolussi/simple_toon

Python parser and serializer for TOON (Token-Oriented Object Notation) - Reduce LLM token usage by 30-60%

data-format json llm parser python token-optimization toon

Last synced: 13 Jan 2026

https://github.com/ait88/claude-workflow-toolkit

Reusable workflow optimization toolkit for Claude Code agents.

agentic-workflow claude-code claude-skills gh-cli token-optimization

Last synced: 13 Jan 2026

https://github.com/fujiba/pdf-chunker

LLM-friendly PDF splitter & image optimizer. Chunk PDFs by size and downsample images for RAG/Bedrock.

aws-bedrock chunking claude cli image-optimization llm pdf pdf-chunker pdf-processing pdf-splitting python rag token-optimization

Last synced: 13 Jan 2026

https://github.com/yang1bai/claw-tsaver

A token-saving MCP proxy for OpenClaw users. Cuts tool call payloads by 90%+ via lazy expansion.

mcp openai python token-optimization

Last synced: 01 Jun 2026

https://github.com/digital-threads/token-pilot

Save 60-80% tokens when AI reads code - MCP server for token-efficient code navigation with AST-aware structural reading

ai-coding ast claude claude-code code-navigation context-window cursor developer-tools llm mcp mcp-server model-context-protocol token-optimization tree-sitter

Last synced: 18 Apr 2026

https://github.com/rawcontext/reflex

Episodic memory and semantic cache proxy for LLM APIs with ~40% token savings

agent-orchestration ai-agents context-graph developer-tools knowledge-graph llm-proxy semantic-cache token-optimization

Last synced: 11 Jan 2026

https://github.com/preflight-dev/preflight

✈️ 24-tool MCP server for Claude Code: preflight checks for your prompts, cross-service context, session history search with LanceDB vectors, correction pattern learning, cost estimation

ai-coding ai-tools anthropic claude claude-code code-quality cost-estimation developer-tools devtools lancedb mcp mcp-server model-context-protocol preflight prompt-engineering prompt-quality semantic-search token-optimization typescript vector-search

Last synced: 02 Apr 2026

https://github.com/chokmah-me/claude-code-playbook

Practitioner's Playbook for Claude Code: Configuration for Token-Efficient AI Engineering

ai-development claude-ai developer-tools refactoring token-optimization typescript workflows

Last synced: 04 Mar 2026

https://github.com/gobbyai/gobby-cli

Rust CLI tools for AI-assisted development. AST-aware code search with tree-sitter + FTS5, and a YAML-configurable output compressor with 28 built-in pipelines. >90% token savings. Part of Gobby, works standalone.

ai-agents ai-coding-assistant ai-pair-programming ast claude-code cli code-search dependency-graph developer-tools fts5 gobby llm neo4j output-compaction rust sqlite symbol-navigation token-optimization tree-sitter yaml

Last synced: 06 Jun 2026

https://github.com/boygotflames/promptus-dsl

A Rust-based compiler for the .llm prompt format. Stop wasting tokens on Markdown and start treating your prompts like code. Features deterministic AST parsing, CI bench regression, and an 8.5% average reduction in token bloat.

ai-agents ai-agents-cli compiler llm parser prompt-engineering rust token-optimization

Last synced: 24 May 2026

https://github.com/cablate/claude-code-research

Independent research on Claude Code internals, Claude Agent SDK, and related tooling.

claude-agent-sdk claude-code mcp prompt-caching research reverse-engineering system-prompt token-optimization

Last synced: 03 Apr 2026

https://github.com/joshuaswarren/openclaw-tactician

Intelligent model routing for OpenClaw with quota prediction, task classification, and automatic optimization

ai-agent cost-optimization llm-routing mlx model-routing ollama openclaw openclaw-plugin quota-management token-optimization

Last synced: 19 Feb 2026

https://github.com/mupozg823/codelens-mcp-plugin

Rust MCP server for bounded code intelligence, gated mutation, and auditable agent workflows.

agent-harness ai-tools claude-code code-intelligence codex cursor developer-tools harness lsp mcp mcp-server rust semantic-search token-optimization tree-sitter

Last synced: 02 May 2026

https://github.com/mukundakatta/tokenwise

Token usage optimization toolkit - count, estimate costs, optimize prompts, track budgets across LLM providers

ai llm machine-learning officethree open-source python token-optimization tokens

Last synced: 23 Apr 2026

https://github.com/tokenpak/tokenpak

Drop-in HTTP proxy that compresses LLM context, optimizes cache hits, routes smart, and tracks every dollar. Zero SDK changes required.

ai anthropic compression context-window cost-tracking developer-tools gemini llm openai proxy python token-optimization

Last synced: 09 May 2026