Projects in Awesome Lists tagged with token-optimization
A curated list of projects in awesome lists tagged with token-optimization .
https://github.com/rtk-ai/rtk
CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies
agentic-coding ai-coding anthropic claude-code cli command-line-tool cost-reduction developer-tools llm open-source productivity rust token-optimization
Last synced: 29 Apr 2026
https://github.com/chopratejas/headroom
The Context Optimization Layer for LLM Applications
agent ai anthropic compression context-engineering context-window fastapi langchain llm mcp openai proxy python rag token-optimization
Last synced: 14 May 2026
https://github.com/open-compress/claw-compactor
๐ฆ LLM Token Compression & Reduction Tool โ Cut AI agent token costs by up to 97%. 6-layer deterministic context compression for AI agent workspaces. No LLM required. Prompt compression, context window optimization & cost reduction for any LLM pipeline.
ai-agent-tools ai-cost-saving ai-infrastructure claw-compactor context-compression context-pruning context-window-optimization developer-tools llm-compression llm-context-compression llm-cost-reduction llm-token-compression llm-tools openclaw prompt-compression python-tools token-compression token-optimization token-reduction token-saving
Last synced: 01 Apr 2026
https://github.com/alexgreensh/token-optimizer
Find the ghost tokens. Fix them. Survive compaction. Avoid context quality decay.
agentskills claude-code claude-code-skill claude-plugin context-engineering context-window ghost-tokens token-optimization token-optimizer token-usage
Last synced: 30 May 2026
https://github.com/yvgude/lean-ctx
The Context OS for AI Development. Reduce token waste in Cursor, Claude Code, Copilot, Windsurf, Codex, Gemini & more by 60โ95% (up to 99% on cached reads) Shell Hook + MCP Server ยท 49 tools ยท 10 read modes ยท 90+ patterns ยท Single Rust binary
agentic-coding ai ai-coding claude-code context-engineering copilot cursor developer-tools gemini-cli llm mcp mcp-server reduce-token-costs rust token-optimization
Last synced: 05 Jun 2026
https://github.com/GMaN1911/claude-cognitive
Working memory for Claude Code - persistent context and multi-instance coordination
claude-ai claude-code context-management developer-tools productivity token-optimization
Last synced: 15 Jan 2026
https://github.com/Lap-Platform/LAP
Your agents are guessing at APIs. Give them the actual Agent-Native spec. 1500+ API's Ready To-Use skills, Compile any API spec into a lean, agent-native format. 10ร smaller. OpenAPI, GraphQL, AsyncAPI, Protobuf, Postman.
agent-experience ai ai-agents api api-compression api-spec asyncapi claude cli developer-tools graphql llm mcp open-source openapi postman protobuf python sdk token-optimization
Last synced: 20 Apr 2026
https://github.com/clay-good/openlore
openlore provides persistent architectural memory for AI coding agents by turning codebases into queryable knowledge graphs featuring static analysis, living specs, automated drift detection, and graph-native MCP tools to eliminate context decay and drastically slash orientation token costs.
adr agentic-workflows ai-agents ai-coding call-graph codebase-analysis context-management developer-tools devtools drift-detection knowledge-graph living-documentation llm-tools mcp mcp-server model-context-protocol openspec software-architecture static-analysis token-optimization
Last synced: 10 Jun 2026
https://github.com/mpecan/tokf
Config-driven CLI tool that compresses command output before it reaches an LLM context
ai-tools claude-code cli command-line context-window developer-tools homebrew llm output-filter rust token-optimization toml
Last synced: 12 May 2026
https://github.com/mischasigtermans/laravel-toon
TOON encoding for Laravel. Encode data for AI/LLMs with 40-60% fewer tokens than JSON.
ai json-alternative laravel llm mcp token-optimization token-optimizer toon
Last synced: 13 Jan 2026
https://github.com/avilum/minrlm
Stop forcing LLMs to answer in one pass. Give them a runtime. Recursive Language Model that improves any LLM, while reducing token usage up to 4X.
agent ai-agents cost-optimization latency-optimization llm llm-inference llmops recursive-language-model rlm token-optimization
Last synced: 07 Apr 2026
https://github.com/sbsaga/toon
TOON โ Laravel AI package for compact, human-readable, token-efficient data format with JSON โ TOON conversion for ChatGPT, OpenAI, and other LLM prompts.
ai ai-library api-optimization chatgpt compact-format data-compression developer-tool human-readable json laravel laravel-package large-language-models llm openai php php-library prompt-engineering serialization token-optimization toon
Last synced: 02 Mar 2026
https://github.com/hunhee98/pluck
MCP-native code retrieval for AI agents โ 84-88% fewer read tokens, BM25F + semantic search, AST chunks, session dedup
ai-agents bm25 claude-code cli code-intelligence code-search codex context-window developer-tools embeddings llm mcp mcp-server rag ripgrep rust semantic-search tantivy token-optimization tree-sitter
Last synced: 01 Jun 2026
https://github.com/dean0x/skim
The most intelligent context optimization engine for coding agents. Code-aware AST parsing across 17 languages. Command rewriting. Test, build, and git output compression. Token budget cascading. Built in Rust.
ai claude-code cli code-reader developer-tools llm rust token-optimization tree-sitter
Last synced: 01 May 2026
https://github.com/obedience-corp/festival
Organized AI Workspace | Complex Multi Phase Planning | Auditable Agent Workflows
ai ai-agents ai-tools ai-workflow ai-workflow-optimization ai-workspace autonomous-agents camp claude-code cli codex developer-tools fest festival festival-methodology goal-oriented-ai hierarchical-agents opencode project-management token-optimization
Last synced: 30 May 2026
https://github.com/rjkaes/trueline-mcp
Smarter reads, safer edits. An MCP plugin that cuts token usage and catches editing mistakes before they hit disk. Supports Claude Code, Gemini CLI, GitHub Copilot, and Codex.
agentic-coding ai-coding anthropic ast claude-code developer-tools gemini-cli github-copilot llm mcp mcp-server model-context-protocol multi-agent openai-codex token-optimization tree-sitter typescript
Last synced: 12 Apr 2026
https://github.com/coalesce-labs/catalyst
Token-efficient Claude Code workspace with parallel agents and persistent memory. Research โ Plan โ Implement โ Validate workflow.
agent-memory agentic-coding ai-agents ai-coding claude-code-commands claude-code-plugin claude-code-plugins-marketplace claude-code-subagents context-engineering token-optimization
Last synced: 29 May 2026
https://github.com/tingjiainfuture/pixrep
Let LLMs see your codebase just like you do.
context-window llm multimodal pdf-generation token-optimization
Last synced: 07 Mar 2026
https://github.com/sir-ad/nexus-prime
The Self-Evolving Agent Operating System
agentic-framework ai-agents guardrails-framework mcp-servers multi-agent-orchestration token-optimization
Last synced: 08 Apr 2026
https://github.com/clark-mackey/log-file-genius
Token-efficient log file system - reduce AI coding assistant context bloat by 93%
ai-coding-assistant ai-development augment best-practices changelog claude-code context-management cursor developer-tools devlog documentation git-workflow github-copilot markdown multi-agent productivity project-management software-documentation template token-optimization
Last synced: 15 May 2026
https://github.com/iagocavalcante/claude-turbo-search
Optimized file search and semantic indexing for large codebases in Claude Code
claude claude-code cli developer-tools fzf hooks qmd ripgrep semantic-search token-optimization
Last synced: 16 Feb 2026
https://github.com/argahsuknesib/toon-ld
Token Oriented Object Notation (TOON) for Linked Data
context-window json-ld knowledge-graph linked-data llm rag rdf rust semantic-web serialization token-optimization token-oriented-object-notation wasm
Last synced: 13 Jan 2026
https://github.com/agusrdz/chop
CLI output compressor for Claude Code. Reduces token consumption by 50โ90% by compressing verbose command output before it enters the context window. Supports 52+ commands โ git, docker, kubectl, npm, terraform, and more.
claude claude-ai claude-code cli context-window developer-tools golang llm productivity token-optimization
Last synced: 03 Apr 2026
https://github.com/dhanushkumarsivaji/kerf-cli
Cost intelligence for Claude Code. Real-time dashboards, pre-flight estimation, budgets, and ghost token auditing.
anthropic claude claude-code cli cost-intelligence developer-tools token-optimization typescript
Last synced: 02 Jun 2026
https://github.com/kalpeshgamit/codebase-pilot
AI context engine for Claude Code, Cursor, Windsurf โ pack, compress, and optimize any codebase. Save 60-90% tokens. Web dashboard on port 7456.
agent-orchestration ai-context-engine claude-code cli-tool code-compression code-context-engine code-review codebase-packer cursor-ai developer-tools llm-tools mcp-server nodejs security-scanner sub-agents token-optimization typescript vibe-coding windsurf
Last synced: 13 Apr 2026
https://github.com/juninmd/tokenix
Local semantic index CLI that reduces LLM token usage 60-90% -- built in Rust, runs 100% offline
ai ai-tools claude-code cli developer-tools embeddings ollama rust semantic-search token-optimization
Last synced: 01 Jun 2026
https://github.com/marjoballabani/hypergrep
A better grep for AI agents. Structural search, call graphs, impact analysis, semantic compression. 87% fewer tokens. 16 languages. Built in Rust.
ai-agents ast call-graph claude-code cli code-intelligence code-search copilot cursor developer-tools grep llm ripgrep rust static-analysis token-optimization tree-sitter
Last synced: 05 Apr 2026
https://github.com/jackccrawford/token-scout
For OpenClaw, Hermes and more. Find free and low-cost inference (LLM models). Use them directly. Provides both a CLI and MCP server that knows which free-tier LLM APIs exist, which ones you have keys for, and which one fits your task. Returns endpoints so can you call models directly. No proxy, no middleware, no latency tax.
agent agentic-ai ai-agents claude-code free free-inference llm mcp mcp-server model-discovery model-routing ollama openclaw openrouter token-optimization
Last synced: 29 May 2026
https://github.com/philipjohnbasile/callsieve
Stop paying AI coding agents to grep your repo. CallSieve is a local-first, deterministic retrieval layer that feeds agents compact context packets to cut token spend โ no cloud, no API key, 20+ agents and MCP clients.
ai cli code-search coding-agents developer-tools llm local-first mcp retrieval rust token-optimization
Last synced: 08 Jun 2026
https://github.com/kbwen/agentic-os
Governance-first OS for AI coding agents โ structured workflows, delivery gates, engineering guardrails, and 17 professional skills for Claude Code, Cursor, Copilot, Antigravity & Codex.
agent-framework agentic-development agents-md ai-agent ai-governance ai-guardrails ai-workflow claude-code coding-agent cursor-rules developer-tools google-antigravity llm-tools multi-agent token-optimization
Last synced: 08 Jun 2026
https://github.com/ncmonx/icemage
Token-efficient context engine for AI coding agents. v2.0: Lean & Lossless Compaction governor + zoned profile/skill memory + identity-agnostic persona continuity + source provenance + command discovery + interlinked flows + live cross-session presence wire. 1541 tests, 41 MCP tools, Elastic License 2.0.
ai-agents ai-coding anthropic claude-code cli cline context-engineering cpp17 cursor developer-tools knowledge-graph llm-tools local-first mcp mcp-server prompt-engineering semantic-search sqlite token-optimization
Last synced: 11 Jun 2026
https://github.com/sayeem3051/python-context-engineer
Build perfect LLM context from your Python codebase โ automatically
artificial-intelligence claude codebase context-engineering developer-tools gpt llm machine-learning openai prompt-engineering python token-optimization
Last synced: 10 Apr 2026
https://github.com/iamgerwin/toon-php
A lightweight, fast TOON (Token-Oriented Object Notation) library for PHP. Optimized for LLM contexts. PHP 7.0-8.0 [legacy] and 8.1 and up [modern] support.
data-format json-alternative legacy legacy-php llm php php-serialize php-toon php8 serialization token-optimization toon toon-php
Last synced: 25 Jan 2026
https://github.com/tverney/llm-proxy-babylon
Multilingual LLM proxy that optimizes non-English prompts for better quality, lower token costs, and stronger safety alignment
ai-safety aws-bedrock cost-optimization language-detection llm low-resource-languages multilingual openai-api prompt-optimization proxy token-optimization translation
Last synced: 20 Apr 2026
https://github.com/castnettech/mnemosyne
LLM context compression and retrieval engine. Zero dependencies. Sub-100ms queries. 40-70% token reduction.
bm25 code-retrieval context-compression developer-tools llm open-source python tfidf token-optimization zero-dependencies
Last synced: 07 Apr 2026
https://github.com/ralphmoran/ticket-lens
Privacy-first CLI that transforms Jira tickets into AI-ready briefs. 60โ80% fewer tokens. No relay. Pipe-friendly.
ai atlassian cli context-window developer-productivity developer-tools devtools jira jira-api jira-server llm nodejs prompt-engineering token-optimization
Last synced: 09 Jun 2026
https://github.com/lancekrogers/tcount
Count tokens of files and directories
ai-tools counter developer-tools llms token-optimization tokens
Last synced: 30 May 2026
https://github.com/never00miss/allan-mcp-memory-code
๐ง Knowledge Graph Memory for AI Coding Agents - Full offline mode with Docker. Integrates with Claude, Cline, Cursor, Windsurf, and more. Auto-extracts entities & relationships. No API keys required.
ai-agents claude cline coding cursor docker graphiti knowledge-graph llm mcp memory offline-first plan token token-optimization
Last synced: 26 May 2026
https://github.com/saygex9965/-mcp-to-skill-converter
๐ Convert MCP servers into Claude Skills with 90% context savings, optimizing token usage for efficient tool operation.
ai-tools automation claude-code claude-code-plugin developer-tools mcp mcp-converter model-context-protocol nodejs plugin-development skills token-optimization typescript
Last synced: 14 Apr 2026
https://github.com/coderdayton/semantic-cache-mcp
MCP server that reduces LLM token usage by 80%+ through intelligent file caching, semantic diffs, and content-defined chunking.
caching claude embeddings llm mcp python semantic-search token-optimization
Last synced: 21 Apr 2026
https://github.com/selcukgural/toonnet
High-performance .NET serialization library for TOON format - 40% fewer tokens for AI/LLM, expression tree-based (10-100x faster)
ai csharp dotnet json llm serialization source-generators token-optimization toon toon-format yaml
Last synced: 08 Feb 2026
https://github.com/soturine/soturail
Local-first context rails for AI coding agents: reversible terminal compression, progressive repo reading, SDD workflows, hooks, benchmarks, memory and cache-friendly payloads.
agent-hooks ai-agents cli coding-agents context-engineering developer-tools llm-tools local-first prompt-caching rust soturail spec-driven-development terminal-compression token-optimization typescript
Last synced: 26 May 2026
https://github.com/mikkoparkkola/ultracos
Lossless, on-device token-cost reduction for Claude Code and LLM coding agents. Free plugin: compresses tool-result output, dedups context, compacts the system prompt โ stacks on Anthropic prompt caching. Rust hot path, Python fallback, fail-open. PolyForm Noncommercial.
agentic ai-agents anthropic claude claude-code context-compression cost-optimization developer-tools llm llm-tools mcp prompt-compression rust token-compression token-optimization
Last synced: 02 Jun 2026
https://github.com/ncmonx/icm-graph
Token-efficient context CLI for Claude Code, Cursor, Cline. Cuts AI coding costs 70-90% via context packs, output filters, local memory + receipts. 40 MCP tools, 122/122 tests, Apache-2.0.
ai-agents ai-coding anthropic claude-code cli cline context-engineering cpp17 cursor developer-tools knowledge-graph llm-tools local-first mcp mcp-server prompt-engineering semantic-search sqlite token-optimization
Last synced: 29 May 2026
https://github.com/yuchen20/context-crumb
Save Token Usage on Unstructured Document ๐. Let agent read docs, memories, prompts with in ultra-compressed mode through a tiny local model.
agent ai context-compaction context-compression skills token token-optimization
Last synced: 01 Jun 2026
https://github.com/blackwell-systems/gcf
GCF: token-optimized wire format for LLM tool responses. 84% fewer tokens than JSON, 34% fewer than TOON, 100% comprehension accuracy at scale.
ai-agents code-intelligence context-window format gcf graph llm mcp model-context-protocol specification token-optimization wire-format
Last synced: 06 Jun 2026
https://github.com/alohaninja/shift
SHIFT (Smart Hybrid Input Filtering & Transformation)
agentic-coding ai-coding anthropic claude-code cli command-line-tool cost-reduction developer-tools llm open-code open-source productivity rust token-optimization
Last synced: 29 Apr 2026
https://github.com/aisl-web/aisl
AI Native Semantic Language
ai ai-agents ai-assistant anthropic-claude cost-optimization dataformat gemini grok json-alternative llm machine-learning openai problem-solving token-optimization
Last synced: 09 Jun 2026
https://github.com/keradd/crux
Compression Runtime for Universal eXecution โ token-optimization runtime for AI coding agents (Rust, single binary, 10 layers, local-first)
ai-agents cli llm mcp rust sqlite token-optimization tree-sitter
Last synced: 03 May 2026
https://github.com/darkiceinteractive/mcp-conductor
97% fewer tokens. Parallel MCP execution through a sandboxed Deno runtime.
ai-tools claude deno mcp mcp-server token-optimization typescript
Last synced: 06 May 2026
https://github.com/fnrhombus/claude-code-pathfix
Claude Code hook that transparently converts Windows paths to POSIX in Bash commands โ eliminating retry loops and saving tokens
ai-coding claude-code claude-code-plugin developer-tools git-bash hooks path-normalization token-optimization windows
Last synced: 06 May 2026
https://github.com/nasirus/pytk-ai
Lightweight, zero-dependency Python library that filters shell command output for LLM token efficiency
ai-agents ai-coding cli developer-tools devtools llm output-filtering python shell token-optimization
Last synced: 06 Apr 2026
https://github.com/lugondev/format-json-llm
Convert JSON โ TOON and generate JSON Schema + compact TOON schema for LLM prompts. Token-efficient, with live token comparison.
converter json json-schema llm prompt-engineering schema-generator token-optimization toon vanilla-js vite
Last synced: 10 Jun 2026
https://github.com/developerjillur/nexalance-claude-code-kit
AI Development Operating System for Claude Code โ v4.4 LITE+ : ~70% fewer tokens vs v4.3, lazy-loaded playbooks, tier-aware Phase 0, risk-tiered review, MemPalace reliability fixes, Graphify integration, hooks-based enforcement
ai-coding ai-development anthropic claude claude-code claudemd developer-tools mempalace plugins token-optimization
Last synced: 29 May 2026
https://github.com/nadimtuhin/claude-token-optimizer
Reusable setup prompts for optimizing Claude Code documentation. Achieve 90% token savings on any project in 5 minutes.
ai-assistant automation claude-code developer-tools documentation setup-template token-optimization
Last synced: 13 May 2026
https://github.com/CodeShuX/tokenwise
Cut Claude Code spend without sacrificing quality โ and prove it. Haiku/Sonnet/Opus router with real $-saved numbers, not vibes.
ai-cost-optimization anthropic claude claude-code claude-skill cost-reduction developer-tools haiku llm-router model-routing opus productivity sonnet subagents token-optimization
Last synced: 29 May 2026
https://github.com/ilhan-monke/three-tier-ai-context
Hierarchical session tracking system for AI assistants that reduces token usage by 60-80%
ai ai-agents ai-context claude-code developer-tools documentation productivity session-tracking templates token-optimization
Last synced: 07 Jan 2026
https://github.com/stefanimp/context-prism
Multilingual, token-aware context routing for Obsidian and AI assistants.
ai ai-assistants context-engineering context-routing knowledge-management local-first markdown multilingual note-taking obsidian obsidian-plugin productivity token-optimization typescript
Last synced: 28 May 2026
https://github.com/sravan27/money-27-proof
Free AI agent cost-leak scanner + 48-hour private repo audit for Claude Code, Cursor & Codex teams (report, CI gate, fix plan). Method open-source & benchmarked. Plus AI automation rescue sprints.
agentic-coding ai-agents ai-automation ai-coding-agents automation claude-code cost-optimization developer-tools github-pages gohighlevel n8n polar repo-audit retell-ai token-optimization vapi voice-ai workflow-automation
Last synced: 31 May 2026
https://github.com/ignaciocolussi/simple_toon
Python parser and serializer for TOON (Token-Oriented Object Notation) - Reduce LLM token usage by 30-60%
data-format json llm parser python token-optimization toon
Last synced: 13 Jan 2026
https://github.com/ait88/claude-workflow-toolkit
Reusable workflow optimization toolkit for Claude Code agents.
agentic-workflow claude-code claude-skills gh-cli token-optimization
Last synced: 13 Jan 2026
https://github.com/fujiba/pdf-chunker
LLM-friendly PDF splitter & image optimizer. Chunk PDFs by size and downsample images for RAG/Bedrock.
aws-bedrock chunking claude cli image-optimization llm pdf pdf-chunker pdf-processing pdf-splitting python rag token-optimization
Last synced: 13 Jan 2026
https://github.com/yang1bai/claw-tsaver
A token-saving MCP proxy for OpenClaw users. Cuts tool call payloads by 90%+ via lazy expansion.
mcp openai python token-optimization
Last synced: 01 Jun 2026
https://github.com/digital-threads/token-pilot
Save 60-80% tokens when AI reads code โ MCP server for token-efficient code navigation with AST-aware structural reading
ai-coding ast claude claude-code code-navigation context-window cursor developer-tools llm mcp mcp-server model-context-protocol token-optimization tree-sitter
Last synced: 18 Apr 2026
https://github.com/rawcontext/reflex
Episodic memory and semantic cache proxy for LLM APIs with ~40% token savings
agent-orchestration ai-agents context-graph developer-tools knowledge-graph llm-proxy semantic-cache token-optimization
Last synced: 11 Jan 2026
https://github.com/preflight-dev/preflight
โ๏ธ 24-tool MCP server for Claude Code: preflight checks for your prompts, cross-service context, session history search with LanceDB vectors, correction pattern learning, cost estimation
ai-coding ai-tools anthropic claude claude-code code-quality cost-estimation developer-tools devtools lancedb mcp mcp-server model-context-protocol preflight prompt-engineering prompt-quality semantic-search token-optimization typescript vector-search
Last synced: 02 Apr 2026
https://github.com/chokmah-me/claude-code-playbook
Practitioner's Playbook for Claude Code: Configuration for Token-Efficient AI Engineering
ai-development claude-ai developer-tools refactoring token-optimization typescript workflows
Last synced: 04 Mar 2026
https://github.com/gobbyai/gobby-cli
Rust CLI tools for AI-assisted development. AST-aware code search with tree-sitter + FTS5, and a YAML-configurable output compressor with 28 built-in pipelines. >90% token savings. Part of Gobby, works standalone.
ai-agents ai-coding-assistant ai-pair-programming ast claude-code cli code-search dependency-graph developer-tools fts5 gobby llm neo4j output-compaction rust sqlite symbol-navigation token-optimization tree-sitter yaml
Last synced: 06 Jun 2026
https://github.com/boygotflames/promptus-dsl
A Rust-based compiler for the .llm prompt format. Stop wasting tokens on Markdown and start treating your prompts like code. Features deterministic AST parsing, CI bench regression, and an 8.5% average reduction in token bloat.
ai-agents ai-agents-cli compiler llm parser prompt-engineering rust token-optimization
Last synced: 24 May 2026
https://github.com/cablate/claude-code-research
Independent research on Claude Code internals, Claude Agent SDK, and related tooling.
claude-agent-sdk claude-code mcp prompt-caching research reverse-engineering system-prompt token-optimization
Last synced: 03 Apr 2026
https://github.com/joshuaswarren/openclaw-tactician
Intelligent model routing for OpenClaw with quota prediction, task classification, and automatic optimization
ai-agent cost-optimization llm-routing mlx model-routing ollama openclaw openclaw-plugin quota-management token-optimization
Last synced: 19 Feb 2026
https://github.com/mupozg823/codelens-mcp-plugin
Rust MCP server for bounded code intelligence, gated mutation, and auditable agent workflows.
agent-harness ai-tools claude-code code-intelligence codex cursor developer-tools harness lsp mcp mcp-server rust semantic-search token-optimization tree-sitter
Last synced: 02 May 2026
https://github.com/mukundakatta/tokenwise
Token usage optimization toolkit โ count, estimate costs, optimize prompts, track budgets across LLM providers
ai llm machine-learning officethree open-source python token-optimization tokens
Last synced: 23 Apr 2026
https://github.com/tokenpak/tokenpak
Drop-in HTTP proxy that compresses LLM context, optimizes cache hits, routes smart, and tracks every dollar. Zero SDK changes required.
ai anthropic compression context-window cost-tracking developer-tools gemini llm openai proxy python token-optimization
Last synced: 09 May 2026