Projects in Awesome Lists tagged with token-optimization
A curated list of projects in awesome lists tagged with token-optimization.
https://github.com/rtk-ai/rtk
CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies.
agentic-coding ai-coding anthropic claude-code cli command-line-tool cost-reduction developer-tools llm open-source productivity rust token-optimization
Last synced: 18 Apr 2026
https://github.com/open-compress/claw-compactor
🦞 LLM Token Compression & Reduction Tool — Cut AI agent token costs by up to 97%. 6-layer deterministic context compression for AI agent workspaces. No LLM required. Prompt compression, context window optimization & cost reduction for any LLM pipeline.
ai-agent-tools ai-cost-saving ai-infrastructure claw-compactor context-compression context-pruning context-window-optimization developer-tools llm-compression llm-context-compression llm-cost-reduction llm-token-compression llm-tools openclaw prompt-compression python-tools token-compression token-optimization token-reduction token-saving
Last synced: 01 Apr 2026
https://github.com/chopratejas/headroom
The Context Optimization Layer for LLM Applications
agent ai anthropic compression context-engineering context-window fastapi langchain llm mcp openai proxy python rag token-optimization
Last synced: 25 Apr 2026
https://github.com/yvgude/lean-ctx
Reduce AI coding costs by 99% — MCP Server + Shell Hook for Cursor, Claude Code, Copilot, Windsurf, Gemini CLI & 24 tools. Single Rust binary, zero telemetry.
agentic-coding ai ai-coding claude-code context-engineering copilot cursor developer-tools gemini-cli llm mcp mcp-server reduce-token-costs rust token-optimization
Last synced: 25 Apr 2026
https://github.com/GMaN1911/claude-cognitive
Working memory for Claude Code - persistent context and multi-instance coordination
claude-ai claude-code context-management developer-tools productivity token-optimization
Last synced: 15 Jan 2026
https://github.com/alexgreensh/token-optimizer
Find the ghost tokens. Fix them. Survive compaction. Avoid context quality decay.
agentskills claude-code claude-code-skill claude-plugin context-engineering context-window ghost-tokens token-optimization token-optimizer token-usage
Last synced: 21 Apr 2026
https://github.com/Lap-Platform/LAP
Your agents are guessing at APIs. Give them the actual agent-native spec. 1500+ APIs available as ready-to-use skills. Compile any API spec into a lean, agent-native format. 10× smaller. OpenAPI, GraphQL, AsyncAPI, Protobuf, Postman.
agent-experience ai ai-agents api api-compression api-spec asyncapi claude cli developer-tools graphql llm mcp open-source openapi postman protobuf python sdk token-optimization
Last synced: 20 Apr 2026
https://github.com/mpecan/tokf
Config-driven CLI tool that compresses command output before it reaches an LLM context
ai-tools claude-code cli command-line context-window developer-tools homebrew llm output-filter rust token-optimization toml
Last synced: 18 Apr 2026
https://github.com/mischasigtermans/laravel-toon
TOON encoding for Laravel. Encode data for AI/LLMs with 40-60% fewer tokens than JSON.
ai json-alternative laravel llm mcp token-optimization token-optimizer toon
Last synced: 13 Jan 2026
https://github.com/avilum/minrlm
Stop forcing LLMs to answer in one pass. Give them a runtime. Recursive Language Model that improves any LLM, while reducing token usage up to 4X.
agent ai-agents cost-optimization latency-optimization llm llm-inference llmops recursive-language-model rlm token-optimization
Last synced: 07 Apr 2026
https://github.com/sbsaga/toon
TOON — Laravel AI package for compact, human-readable, token-efficient data format with JSON ⇄ TOON conversion for ChatGPT, OpenAI, and other LLM prompts.
ai ai-library api-optimization chatgpt compact-format data-compression developer-tool human-readable json laravel laravel-package large-language-models llm openai php php-library prompt-engineering serialization token-optimization toon
Last synced: 02 Mar 2026
https://github.com/rjkaes/trueline-mcp
Smarter reads, safer edits. An MCP plugin that cuts token usage and catches editing mistakes before they hit disk. Supports Claude Code, Gemini CLI, GitHub Copilot, and Codex.
agentic-coding ai-coding anthropic ast claude-code developer-tools gemini-cli github-copilot llm mcp mcp-server model-context-protocol multi-agent openai-codex token-optimization tree-sitter typescript
Last synced: 12 Apr 2026
https://github.com/coalesce-labs/catalyst
Token-efficient Claude Code workspace with parallel agents and persistent memory. Research → Plan → Implement → Validate workflow.
agent-memory agentic-coding ai-agents ai-coding claude-code-commands claude-code-plugin claude-code-plugins-marketplace claude-code-subagents context-engineering token-optimization
Last synced: 24 Apr 2026
https://github.com/tingjiainfuture/pixrep
Let LLMs see your codebase just like you do.
context-window llm multimodal pdf-generation token-optimization
Last synced: 07 Mar 2026
https://github.com/sir-ad/nexus-prime
The Self-Evolving Agent Operating System
agentic-framework ai-agents guardrails-framework mcp-servers multi-agent-orchestration token-optimization
Last synced: 08 Apr 2026
https://github.com/iagocavalcante/claude-turbo-search
Optimized file search and semantic indexing for large codebases in Claude Code
claude claude-code cli developer-tools fzf hooks qmd ripgrep semantic-search token-optimization
Last synced: 16 Feb 2026
https://github.com/agusrdz/chop
CLI output compressor for Claude Code. Reduces token consumption by 50–90% by compressing verbose command output before it enters the context window. Supports 52+ commands — git, docker, kubectl, npm, terraform, and more.
claude claude-ai claude-code cli context-window developer-tools golang llm productivity token-optimization
Last synced: 03 Apr 2026
https://github.com/argahsuknesib/toon-ld
Token Oriented Object Notation (TOON) for Linked Data
context-window json-ld knowledge-graph linked-data llm rag rdf rust semantic-web serialization token-optimization token-oriented-object-notation wasm
Last synced: 13 Jan 2026
https://github.com/kalpeshgamit/codebase-pilot
AI context engine for Claude Code, Cursor, Windsurf — pack, compress, and optimize any codebase. Save 60-90% tokens. Web dashboard on port 7456.
agent-orchestration ai-context-engine claude-code cli-tool code-compression code-context-engine code-review codebase-packer cursor-ai developer-tools llm-tools mcp-server nodejs security-scanner sub-agents token-optimization typescript vibe-coding windsurf
Last synced: 13 Apr 2026
https://github.com/marjoballabani/hypergrep
A better grep for AI agents. Structural search, call graphs, impact analysis, semantic compression. 87% fewer tokens. 16 languages. Built in Rust.
ai-agents ast call-graph claude-code cli code-intelligence code-search copilot cursor developer-tools grep llm ripgrep rust static-analysis token-optimization tree-sitter
Last synced: 05 Apr 2026
https://github.com/tverney/llm-proxy-babylon
Multilingual LLM proxy that optimizes non-English prompts for better quality, lower token costs, and stronger safety alignment
ai-safety aws-bedrock cost-optimization language-detection llm low-resource-languages multilingual openai-api prompt-optimization proxy token-optimization translation
Last synced: 20 Apr 2026
https://github.com/iamgerwin/toon-php
A lightweight, fast TOON (Token-Oriented Object Notation) library for PHP. Optimized for LLM contexts. PHP 7.0-8.0 [legacy] and 8.1 and up [modern] support.
data-format json-alternative legacy legacy-php llm php php-serialize php-toon php8 serialization token-optimization toon toon-php
Last synced: 25 Jan 2026
https://github.com/sayeem3051/python-context-engineer
Build perfect LLM context from your Python codebase — automatically
artificial-intelligence claude codebase context-engineering developer-tools gpt llm machine-learning openai prompt-engineering python token-optimization
Last synced: 10 Apr 2026
https://github.com/selcukgural/toonnet
High-performance .NET serialization library for TOON format - 40% fewer tokens for AI/LLM, expression tree-based (10-100x faster)
ai csharp dotnet json llm serialization source-generators token-optimization toon toon-format yaml
Last synced: 08 Feb 2026
https://github.com/castnettech/mnemosyne
LLM context compression and retrieval engine. Zero dependencies. Sub-100ms queries. 40-70% token reduction.
bm25 code-retrieval context-compression developer-tools llm open-source python tfidf token-optimization zero-dependencies
Last synced: 07 Apr 2026
https://github.com/saygex9965/-mcp-to-skill-converter
🔄 Convert MCP servers into Claude Skills with 90% context savings, optimizing token usage for efficient tool operation.
ai-tools automation claude-code claude-code-plugin developer-tools mcp mcp-converter model-context-protocol nodejs plugin-development skills token-optimization typescript
Last synced: 14 Apr 2026
https://github.com/coderdayton/semantic-cache-mcp
MCP server that reduces LLM token usage by 80%+ through intelligent file caching, semantic diffs, and content-defined chunking.
caching claude embeddings llm mcp python semantic-search token-optimization
Last synced: 21 Apr 2026
https://github.com/tokenpak/tokenpak
Drop-in HTTP proxy that compresses LLM context, optimizes cache hits, routes smart, and tracks every dollar. Zero SDK changes required.
ai anthropic compression context-window cost-tracking developer-tools gemini llm openai proxy python token-optimization
Last synced: 24 Apr 2026
https://github.com/nasirus/pytk-ai
Lightweight, zero-dependency Python library that filters shell command output for LLM token efficiency
ai-agents ai-coding cli developer-tools devtools llm output-filtering python shell token-optimization
Last synced: 06 Apr 2026
https://github.com/clark-mackey/log-file-genius
Token-efficient log file system for AI coding assistants - reduce context bloat by 93%
ai-coding-assistant ai-development augment best-practices changelog claude-code context-management cursor developer-tools devlog documentation git-workflow github-copilot markdown multi-agent productivity project-management software-documentation template token-optimization
Last synced: 01 Nov 2025
https://github.com/joshuaswarren/openclaw-tactician
Intelligent model routing for OpenClaw with quota prediction, task classification, and automatic optimization
ai-agent cost-optimization llm-routing mlx model-routing ollama openclaw openclaw-plugin quota-management token-optimization
Last synced: 19 Feb 2026
https://github.com/fujiba/pdf-chunker
LLM-friendly PDF splitter & image optimizer. Chunk PDFs by size and downsample images for RAG/Bedrock.
aws-bedrock chunking claude cli image-optimization llm pdf pdf-chunker pdf-processing pdf-splitting python rag token-optimization
Last synced: 13 Jan 2026
https://github.com/ait88/claude-workflow-toolkit
Reusable workflow optimization toolkit for Claude Code agents.
agentic-workflow claude-code claude-skills gh-cli token-optimization
Last synced: 13 Jan 2026
https://github.com/ignaciocolussi/simple_toon
Python parser and serializer for TOON (Token-Oriented Object Notation) - Reduce LLM token usage by 30-60%
data-format json llm parser python token-optimization toon
Last synced: 13 Jan 2026
https://github.com/aisl-web/aisl
AI Native Semantic Language
ai ai-agents ai-assistant anthropic-claude cost-optimization dataformat gemini grok json-alternative llm machine-learning openai problem-solving token-optimization
Last synced: 22 Nov 2025
https://github.com/ilhan-monke/three-tier-ai-context
Hierarchical session tracking system for AI assistants that reduces token usage by 60-80%
ai ai-agents ai-context claude-code developer-tools documentation productivity session-tracking templates token-optimization
Last synced: 07 Jan 2026
https://github.com/nadimtuhin/claude-token-optimizer
Reusable setup prompts for optimizing Claude Code documentation. Achieve 90% token savings on any project in 5 minutes.
ai-assistant automation claude-code developer-tools documentation setup-template token-optimization
Last synced: 15 Apr 2026
https://github.com/digital-threads/token-pilot
Save 60-80% tokens when AI reads code — MCP server for token-efficient code navigation with AST-aware structural reading
ai-coding ast claude claude-code code-navigation context-window cursor developer-tools llm mcp mcp-server model-context-protocol token-optimization tree-sitter
Last synced: 18 Apr 2026
https://github.com/preflight-dev/preflight
✈️ 24-tool MCP server for Claude Code: preflight checks for your prompts, cross-service context, session history search with LanceDB vectors, correction pattern learning, cost estimation
ai-coding ai-tools anthropic claude claude-code code-quality cost-estimation developer-tools devtools lancedb mcp mcp-server model-context-protocol preflight prompt-engineering prompt-quality semantic-search token-optimization typescript vector-search
Last synced: 02 Apr 2026
https://github.com/chokmah-me/claude-code-playbook
Practitioner's Playbook for Claude Code: Configuration for Token-Efficient AI Engineering
ai-development claude-ai developer-tools refactoring token-optimization typescript workflows
Last synced: 04 Mar 2026
https://github.com/gobbyai/gobby-cli
Rust CLI tools for AI-assisted development. AST-aware code search with tree-sitter + FTS5, and a YAML-configurable output compressor with 28 built-in pipelines. >90% token savings. Part of Gobby, works standalone.
ai-agents ai-coding-assistant ai-pair-programming ast claude-code cli code-search dependency-graph developer-tools fts5 gobby llm neo4j output-compaction rust sqlite symbol-navigation token-optimization tree-sitter yaml
Last synced: 19 Apr 2026
https://github.com/cablate/claude-code-research
Independent research on Claude Code internals, Claude Agent SDK, and related tooling.
claude-agent-sdk claude-code mcp prompt-caching research reverse-engineering system-prompt token-optimization
Last synced: 03 Apr 2026
https://github.com/mupozg823/codelens-mcp-plugin
Harness-native compressed context engine for AI coding agents — Pure Rust MCP server, 89+ tools, 25 languages, 50-87% token reduction
agent-harness ai-tools claude-code code-intelligence codex cursor developer-tools harness lsp mcp mcp-server rust semantic-search token-optimization tree-sitter
Last synced: 21 Apr 2026
https://github.com/mukundakatta/tokenwise
Token usage optimization toolkit — count, estimate costs, optimize prompts, track budgets across LLM providers
ai llm machine-learning officethree open-source python token-optimization tokens
Last synced: 23 Apr 2026
https://github.com/rawcontext/reflex
Episodic memory and semantic cache proxy for LLM APIs with ~40% token savings
agent-orchestration ai-agents context-graph developer-tools knowledge-graph llm-proxy semantic-cache token-optimization
Last synced: 11 Jan 2026