An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with token-optimization

A curated list of projects in awesome lists tagged with token-optimization .

https://github.com/rtk-ai/rtk

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

agentic-coding ai-coding anthropic claude-code cli command-line-tool cost-reduction developer-tools llm open-source productivity rust token-optimization

Last synced: 18 Apr 2026

https://github.com/open-compress/claw-compactor

🦞 LLM Token Compression & Reduction Tool — Cut AI agent token costs by up to 97%. 6-layer deterministic context compression for AI agent workspaces. No LLM required. Prompt compression, context window optimization & cost reduction for any LLM pipeline.

ai-agent-tools ai-cost-saving ai-infrastructure claw-compactor context-compression context-pruning context-window-optimization developer-tools llm-compression llm-context-compression llm-cost-reduction llm-token-compression llm-tools openclaw prompt-compression python-tools token-compression token-optimization token-reduction token-saving

Last synced: 01 Apr 2026

https://github.com/yvgude/lean-ctx

Reduce AI coding costs by 99% — MCP Server + Shell Hook for Cursor, Claude Code, Copilot, Windsurf, Gemini CLI & 24 tools. Single Rust binary, zero telemetry.

agentic-coding ai ai-coding claude-code context-engineering copilot cursor developer-tools gemini-cli llm mcp mcp-server reduce-token-costs rust token-optimization

Last synced: 25 Apr 2026

https://github.com/GMaN1911/claude-cognitive

Working memory for Claude Code - persistent context and multi-instance coordination

claude-ai claude-code context-management developer-tools productivity token-optimization

Last synced: 15 Jan 2026

https://github.com/Lap-Platform/LAP

Your agents are guessing at APIs. Give them the actual Agent-Native spec. 1500+ API's Ready To-Use skills, Compile any API spec into a lean, agent-native format. 10× smaller. OpenAPI, GraphQL, AsyncAPI, Protobuf, Postman.

agent-experience ai ai-agents api api-compression api-spec asyncapi claude cli developer-tools graphql llm mcp open-source openapi postman protobuf python sdk token-optimization

Last synced: 20 Apr 2026

https://github.com/mpecan/tokf

Config-driven CLI tool that compresses command output before it reaches an LLM context

ai-tools claude-code cli command-line context-window developer-tools homebrew llm output-filter rust token-optimization toml

Last synced: 18 Apr 2026

https://github.com/mischasigtermans/laravel-toon

TOON encoding for Laravel. Encode data for AI/LLMs with 40-60% fewer tokens than JSON.

ai json-alternative laravel llm mcp token-optimization token-optimizer toon

Last synced: 13 Jan 2026

https://github.com/avilum/minrlm

Stop forcing LLMs to answer in one pass. Give them a runtime. Recursive Language Model that improves any LLM, while reducing token usage up to 4X.

agent ai-agents cost-optimization latency-optimization llm llm-inference llmops recursive-language-model rlm token-optimization

Last synced: 07 Apr 2026

https://github.com/sbsaga/toon

TOON — Laravel AI package for compact, human-readable, token-efficient data format with JSON ⇄ TOON conversion for ChatGPT, OpenAI, and other LLM prompts.

ai ai-library api-optimization chatgpt compact-format data-compression developer-tool human-readable json laravel laravel-package large-language-models llm openai php php-library prompt-engineering serialization token-optimization toon

Last synced: 02 Mar 2026

https://github.com/rjkaes/trueline-mcp

Smarter reads, safer edits. An MCP plugin that cuts token usage and catches editing mistakes before they hit disk. Supports Claude Code, Gemini CLI, GitHub Copilot, and Codex.

agentic-coding ai-coding anthropic ast claude-code developer-tools gemini-cli github-copilot llm mcp mcp-server model-context-protocol multi-agent openai-codex token-optimization tree-sitter typescript

Last synced: 12 Apr 2026

https://github.com/coalesce-labs/catalyst

Token-efficient Claude Code workspace with parallel agents and persistent memory. Research → Plan → Implement → Validate workflow.

agent-memory agentic-coding ai-agents ai-coding claude-code-commands claude-code-plugin claude-code-plugins-marketplace claude-code-subagents context-engineering token-optimization

Last synced: 24 Apr 2026

https://github.com/tingjiainfuture/pixrep

Let LLMs see your codebase just like you do.

context-window llm multimodal pdf-generation token-optimization

Last synced: 07 Mar 2026

https://github.com/iagocavalcante/claude-turbo-search

Optimized file search and semantic indexing for large codebases in Claude Code

claude claude-code cli developer-tools fzf hooks qmd ripgrep semantic-search token-optimization

Last synced: 16 Feb 2026

https://github.com/agusrdz/chop

CLI output compressor for Claude Code. Reduces token consumption by 50–90% by compressing verbose command output before it enters the context window. Supports 52+ commands — git, docker, kubectl, npm, terraform, and more.

claude claude-ai claude-code cli context-window developer-tools golang llm productivity token-optimization

Last synced: 03 Apr 2026

https://github.com/kalpeshgamit/codebase-pilot

AI context engine for Claude Code, Cursor, Windsurf — pack, compress, and optimize any codebase. Save 60-90% tokens. Web dashboard on port 7456.

agent-orchestration ai-context-engine claude-code cli-tool code-compression code-context-engine code-review codebase-packer cursor-ai developer-tools llm-tools mcp-server nodejs security-scanner sub-agents token-optimization typescript vibe-coding windsurf

Last synced: 13 Apr 2026

https://github.com/marjoballabani/hypergrep

A better grep for AI agents. Structural search, call graphs, impact analysis, semantic compression. 87% fewer tokens. 16 languages. Built in Rust.

ai-agents ast call-graph claude-code cli code-intelligence code-search copilot cursor developer-tools grep llm ripgrep rust static-analysis token-optimization tree-sitter

Last synced: 05 Apr 2026

https://github.com/tverney/llm-proxy-babylon

Multilingual LLM proxy that optimizes non-English prompts for better quality, lower token costs, and stronger safety alignment

ai-safety aws-bedrock cost-optimization language-detection llm low-resource-languages multilingual openai-api prompt-optimization proxy token-optimization translation

Last synced: 20 Apr 2026

https://github.com/iamgerwin/toon-php

A lightweight, fast TOON (Token-Oriented Object Notation) library for PHP. Optimized for LLM contexts. PHP 7.0-8.0 [legacy] and 8.1 and up [modern] support.

data-format json-alternative legacy legacy-php llm php php-serialize php-toon php8 serialization token-optimization toon toon-php

Last synced: 25 Jan 2026

https://github.com/selcukgural/toonnet

High-performance .NET serialization library for TOON format - 40% fewer tokens for AI/LLM, expression tree-based (10-100x faster)

ai csharp dotnet json llm serialization source-generators token-optimization toon toon-format yaml

Last synced: 08 Feb 2026

https://github.com/castnettech/mnemosyne

LLM context compression and retrieval engine. Zero dependencies. Sub-100ms queries. 40-70% token reduction.

bm25 code-retrieval context-compression developer-tools llm open-source python tfidf token-optimization zero-dependencies

Last synced: 07 Apr 2026

https://github.com/saygex9965/-mcp-to-skill-converter

🔄 Convert MCP servers into Claude Skills with 90% context savings, optimizing token usage for efficient tool operation.

ai-tools automation claude-code claude-code-plugin developer-tools mcp mcp-converter model-context-protocol nodejs plugin-development skills token-optimization typescript

Last synced: 14 Apr 2026

https://github.com/coderdayton/semantic-cache-mcp

MCP server that reduces LLM token usage by 80%+ through intelligent file caching, semantic diffs, and content-defined chunking.

caching claude embeddings llm mcp python semantic-search token-optimization

Last synced: 21 Apr 2026

https://github.com/tokenpak/tokenpak

Drop-in HTTP proxy that compresses LLM context, optimizes cache hits, routes smart, and tracks every dollar. Zero SDK changes required.

ai anthropic compression context-window cost-tracking developer-tools gemini llm openai proxy python token-optimization

Last synced: 24 Apr 2026

https://github.com/nasirus/pytk-ai

Lightweight, zero-dependency Python library that filters shell command output for LLM token efficiency

ai-agents ai-coding cli developer-tools devtools llm output-filtering python shell token-optimization

Last synced: 06 Apr 2026

https://github.com/joshuaswarren/openclaw-tactician

Intelligent model routing for OpenClaw with quota prediction, task classification, and automatic optimization

ai-agent cost-optimization llm-routing mlx model-routing ollama openclaw openclaw-plugin quota-management token-optimization

Last synced: 19 Feb 2026

https://github.com/fujiba/pdf-chunker

LLM-friendly PDF splitter & image optimizer. Chunk PDFs by size and downsample images for RAG/Bedrock.

aws-bedrock chunking claude cli image-optimization llm pdf pdf-chunker pdf-processing pdf-splitting python rag token-optimization

Last synced: 13 Jan 2026

https://github.com/ait88/claude-workflow-toolkit

Reusable workflow optimization toolkit for Claude Code agents.

agentic-workflow claude-code claude-skills gh-cli token-optimization

Last synced: 13 Jan 2026

https://github.com/ignaciocolussi/simple_toon

Python parser and serializer for TOON (Token-Oriented Object Notation) - Reduce LLM token usage by 30-60%

data-format json llm parser python token-optimization toon

Last synced: 13 Jan 2026

https://github.com/ilhan-monke/three-tier-ai-context

Hierarchical session tracking system for AI assistants that reduces token usage by 60-80%

ai ai-agents ai-context claude-code developer-tools documentation productivity session-tracking templates token-optimization

Last synced: 07 Jan 2026

https://github.com/nadimtuhin/claude-token-optimizer

Reusable setup prompts for optimizing Claude Code documentation. Achieve 90% token savings on any project in 5 minutes.

ai-assistant automation claude-code developer-tools documentation setup-template token-optimization

Last synced: 15 Apr 2026

https://github.com/digital-threads/token-pilot

Save 60-80% tokens when AI reads code — MCP server for token-efficient code navigation with AST-aware structural reading

ai-coding ast claude claude-code code-navigation context-window cursor developer-tools llm mcp mcp-server model-context-protocol token-optimization tree-sitter

Last synced: 18 Apr 2026

https://github.com/preflight-dev/preflight

✈️ 24-tool MCP server for Claude Code: preflight checks for your prompts, cross-service context, session history search with LanceDB vectors, correction pattern learning, cost estimation

ai-coding ai-tools anthropic claude claude-code code-quality cost-estimation developer-tools devtools lancedb mcp mcp-server model-context-protocol preflight prompt-engineering prompt-quality semantic-search token-optimization typescript vector-search

Last synced: 02 Apr 2026

https://github.com/chokmah-me/claude-code-playbook

Practitioner's Playbook for Claude Code: Configuration for Token-Efficient AI Engineering

ai-development claude-ai developer-tools refactoring token-optimization typescript workflows

Last synced: 04 Mar 2026

https://github.com/gobbyai/gobby-cli

Rust CLI tools for AI-assisted development. AST-aware code search with tree-sitter + FTS5, and a YAML-configurable output compressor with 28 built-in pipelines. >90% token savings. Part of Gobby, works standalone.

ai-agents ai-coding-assistant ai-pair-programming ast claude-code cli code-search dependency-graph developer-tools fts5 gobby llm neo4j output-compaction rust sqlite symbol-navigation token-optimization tree-sitter yaml

Last synced: 19 Apr 2026

https://github.com/cablate/claude-code-research

Independent research on Claude Code internals, Claude Agent SDK, and related tooling.

claude-agent-sdk claude-code mcp prompt-caching research reverse-engineering system-prompt token-optimization

Last synced: 03 Apr 2026

https://github.com/mupozg823/codelens-mcp-plugin

Harness-native compressed context engine for AI coding agents — Pure Rust MCP server, 89+ tools, 25 languages, 50-87% token reduction

agent-harness ai-tools claude-code code-intelligence codex cursor developer-tools harness lsp mcp mcp-server rust semantic-search token-optimization tree-sitter

Last synced: 21 Apr 2026

https://github.com/mukundakatta/tokenwise

Token usage optimization toolkit — count, estimate costs, optimize prompts, track budgets across LLM providers

ai llm machine-learning officethree open-source python token-optimization tokens

Last synced: 23 Apr 2026

https://github.com/rawcontext/reflex

Episodic memory and semantic cache proxy for LLM APIs with ~40% token savings

agent-orchestration ai-agents context-graph developer-tools knowledge-graph llm-proxy semantic-cache token-optimization

Last synced: 11 Jan 2026