Projects in Awesome Lists tagged with context-window
A curated list of projects in awesome lists tagged with context-window.
https://github.com/chopratejas/headroom
The Context Optimization Layer for LLM Applications
agent ai anthropic compression context-engineering context-window fastapi langchain llm mcp openai proxy python rag token-optimization
Last synced: 14 May 2026
https://github.com/alexgreensh/token-optimizer
Find the ghost tokens. Fix them. Survive compaction. Avoid context quality decay.
agentskills claude-code claude-code-skill claude-plugin context-engineering context-window ghost-tokens token-optimization token-optimizer token-usage
Last synced: 30 May 2026
https://github.com/datamllab/longlm
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
context-window large-language-models llm longlm self-extend selfextend
Last synced: 28 Jun 2025
https://github.com/datamllab/LongLM
[ICML'24] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
context-window large-language-models llm longlm self-extend selfextend
Last synced: 08 May 2025
https://github.com/mnemon-dev/mnemon
LLM-supervised persistent memory for AI agents — graph-based recall, cross-session knowledge, single binary. Works with Claude Code, OpenClaw, and any CLI agent.
agent-framework agent-memory ai-agent ai-tools claude claude-code claude-memory cli context-window golang knowledge-graph llm-agent llm-memory llm-supervised mcp memory openclaw persistent-memory rag sqlite
Last synced: 31 May 2026
https://github.com/smart-mcp-proxy/mcpproxy-go
Supercharge AI Agents, Safely
ai ai-agents audit-logging bm25 cli context-window developer-tools docker golang llm llm-tools local-first mcp mcp-proxy mcp-server model-context-protocol proxy-server security tool-routing web-ui
Last synced: 24 May 2026
https://github.com/mpecan/tokf
Config-driven CLI tool that compresses command output before it reaches an LLM context
ai-tools claude-code cli command-line context-window developer-tools homebrew llm output-filter rust token-optimization toml
Last synced: 14 Jun 2026
https://github.com/memvid/claude-brain
Give Claude Code photographic memory in ONE portable file. No database, no SQLite, no ChromaDB - just a single .mv2 file you can git commit, scp, or share. Native Rust core with sub-ms operations.
ai-tools anthropic claude claude-agents claude-ai claude-code claude-skills context-window developer-tools llm-memory long-term-memory memvid persistent-memory portable rag rust single-file
Last synced: 15 Feb 2026
https://github.com/zzet/gortex
High-performance code graph and code intelligence engine, supports 257 languages, multi repositories, with access via CLI, MCP Server, and API. Built for AI coding agents - expose only needed information, cutting token usage up to 50x. 100% local.
ai-tools antigravity claude-code code-analysis code-assistant context-window context-window-optimization context-window-optimizer copilot cursor developer-tools graphrag kiro knowledge-graph local-first mcp-server prompts skills tokens windsurf
Last synced: 17 Jun 2026
https://github.com/claudioemmanuel/squeez
Hook-based token compressor for 5 AI CLI hosts (Claude Code, Copilot CLI, OpenCode, Gemini CLI, Codex CLI). Up to 95% bash compression, signature-mode for code reads, cross-call dedup, MCP server, self-teaching protocol. Zero runtime deps.
ai-cli bash-hook claude-code codex-cli context-engineering context-window copilot-cli gemini-cli llm llm-tools mcp-server opencode rust session-memory signature-extraction token-compression token-optimizer zero-dependency
Last synced: 18 May 2026
https://github.com/brandondocusen/cntxtpy
A discovery and compression tool for your Python codebase. Creates a knowledge graph for an LLM context window, efficiently outlining your project | Code structure visualization | LLM Context Window Efficiency | Static analysis for AI | Large Language Model tooling #LLM #AI #Python #CodeAnalysis #ContextWindow #DeveloperTools
architecture-insights code-documentation code-visualization codebase-analysis context-management context-window decorators dependency-analysis dependency-mapping developer-tools knowledge-graph large-language-models llm llm-integration machine-learning module-relationships numpy open-source python token-reduction
Last synced: 07 Apr 2025
https://github.com/t8/memoryport
Local-first permanent, persistent memory for all agents and humans
Last synced: 16 Apr 2026
https://github.com/rasros/lx
Recursively find, filter, and format code files for ChatGPT and Claude context windows directly from your terminal.
chatgpt claude cli clipboard code-analysis context-window developer-tools gitignore go go-lang llm productivity prompt-engineering terminal
Last synced: 25 Jan 2026
https://github.com/hunhee98/pluck
MCP-native code retrieval for AI agents — 84-88% fewer read tokens, BM25F + semantic search, AST chunks, session dedup
ai-agents bm25 claude-code cli code-intelligence code-search codex context-window developer-tools embeddings llm mcp mcp-server rag ripgrep rust semantic-search tantivy token-optimization tree-sitter
Last synced: 01 Jun 2026
https://github.com/brandondocusen/cntxtcs
A lightweight tool to optimize your C# project for LLM context windows by using a knowledge graph | Code structure visualization | Static analysis for AI | Large Language Model tooling | .NET ecosystem support #LLM #AI #CSharp #DotNet #CodeAnalysis #ContextWindow #DeveloperTools
architecture-insights code-documentation code-visualization codebase-analysis context-management context-window csharp dependency-analysis dependency-mapping developer-tools dotnet knowledge-graph large-language-models llm llm-integration machine-learning module-relationships open-source python token-reduction
Last synced: 05 May 2025
https://github.com/topos-labs/infiniloom
High-performance repository context generator for LLMs - Transform codebases into optimized formats for Claude, GPT-4/5, Gemini, and other LLMs
ast claude cli code-analysis context-window developer-tools gpt-4 llm nodejs python-bindings rust tokenizer tree-sitter
Last synced: 16 Mar 2026
https://github.com/brandondocusen/cntxtjv
A discovery and compression tool for your Java codebase. Creates a knowledge graph for an LLM context window, efficiently outlining your project #LLM #AI #Java #CodeAnalysis #ContextWindow #DeveloperTools #StaticAnalysis #CodeVisualization
architecture-insights code-documentation code-visualization codebase-analysis context-management context-window dependency-analysis dependency-mapping developer-tools knowledge-graph large-language-models llm llm-integration machine-learning module-relationships open-source python python-frameworks spring token-reduction
Last synced: 10 Apr 2025
https://github.com/tingjiainfuture/pixrep
Let LLMs see your codebase just like you do.
context-window llm multimodal pdf-generation token-optimization
Last synced: 07 Mar 2026
https://github.com/elijas/baml-agents
Building Agents with LLM structured generation (BAML), MCP Tools, and 12-Factor Agents principles
12-factor 12-factor-agents ai ai-agents ai-agents-framework baml context-window framework llms mcp-client memory orchestration rag structured-generation
Last synced: 07 May 2025
https://github.com/sorunokoe/scopeon
AI context observability for Claude Code & friends — token breakdown, cache ROI, cost tracking, CI gates
ai claude cli context-window cost-tracking developer-tools llm mcp observability rust tokens tui
Last synced: 27 Apr 2026
https://github.com/sakebomb/mcp-recall
mcp-recall compresses MCP tool outputs (94 KB → 3.5 KB · 96%) and stores full results in SQLite for retrieval — up to 30x more tool calls per session for heavy MCP workloads.
bun claude claude-code claude-code-plugin context-window mcp sqlite typescript
Last synced: 08 Apr 2026
https://github.com/dfrostar/neuralmind
🧠 Adaptive Neural Knowledge System - 40-70x token reduction for AI code understanding
ai ai-coding chromadb code-understanding context-window knowledge-graph llm mcp semantic-search token-reduction
Last synced: 16 May 2026
https://github.com/maystudios/maxsimcli
Solve context rot in AI coding. Spec-driven development system for Claude Code, OpenCode, Gemini CLI, and Codex — with parallel subagents, atomic commits, and a live dashboard.
agentic-coding ai-tooling ai-workflow claude-code cli context-engineering context-window developer-tools gemini-cli llm meta-prompting opencode spec-driven-development subagent typescript
Last synced: 06 Apr 2026
https://github.com/argahsuknesib/toon-ld
Token Oriented Object Notation (TOON) for Linked Data
context-window json-ld knowledge-graph linked-data llm rag rdf rust semantic-web serialization token-optimization token-oriented-object-notation wasm
Last synced: 13 Jan 2026
https://github.com/mensfeld/llm-docs-builder
Transform and optimize your markdown documentation for Large Language Models (LLMs) and RAG systems. Generate llms.txt automatically.
ai ai-documentation context-window documentation large-language-models llms llms-txt rag ruby text-processing tokenization
Last synced: 20 Jan 2026
https://github.com/agusrdz/chop
CLI output compressor for Claude Code. Reduces token consumption by 50–90% by compressing verbose command output before it enters the context window. Supports 52+ commands — git, docker, kubectl, npm, terraform, and more.
claude claude-ai claude-code cli context-window developer-tools golang llm productivity token-optimization
Last synced: 03 Apr 2026
https://github.com/akadeepesh/contextzip
CLI tool for intelligent project packaging with framework detection, smart exclusions, and git-aware file selection.
ai-tools automation chatgpt claude-ai cli code-packaging context-window developer-tools devtools git github llm python zip
Last synced: 25 May 2026
https://github.com/iampantherr/securecontext
Secure memory & context optimization MCP plugin for Claude Code. Drop-in replacement for context-mode with credential isolation, SSRF protection, MemGPT-style persistent memory, and hybrid BM25+vector search. 84 security tests, zero cloud sync.
ai-agent-memory claude-code claude-code-plugin context-management context-mode-alternative context-window hybrid-search knowledge-base llm-memory mcp mcp-server memgpt persistent-memory secure-mcp zeroclaw
Last synced: 03 May 2026
https://github.com/rodaddy/mcp2cli
CLI bridge that wraps MCP servers as bash-invokable commands, recovering ~11K tokens of context window per session
ai-tools claude claude-code cli context-window developer-tools llm mcp model-context-protocol typescript
Last synced: 13 Apr 2026
https://github.com/xxxdorixxx/repotoprompt
Turns your local codebase into a secure, token-optimized context prompt for LLMs like ChatGPT and Claude.
chatgpt claude context-window devtools electron gemini llm nodejs openai productivity typescript
Last synced: 26 Nov 2025
https://github.com/paradite/llm-info
Information on LLM models, context window token limit, output token limit, pricing and more.
context-window information language-model llm models pricing token token-limit
Last synced: 13 May 2025
https://github.com/srobinson/markdown-matters
Structural markdown intelligence for LLMs — search, index, and summarize with 80% fewer tokens
ai-tools cli context-window documentation embeddings llm markdown mcp semantic-search typescript
Last synced: 04 Apr 2026
https://github.com/nambok/mentedb
A cognition-aware database engine for AI agent memory. Purpose-built in Rust with WAL, HNSW, knowledge graphs, and speculative context pre-assembly. Not a wrapper, a ground-up storage engine that thinks.
agent-memory ai ai-agents cognitive-architecture context-window database knowledge-graph langchain llm memory rag rust storage-engine vector-database
Last synced: 10 Apr 2026
https://github.com/yttrium400/reducethemtokens
Compress any code repo into a compact skeleton to reduce LLM token usage. 90%+ reduction with full structural retention.
ai-tools claude code-analysis context-window cursor developer-tools llm python token-reduction tree-sitter
Last synced: 15 Jun 2026
https://github.com/nikolay-e/treemapper
Export your entire codebase to ChatGPT/Claude in one command. Structure + contents in YAML/JSON — optimized for LLM context windows.
ai anthropic chatgpt claude cli code-context code-review code-to-prompt codebase context-window developer-tools diff-context directory-tree git-diff llm llm-context openai prompt-engineering python yaml
Last synced: 09 Apr 2026
https://github.com/juanmacruzherrera/claude-layered-memory-architecture
Three-layer memory architecture for long-term AI learning with Claude
ai-memory antrophic architecture claude claude-ai claude-code claude-desktop claude-skill claude-skills claude-skills-creator context-window education educational-ai learning long-term-memory memory-management rag skills socratic-method
Last synced: 24 Apr 2026
https://github.com/ultracontext/ultracontext-node
The context API for AI agents
agents ai ai-agents api context-api context-engineering context-management context-window llm llm-ops nodejs sdk typescript ultracontext versioning
Last synced: 13 Jan 2026
https://github.com/ultracontext/ultracontext-python
The context API for AI agents
agents ai ai-agents api context-api context-engineering context-management context-window llm llm-ops python sdk ultracontext versioning
Last synced: 14 Jan 2026
https://github.com/bgreenwell/dotagents
A proposed standard for the .agents/ directory to prevent context bloat and improve agent reasoning in complex codebases.
agent-memory agents-md ai-agents claude-code coding-assistant context context-window dotagents gemini-cli standard
Last synced: 11 Feb 2026
https://github.com/phlx0/tokenmiser
Cut Claude Code token usage 50–80%. Auto-generates a codebase index so Claude finds files instantly instead of reading everything.
ai-tools anthropic claude claude-code cli codebase-index context-window developer-tools productivity tokens
Last synced: 16 Apr 2026
https://github.com/guorunjie/codex-relay-baton-guardian
Keep Codex long tasks running while you sleep: detects compact failures and context-window overflow, preserves the latest task state, and starts one safe fork relay.
agent-recovery ai-agents codex codex-cli context-compaction context-recovery context-window desktop-automation developer-tools long-running-tasks recovery relay-baton
Last synced: 09 Jun 2026
https://github.com/yazansalhi/claude-token-saver
Claude Code plugin that blocks token-wasting Bash/Read patterns and tells the model the cheaper way. Tracks savings with /token-stats.
ai-tools claude claude-code context-window cost-optimization hooks plugin tokens
Last synced: 27 Apr 2026
https://github.com/dotcommander/repomap
Token-budgeted repository maps for LLM context windows — scan, parse, rank, budget, format. Go library + CLI.
cli code-navigation codebase context-window developer-tools go golang llm repository-map tree-sitter
Last synced: 27 Apr 2026
https://github.com/ralphmoran/ticket-lens
Privacy-first CLI that transforms Jira tickets into AI-ready briefs. 60–80% fewer tokens. No relay. Pipe-friendly.
ai atlassian cli context-window developer-productivity developer-tools devtools jira jira-api jira-server llm nodejs prompt-engineering token-optimization
Last synced: 09 Jun 2026
https://github.com/mcp-tool-shop-org/context-window-manager
MCP server for lossless LLM context restoration via KV cache persistence
ai claude context-management context-window kv-cache llm llm-inference lmcache machine-learning mcp mcp-server memory model-context-protocol python rag session-management token-management vllm
Last synced: 27 Feb 2026
https://github.com/shreesha1207/context-switcher
AI Context Bridge is a browser extension that lets you capture a full conversation from one AI chatbot and continue it on another by generating a portable, copy‑ready summary.
Last synced: 04 Apr 2026
https://github.com/adrianwedd/rlm-mcp
MCP server implementing Recursive Language Model pattern for processing arbitrarily long contexts. Enables Claude Code to work with 1M+ character documents through session-based chunking, BM25 search, and artifact provenance tracking.
bm25-search claude claude-code context-window document-processing fastmcp llm mcp model-context-protocol recursive-language-model
Last synced: 21 Feb 2026
https://github.com/tokenpak/tokenpak
Drop-in HTTP proxy that compresses LLM context, optimizes cache hits, routes smart, and tracks every dollar. Zero SDK changes required.
ai anthropic compression context-window cost-tracking developer-tools gemini llm openai proxy python token-optimization
Last synced: 09 May 2026
https://github.com/blackwell-systems/gcf
GCF: token-optimized wire format for LLM tool responses. 84% fewer tokens than JSON, 34% fewer than TOON, 100% comprehension accuracy at scale.
ai-agents code-intelligence context-window format gcf graph llm mcp model-context-protocol specification token-optimization wire-format
Last synced: 06 Jun 2026
https://github.com/frkl81/codeblast
The definitive bridge between your project and high-context LLMs. CodeBlast packs thousands of files into a single clean, structured payload in seconds. Say goodbye to the context blindness of IDE agents: extract your complete architecture, filter out binaries, and boost your AI-assisted development.
ai-tools chatgpt claude code-to-prompt context-window csharp developer-tools dotnet-10 gemini large-language-models llm productivity prompt-engineering token-counter vibe-coding wpf
Last synced: 24 May 2026
https://github.com/aemal/rag-killer
A tool that analyzes your content to determine if you need a RAG pipeline or if modern language models can handle your text directly. It compares your content's token requirements against model context windows to help you make an informed architectural decision.
ai ai-agents aiagents context-window llm rag
Last synced: 24 Apr 2026
https://github.com/aniboy2k-gif/memory-health
Claude Code skill to diagnose and optimize auto-memory files — preventing silent context truncation before it degrades your AI assistant
ai-tools anthropic-claude bash claude claude-code claude-skill context-window developer-tools llm memory-management memory-optimization productivity
Last synced: 26 Apr 2026
https://github.com/yaronkoresh/pakem
pakem is a repository packaging system designed to convert source trees into portable artifacts for analysis, indexing, sharing, and restoration workflows.
cli code-analysis codebase context-window devtools python token-counter xml
Last synced: 07 Apr 2026
https://github.com/boadij/pi-downshift
🕹️ Downshift - A tiny Pi Coding Agent extension that switches from a premium model to an economy model when your context gets expensive.
coding-agent context-window llm model-routing model-switching pi-coding-agent pi-extension pi-package subagents token-cost
Last synced: 07 Jun 2026
https://github.com/voidd0/ctxstuff
pack context for LLM prompts. token-aware file globbing for OpenAI/Anthropic/Gemini.
ai anthropic claude cli context-window devtools gemini gpt javascript llm nodejs openai tokenization
Last synced: 20 Jun 2026
https://github.com/nshkrdotcom/dexterity
Authoritative, ranked, token-budgeted codebase context for Elixir agents. Solves context window limitations by providing a deterministic Elixir semantic graph and agent-ready Repo Map for LLMs.
agent ai ai-agents anthropic-mcp codebase-analysis context context-management context-window developer-tools elixir elixir-lang graph-analysis indexer llm lsp mcp mcp-server nshkr-devtools otp pagerank
Last synced: 01 May 2026
https://github.com/haru0416-dev/coffer
Byte-exact, reversible compression of LLM tool-output, with an exact compute-digest — an MCP server and a transparent HTTP proxy.
ai-agents compression content-addressable-storage context-window llm mcp model-context-protocol proxy rust
Last synced: 22 Jun 2026
https://github.com/pallaprolus/contextcut
Pack a repository into ultra-dense, AI-optimized Markdown — gitignore-aware, noise-pruned, with token estimates
ai cli context-window developer-tools llm rust
Last synced: 12 Jun 2026
https://github.com/shayashav/flatten-mcp
Flatten Claude Code sessions: keep every prompt and event verbatim, resume at a lower token count. An MCP server.
anthropic claude claude-code context-window mcp mcp-server model-context-protocol tokens
Last synced: 13 Jun 2026
https://github.com/zircote/rlm-rs
Rust CLI implementing the Recursive Language Model (RLM) pattern for Claude Code. Process documents 100x larger than context windows through intelligent chunking, SQLite persistence, and recursive sub-LLM orchestration.
ai-tools chunking claude claude-code cli command-line context-window devtools document-processing llm mit-license mmap rayon recursive-language-model rlm rust rust-2024 semantic-chunking sqlite text-processing
Last synced: 08 Feb 2026
https://github.com/a2cr/a2cr
MCP server for client-encrypted AI agent handoffs with WorkBaton and WorkStash
agent-memory ai ai-agents claude-code codex coding-agents context-management context-window developer-tools llm mcp mcp-server model-context-protocol python workbaton workstash
Last synced: 03 Jun 2026
https://github.com/aaronlab/claude-context-visualizer
Interactive visualization of Claude context window token usage — animated SVG gauge, analytics, i18n, PWA, dark glassmorphism UI
ai-tools anthropic claude context-window glassmorphism llm no-dependencies pwa svg token-visualizer
Last synced: 02 Jun 2026
https://github.com/digital-threads/token-pilot
Save 60-80% tokens when AI reads code — MCP server for token-efficient code navigation with AST-aware structural reading
ai-coding ast claude claude-code code-navigation context-window cursor developer-tools llm mcp mcp-server model-context-protocol token-optimization tree-sitter
Last synced: 18 Apr 2026