https://github.com/profrandom92/comptext-revolution
π CompText Revolution β token-efficient DSL kernel for LLM agents. DSL compiler + MCP server + SQLite FTS5 index + session memory + sandbox runner in one unified system.
https://github.com/profrandom92/comptext-revolution
Last synced: 26 days ago
JSON representation
π CompText Revolution β token-efficient DSL kernel for LLM agents. DSL compiler + MCP server + SQLite FTS5 index + session memory + sandbox runner in one unified system.
- Host: GitHub
- URL: https://github.com/profrandom92/comptext-revolution
- Owner: ProfRandom92
- License: mit
- Created: 2026-04-28T09:48:14.000Z (about 2 months ago)
- Default Branch: main
- Last Pushed: 2026-04-29T09:27:13.000Z (about 2 months ago)
- Last Synced: 2026-04-29T10:13:10.316Z (about 2 months ago)
- Language: JavaScript
- Size: 537 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
```
βββββββ βββββββ ββββ βββββββββββ ββββββββββββββββββββ ββββββββββββ
ββββββββββββββββββββββ βββββββββββββββββββββββββββββββββββββββββββββββ
βββ βββ ββββββββββββββββββββββ βββ ββββββ ββββββ βββ
βββ βββ βββββββββββββββββββββ βββ ββββββ ββββββ βββ
ββββββββββββββββββββ βββ ββββββ βββ ββββββββββββ βββ βββ
βββββββ βββββββ βββ ββββββ βββ βββββββββββ βββ βββ
βββββββ βββββββββββ βββ βββββββ βββ βββ βββββββββββββββ βββββββ ββββ βββ
βββββββββββββββββββ βββββββββββββββ βββ βββββββββββββββββββββββββββββ βββ
ββββββββββββββ βββ ββββββ ββββββ βββ βββ βββ ββββββ βββββββββ βββ
ββββββββββββββ ββββ βββββββ ββββββ βββ βββ βββ ββββββ βββββββββββββ
βββ βββββββββββ βββββββ ββββββββββββββββββββββββββ βββ βββββββββββββββ ββββββ
βββ βββββββββββ βββββ βββββββ ββββββββ βββββββ βββ βββ βββββββ βββ βββββ
```
### The Universal Token Compression Platform for LLMs
*Compress smarter. Remember more. Spend less.*
[](https://www.typescriptlang.org/)
[](https://www.python.org/)
[](https://fastapi.tiangolo.com/)
[](./k8s/)
[](https://modelcontextprotocol.io/)
[](https://vitest.dev/)
[](LICENSE)
```
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β "Please provide comprehensive documentation for all function parameters" β
β β CompText L5 β
β "prvd docs fn params" β 58% fewer tokens β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
```
---
## Table of Contents
- [Why CompText?](#-why-comptext)
- [Architecture](#-architecture)
- [KVTC Compression Engine](#-kvtc-compression-engine)
- [MCP Tools](#-mcp-tools-15-total)
- [Memory Palace](#-memory-palace)
- [Quick Start](#-quick-start)
- [Monorepo Structure](#-monorepo-structure)
- [Python REST API](#-python-rest-api)
- [Test Suite](#-test-suite)
- [Kubernetes Deployment](#-kubernetes-deployment)
- [Performance](#-performance)
---
## β‘ Why CompText?
Every token costs money. Every token wastes latency. CompText solves both.
```
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β TOKEN COST REALITY β
β β
β Raw prompt: ββββββββββββββββββββββββββββββββ 1,200 tokens $0.018β
β CompText L2: βββββββββββββββββββββ 750 tokens $0.011β
β CompText L5: ββββββββββββ 430 tokens $0.006β
β β
β At 10,000 daily requests β save $120/day β $43,800/year β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
```
**CompText Revolution** is a production-grade, dual-stack platform (TypeScript + Python) that:
- **Compresses** any text through 5 progressive levels (whitespace β skeleton)
- **Remembers** context in a hierarchical `[[Palace:Wing:Room]]` memory system
- **Indexes** documents into SQLite FTS5 with BM25 semantic ranking
- **Exposes** 15 MCP tools to Claude Desktop with Python-first routing + TypeScript fallback
- **Scales** to Kubernetes with Canary rollout, Prometheus metrics, and GitOps
---
## π Architecture
```
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β COMPTEXT REVOLUTION β
β β
β βββββββββββββββββββ β
β β Claude Desktop β βββββββββ MCP Protocol (stdio JSON-RPC) β
β βββββββββββββββββββ β β
β β β
β βββββββββββββββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββββ β
β β MCP SERVER (TypeScript) β β
β β β β
β β ββββββββββββββββββββ ββββββββββββββββββββ β β
β β β Tool Handler βββββΊβ Python Bridge β βββΊ HTTP :8000 β β
β β β (15 tools) β β (HTTP client) β β β
β β ββββββββββββββββββββ ββββββββββββββββββββ β β
β β β β β
β β ββββΊ TypeScript Fallback Engine (offline mode) β β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β HTTP/REST β
β βββββββββββββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββββ β
β β PYTHON BACKEND (FastAPI) β β
β β β β
β β ββββββββββββ βββββββββββββ ββββββββββββ ββββββββββββββββ β β
β β β KVTC β β MemPalace β β CAS β β Database β β β
β β β 5 levels β β [[P:W:R]] β β SHA-256 β β SQLite FTS5 β β β
β β ββββββββββββ βββββββββββββ ββββββββββββ ββββββββββββββββ β β
β β β β
β β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β β
β β β Prometheus Metrics Β· SafetyGate β β β
β β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β @comptext/core (TypeScript Β· tsup) β β
β β ββββββββββββββ βββββββββββββββββ ββββββββββββββββββββββββ β β
β β β Compiler β β LLM Tokenizer β β Hybrid DSL Router β β β
β β β Levels 1-5 β β js-tiktoken β β text/query/config/ β β β
β β ββββββββββββββ βββββββββββββββββ ββββββββββββββββββββββββ β β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
```
---
## π§ KVTC Compression Engine
**Knowledge Vector Token Compression** β 5 progressive levels, each building on the last.
```
INPUT: "This is basically a comprehensive documentation system for the configuration parameters."
β
ββ Level 1 Whitespace Normalization
β "This is basically a comprehensive documentation system for the configuration parameters."
β Trims, collapses spaces, normalizes newlines
β
ββ Level 2 Filler Word Removal
β "comprehensive documentation system configuration parameters."
β Removes: basically, actually, really, just, very, quiteβ¦
β
ββ Level 3 Article & Stop Word Removal
β "comprehensive documentation system configuration parameters."
β Removes: a, an, the, and, for, of, in, to, withβ¦
β
ββ Level 4 Abbreviation Substitution
β "compr docs sys cfg params."
β Maps ~200 common words to short codes
β
ββ Level 5 Skeleton Compression (most aggressive)
"cmpr docs sys cfg prms."
Removes vowels from non-essential syllables
```
| Level | Name | Typical Reduction | Best For |
|:-----:|------|:-----------------:|----------|
| 1 | Normalize | ~5% | Always-on baseline |
| 2 | Filler | ~15% | Casual conversation |
| 3 | Articles | ~25% | Technical documentation |
| 4 | Abbreviate | ~35% | Dense code / config |
| 5 | Skeleton | ~45β55% | Maximum token savings |
```python
from ct_vault_core.kvtc import KVTCContextController
kvtc = KVTCContextController()
result = kvtc.compress(
"Please provide comprehensive documentation for all function parameters",
level=5
)
# result.compressed β "prvd docs fn params"
# result.savings_pct β 58.3
# result.tokens_in β 12
# result.tokens_out β 5
```
---
## π MCP Tools (15 Total)
All tools route to the Python backend first; TypeScript fallback activates automatically if Python is offline.
```
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β TOOL MAP β
β β
β COMPRESSION ββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β ct_compress Compress a document (level 1-5) β
β ct_compress_batch Compress multiple docs in one call β
β ct_encode Encode to [CT:v1:Lx] DSL format with metadata β
β ct_parse Parse and extract CompText header + content β
β ct_compress_output Auto-escalate levels until token budget is met β
β β
β MEMORY βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β mem_remember Store content in [[Palace:Wing:Room:Drawer]] β
β mem_recall BM25 search across all memory locations β
β mem_list List all palace / wing / room locations β
β mem_delete Prune a specific memory location β
β β
β CONTEXT ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β ctx_index Index a document into SQLite FTS5 β
β ctx_search Full-text search with BM25 ranking β
β ctx_checkpoint Save current session state snapshot β
β β
β STORAGE ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β cas_store Content-addressed store β returns SHA-256 hash β
β cas_fetch Retrieve blob by SHA-256 hash β
β β
β METRICS ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β ct_token_stats Global compression stats & savings report β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
```
### Claude Desktop Integration
```json
// %APPDATA%\Claude\claude_desktop.json
{
"mcpServers": {
"comptext": {
"command": "node",
"args": ["C:/path/to/comptext-revolution/packages/mcp-server/dist/index.js"],
"env": { "PYTHON_BACKEND_URL": "http://localhost:8000" }
}
}
}
```
---
## π Memory Palace
Hierarchical, persistent memory with LOCI-style spatial addressing.
```
[[Palace : Wing : Room : Drawer]]
β β β ββ Optional fine-grained slot
β β ββ Specific topic within the wing
β ββ Domain or category
ββ Top-level namespace
[[math:algebra:equations]] β all quadratic notes
[[project:comptext:decisions]] β architectural decisions
[[user:preferences:coding:style]] β personal coding preferences
[[research:papers:attention:notes]] β annotation for a paper
```
```python
# Store
await palace.remember("math", "algebra", "quadratic",
"x = (-b Β± β(bΒ²-4ac)) / 2a")
# BM25 semantic search
results = await palace.recall("quadratic formula")
# β [{"location": "[[math:algebra:quadratic]]", "content": "..."}]
# List all locations in a palace
locations = palace.list_all(palace_filter="math")
# Delete a room
palace.delete("math", "algebra", "quadratic")
```
---
## π Quick Start
### Prerequisites
```bash
node --version # 18+
pnpm --version # 9+
python3 --version # 3.10+
```
### 1. Clone & Install
```bash
git clone https://github.com/ProfRandom92/comptext-revolution.git
cd comptext-revolution
pnpm install
```
### 2. Build TypeScript
```bash
pnpm build
# Builds all 7 packages via tsup (ESM + CJS + .d.ts)
```
### 3. Install Python Backend
```bash
cd packages-py
pip install -e ".[dev]"
```
### 4. Start Services
```bash
# Terminal A β Python REST API (port 8000)
cd packages-py
uvicorn ct_vault_core.rest_api:app --host 0.0.0.0 --port 8000 --reload
# Terminal B β MCP Server
node packages/mcp-server/dist/index.js
```
### 5. CLI
```bash
# Compress a file
pnpm cli compress ./my-doc.txt --level 3
# Compress stdin
echo "Hello world this is a verbose sentence" | pnpm cli compress --level 5
# Token savings report
pnpm cli stats
```
---
## π Monorepo Structure
```
comptext-revolution/
β
βββ packages/ TypeScript (pnpm workspace)
β βββ core/ @comptext/core
β β βββ src/
β β β βββ compiler.ts Levels 1-5 DSL compiler
β β β βββ levels.ts applyLevel1..5 implementations
β β β βββ hybrid.ts detectInputType + compressHybrid
β β β βββ llm-tokenizer.ts js-tiktoken BPE counter
β β β βββ decompressor.ts decompress + decompressLevel
β β β βββ __tests__/ 52 vitest tests
β β βββ package.json tsup build, exports map
β β
β βββ indexer/ SQLite FTS5 + BM25
β βββ session-memory/ Palace/Wing/Room snapshots
β βββ sandbox-runner/ Isolated Python/Bash execution
β βββ mcp-server/
β β βββ src/
β β βββ tools.ts 15 tool definitions (JSON Schema)
β β βββ tool-handler.ts switch router β bridge or fallback
β β βββ python-bridge.ts HTTP client to :8000
β βββ sdk/ Programmatic TypeScript API
β
βββ packages-py/ Python backend
β βββ ct_vault_core/
β β βββ kvtc.py KVTCContextController (5 levels)
β β βββ mem_palace.py MemPalaceDB + LOCI addressing
β β βββ cas.py ContentAddressedStore (SHA-256)
β β βββ database.py Async SQLite + FTS5
β β βββ rest_api.py FastAPI app (15 endpoints)
β β βββ safety_gate.py Risk scoring + rate limiting
β βββ tests/ 32 pytest tests
β
βββ apps/cli/ CLI application
βββ k8s/ 8 Kubernetes manifests
βββ docker-compose.yml Local dev orchestration
βββ Dockerfile.python Python backend image
βββ Dockerfile.mcp MCP server image
```
---
## π Python REST API
FastAPI backend on `http://localhost:8000`
| Method | Endpoint | Description |
|:------:|----------|-------------|
| `POST` | `/compress` | Compress text at level 1-5 |
| `POST` | `/remember` | Store in MemPalace |
| `POST` | `/recall` | BM25 search in MemPalace |
| `GET` | `/mem/list` | List all memory locations |
| `POST` | `/mem/delete` | Delete a memory location |
| `POST` | `/cas/store` | Store blob, receive SHA-256 |
| `GET` | `/cas/{sha}` | Retrieve blob by hash |
| `POST` | `/ctx/checkpoint` | Save session snapshot |
| `POST` | `/encode` | Encode to `[CT:v1:Lx]` DSL |
| `POST` | `/parse` | Parse CompText format |
| `POST` | `/compress-output` | Auto-escalate to token budget |
| `GET` | `/token-stats` | Global compression metrics |
| `GET` | `/health` | Health check |
| `GET` | `/metrics` | Prometheus metrics |
```bash
# Compress at level 4
curl -s -X POST http://localhost:8000/compress \
-H "Content-Type: application/json" \
-d '{"text": "Please provide comprehensive documentation", "level": 4}'
# β {"compressed":"prvd compr docs","savings_pct":45.2,"tokens_in":6,"tokens_out":3}
# Interactive API docs
open http://localhost:8000/docs
```
---
## π§ͺ Test Suite
**84 tests β all passing**
```
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β β
β TypeScript (vitest) 52 / 52 β β
β ββ compress.test.ts 19 tests β
β ββ compression.integration.test.ts 13 tests β
β ββ hybrid.test.ts 12 tests β
β ββ llm-tokenizer.test.ts 8 tests β
β β
β Python (pytest) 32 / 32 β β
β ββ test_kvtc.py 6 tests β
β ββ test_mem_palace.py 8 tests β
β ββ test_database.py 5 tests β
β ββ test_safety_gate.py 5 tests β
β ββ test_integration.py 8 tests β
β β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
```
```bash
# TypeScript
pnpm --filter @comptext/core test run
# Python
cd packages-py && python -m pytest tests/ -v
# Both at once
pnpm test && cd packages-py && python -m pytest tests/ -q
```
---
## βΈοΈ Kubernetes Deployment
Production-ready manifests with Canary rollout via Flagger.
```
k8s/
βββ namespace.yaml comptext-revolution namespace
βββ deployment.yaml MCP server (3 replicas Β· HPA 3-10)
βββ deployment-python.yaml Python backend (3 replicas)
βββ service.yaml ClusterIP services
βββ ingress.yaml NGINX ingress + TLS
βββ hpa.yaml HorizontalPodAutoscaler
βββ pvc.yaml PersistentVolumeClaim (vault data)
βββ canary.yaml Flagger Canary β 3-phase, 22 days
```
**Canary phases:**
```
Phase 1 (Days 1β7) 10% traffic ββ latency p99 < 200ms βββΊ pass
Phase 2 (Days 8β15) 40% traffic ββ error rate < 1% βββΊ pass
Phase 3 (Days 16β22) 90% traffic ββ all metrics green βββΊ promote
```
```bash
kubectl apply -f k8s/
kubectl get canary -n comptext-revolution -w
```
---
## π Performance
```
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β BENCHMARK RESULTS β
β β
β Compression Latency β
β ββ L1-L3 < 1 ms ββββ β
β ββ L4-L5 < 5 ms ββββββββ β
β ββ p99 8 ms ββββββββββββ β
β β
β Token Savings (typical prose) β
β ββ Level 2 15-20 % ββββββββ β
β ββ Level 3 20-30 % ββββββββββββ β
β ββ Level 4 30-40 % ββββββββββββββββ β
β ββ Level 5 40-55 % ββββββββββββββββββββββββ β
β β
β Throughput β
β ββ Compress API 5,000 req/s β
β ββ FTS5 index 10,000 docs/min β
β ββ MemPalace recall O(1) palace lookup β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
```
---
## π Related
| Repo | Role |
|------|------|
| [comptext-codex](https://github.com/ProfRandom92/comptext-codex) | Original Python compression foundation |
| [comptext-mcp-server](https://github.com/ProfRandom92/comptext-mcp-server) | Original MCP integration |
| [comptext-dsl](https://github.com/ProfRandom92/comptext-dsl) | DSL specification |
> This monorepo consolidates all three into a single production platform.
---
## π€ Contributing
PRs, issues, and ideas welcome. The codebase is fully typed (TypeScript strict + Python type hints) and test-driven.
```bash
pnpm install && pnpm build
cd packages-py && pip install -e ".[dev]"
# Verify
pnpm --filter @comptext/core test run # 52/52
cd packages-py && python -m pytest # 32/32
```
---
**MIT License Β© 2026 ProfRandom92**
*Built with TypeScript Β· Python Β· FastAPI Β· SQLite Β· Kubernetes*
```
Every token counts.
Compress the revolution.
```