https://github.com/outbit/claude-code-tokenbudget
Claude Code plugin that implements a token budget or quota across all backends (Bedrock, Vertex, direct API) and provides guardrails against surprise token costs. No surprise bills, no config beyond a single env var, no external dependencies.
budget claude claudecode code limit plugin quota token tokens
- Host: GitHub
- URL: https://github.com/outbit/claude-code-tokenbudget
- Owner: outbit
- License: mit
- Created: 2026-04-18T02:02:09.000Z (about 2 months ago)
- Default Branch: main
- Last Pushed: 2026-05-09T16:04:08.000Z (about 1 month ago)
- Last Synced: 2026-05-09T17:22:47.780Z (about 1 month ago)
- Topics: budget, claude, claudecode, code, limit, plugin, quota, token, tokens
- Language: Python
- Homepage:
- Size: 192 KB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
README
# Claude Code Token Quota Plugin
Claude Code has no built-in spending guardrails. This plugin tracks your token usage and blocks new prompts once you reach your limit. It supports daily, weekly (rolling 7-day), and monthly (calendar-month) quotas, and it works with **any backend**: Bedrock, Vertex, direct API, or subscription.
## How it works
| Hook | Event | Action |
|------|-------|--------|
| `enforce_quota.py` | `UserPromptSubmit` | Blocks the prompt if any quota (daily/weekly/monthly) is exceeded |
| `track_tokens.py` | `Stop` | Records token usage after each turn |
Usage is stored in one ledger file per day, `~/.claude-token-quota/YYYY-MM-DD.json`. The daily quota resets at midnight; the weekly quota uses a rolling 7-day window; the monthly quota resets on the 1st.
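To make the three windows concrete, here is a minimal sketch of how per-day totals could roll up into daily, weekly, and monthly figures. It is not the plugin's code: the `window_totals` function is hypothetical, it takes an in-memory mapping instead of reading the ledger files (whose JSON schema is not described here), and it assumes the rolling 7-day window covers today plus the six preceding days.

```python
from datetime import date, timedelta


def window_totals(daily_usage: dict[date, int], today: date) -> dict[str, int]:
    """Sum per-day token counts into daily, rolling 7-day, and calendar-month totals."""
    # Assumption: the rolling window is today plus the six days before it.
    week_start = today - timedelta(days=6)
    return {
        "daily": daily_usage.get(today, 0),
        "weekly": sum(n for d, n in daily_usage.items() if week_start <= d <= today),
        "monthly": sum(
            n for d, n in daily_usage.items()
            if (d.year, d.month) == (today.year, today.month) and d <= today
        ),
    }


# Illustrative numbers only.
usage = {
    date(2026, 4, 28): 400_000,
    date(2026, 4, 30): 300_000,
    date(2026, 5, 1): 200_000,
    date(2026, 5, 3): 100_000,
}
totals = window_totals(usage, today=date(2026, 5, 3))
print(totals)
```

In this example the weekly total still includes the late-April days, while the monthly total starts over on May 1.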
---
## Installation
```bash
claude plugin marketplace add thedavidwhiteside/claude-code-tokenbudget
claude plugin install tokenbudget@claude-code-tokenbudget
```
This installs the plugin. It will be active in all future Claude Code sessions without any extra flags.
### Try before installing
If you want to test the plugin without a permanent install:
```bash
git clone https://github.com/thedavidwhiteside/claude-code-tokenbudget.git
cd claude-code-tokenbudget
claude --plugin-dir .
```
The plugin is active only for that session. Nothing is written to your global config.
## Uninstall
```bash
claude plugin uninstall tokenbudget@claude-code-tokenbudget
```
## Configuration
Override any of these in your `~/.claude/settings.json`:
```json
{
  "env": {
    "TOKEN_QUOTA_DAILY": "1000000",
    "TOKEN_QUOTA_WEEKLY": "5000000",
    "TOKEN_QUOTA_MONTHLY": "15000000",
    "TOKEN_QUOTA_DIR": "~/.claude-token-quota",
    "TOKEN_QUOTA_RETAIN_DAYS": "31",
    "TOKEN_QUOTA_WARN_CRITICAL": "95",
    "TOKEN_QUOTA_WARN": "85",
    "TOKEN_QUOTA_SNOOZE_TOKENS": "1000000"
  }
}
```
| Variable | Default | Description |
|---|---|---|
| `TOKEN_QUOTA_DAILY` | `1000000` | Daily token limit |
| `TOKEN_QUOTA_WEEKLY` | _(unset)_ | Rolling 7-day token limit (optional) |
| `TOKEN_QUOTA_MONTHLY` | _(unset)_ | Calendar-month token limit (optional) |
| `TOKEN_QUOTA_DIR` | `~/.claude-token-quota` | Where ledger files are stored |
| `TOKEN_QUOTA_RETAIN_DAYS` | `31` | How many days of usage history to keep (≥31 required for accurate monthly totals in 31-day months) |
| `TOKEN_QUOTA_WARN_CRITICAL` | `95` | % of daily limit at which a visible warning is shown |
| `TOKEN_QUOTA_WARN` | `85` | % of daily limit at which a stderr warning is shown |
| `TOKEN_QUOTA_COST_PER_M` | _(unset)_ | Blended cost per 1M tokens — enables `~$X.XX` estimates in status output |
| `TOKEN_QUOTA_SNOOZE_TOKENS` | `1000000` | Extra tokens added to all limits when `/tokenbudget:snooze` is run |
Weekly and monthly limits are opt-in — omit them to enforce only the daily limit. When multiple limits are set, any one being exceeded blocks new prompts.
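The opt-in rule can be sketched as follows. This is an illustration, not the logic in `enforce_quota.py`: the `exceeded_quotas` helper is hypothetical, and treating a quota as exceeded when usage is greater than or equal to the limit is an assumption.

```python
def exceeded_quotas(totals: dict[str, int], env: dict[str, str]) -> list[str]:
    """Return the names of the quotas whose usage has reached the configured limit."""
    limits = {
        "daily": env.get("TOKEN_QUOTA_DAILY", "1000000"),  # documented default
        "weekly": env.get("TOKEN_QUOTA_WEEKLY"),  # unset means not enforced
        "monthly": env.get("TOKEN_QUOTA_MONTHLY"),  # unset means not enforced
    }
    # Assumption: "exceeded" means usage >= limit.
    return [
        name
        for name, limit in limits.items()
        if limit is not None and totals.get(name, 0) >= int(limit)
    ]


# Only the weekly limit is configured; the daily limit falls back to its default.
env = {"TOKEN_QUOTA_WEEKLY": "5000000"}
totals = {"daily": 250_000, "weekly": 5_200_000, "monthly": 9_000_000}
blocked_by = exceeded_quotas(totals, env)
print(blocked_by)  # ['weekly']
```

A non-empty result would correspond to a blocked prompt. The monthly total is ignored here because `TOKEN_QUOTA_MONTHLY` is unset.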
**Rough token budgets by spend goal — AWS Bedrock example (Claude Sonnet 4.6):**
> **Note:** Prices below are AWS Bedrock examples only and will change. For current rates check the [AWS Bedrock pricing page](https://aws.amazon.com/bedrock/pricing/). Direct API users: see the [Anthropic pricing page](https://www.anthropic.com/pricing) for your model's rates, then apply the same blended-cost formula below.
Sonnet 4.6 standard pricing on Bedrock: ~$3.00 / 1M input tokens, ~$15.00 / 1M output tokens.
Assuming a ~4:1 input-to-output ratio, blended cost is roughly $5.40 / 1M tokens.
| Daily spend goal | ~Token budget |
|---|---|
| ~$5/day | 925,000 |
| ~$10/day | 1,850,000 |
| ~$20/day | 3,700,000 |
The default is 1,000,000 tokens/day (~$5.40/day at the example rates).
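The arithmetic behind the table can be reproduced with a short script. The function names are illustrative and the prices are the example figures above, not current rates.

```python
def blended_cost_per_million(
    input_price: float, output_price: float, input_parts: float, output_parts: float
) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


def token_budget(daily_spend: float, cost_per_million: float) -> int:
    """Number of tokens a daily spend goal buys at the blended rate."""
    return int(daily_spend / cost_per_million * 1_000_000)


# Example rates from the text: $3.00 input, $15.00 output, 4:1 input-to-output.
blended = blended_cost_per_million(3.00, 15.00, input_parts=4, output_parts=1)
print(f"blended: ${blended:.2f} per 1M tokens")

for spend in (5, 10, 20):
    print(f"${spend}/day -> {token_budget(spend, blended):,} tokens")
```

The table rounds these results down to convenient figures. To adapt it to another model or backend, substitute that model's input and output prices and your own observed input-to-output ratio.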
---
## Check status
Run `/tokenbudget:status` inside any Claude Code session to see today's usage.

When a quota is exceeded, new prompts are blocked until the quota resets or you run `/tokenbudget:snooze` to raise the limits.

---
## FAQ
**Why not just set a spending limit in Claude.ai?**
Claude.ai spending limits only apply to your claude.ai subscription. If you're using Claude Code through the direct API, AWS Bedrock, or Vertex AI, those limits don't apply — your API key has no built-in cap. This plugin fills that gap by enforcing a hard stop at the Claude Code layer, regardless of which backend you're on.
---
## Caveats
- Token counts are read from the session transcript after each turn. They should be accurate but may differ slightly from your provider's bill (for example, your AWS bill) due to rounding.
- The enforcer checks usage *before* a turn starts, so the very last turn before the limit may slightly exceed it (same behavior as Anthropic's own quota system).
- Requires Python 3.10+ (no external dependencies).
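
The overshoot described in the second caveat follows from checking before a turn and recording after it. The simulation below is a simplified model with hypothetical names and made-up turn sizes, not the plugin's hooks.

```python
def run_turns(turn_sizes: list[int], limit: int) -> tuple[int, int]:
    """Return (tokens used, turns allowed) when the limit is checked before each turn."""
    used = 0
    allowed = 0
    for size in turn_sizes:
        if used >= limit:  # the check happens before the turn starts
            break
        used += size  # the turn's cost is only known after it finishes
        allowed += 1
    return used, allowed


used, allowed = run_turns([400_000, 400_000, 400_000, 400_000], limit=1_000_000)
print(used, allowed)  # 1200000 3
```

The third turn starts at 800,000 tokens, which is under the limit, so it runs and pushes the total to 1,200,000. The fourth turn is blocked.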
## Running tests
```bash
python3 -m unittest tests/test_plugin.py -v
```
---
## Contributing
See [CONTRIBUTING.md](CONTRIBUTING.md).