Token Conservation

Name: Token Conservation
Author: athola

athola/claude-night-market

Cap agent context reads and token spend at session start before large diffs or long analyses blow your quota.

Install

npx skills add https://github.com/athola/claude-night-market --skill token-conservation

What is this skill?

Mandatory five-step TodoWrite workflow: quota-check, context-plan, delegation-check, compression-review, logging
Discovery read budgets before any Read: max 8 (spec build), 5 (bug fix), 1 (scoped refactor)
Treats each Read and content-mode Grep as a counted read; Glob and files-with-matches Grep are free
Tracks 5-hour rolling and weekly caps from session status before setting a max token target
alwaysApply progressive-loading skill for every session and before massive context loads

Adoption & trust: 1 installs on skills.sh; 304 GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

Recommended Skills

Microsoft Foundrymicrosoft/azure-skills

Microsoft Foundry skill guides agents through the full Azure AI Foundry lifecycle—containerizing agents, pushing to ACR,…377k installs·1.2k stars

Azure Aimicrosoft/azure-skills

azure-ai is a Prism-oriented quick reference for Microsoft Azure AI work, with the published body centered on the Azure …375k installs·1.2k stars

Azure Hosted Copilot Sdkmicrosoft/azure-skills

Azure Hosted Copilot SDK is Microsoft's entry skill for repos using @github/copilot-sdk—it detects CopilotClient usage, …346k installs·1.2k stars

Lark Eventlarksuite/cli

Lark real-time subscription skill via lark-cli event consume for building bots and streaming webhook-style agent workers…208k installs·13.7k stars

Running Claude Code Via Litellm Copilotxixu-me/skills

Running Claude Code via LiteLLM Copilot walks through pointing Claude Code at a local LiteLLM proxy that forwards Anthro…200k installs·61 stars

Setup Matt Pocock Skillsmattpocock/skills

One-time per-repo setup so Matt Pocock engineering skills share correct issue tracker, triage strings, and domain docume…180k installs·121k stars

Journey fit

Primary fit

BuildAgent skills & templates

Canonical shelf on agent-tooling because the skill governs how coding agents load context and delegate work during implementation. Fits agent-tooling as operational discipline for Claude Code-style sessions, not a one-off feature task.

Common Questions / FAQ

Is Token Conservation safe to install?

skills.sh reports 3 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.

SKILL.md

READMESKILL.md - Token Conservation

# Token Conservation Workflow

## When To Use
- Run at the start of every session and whenever prompt sizes or tool calls begin to spike.
- Mandatory before launching long-running analyses, wide diffs, or massive context loads.

## When NOT To Use

- Context-optimization already handles the scenario
- Simple queries with minimal context

## Required TodoWrite Items
1. `token-conservation:quota-check`
2. `token-conservation:context-plan`
3. `token-conservation:delegation-check`
4. `token-conservation:compression-review`
5. `token-conservation:logging`

## Step 1 – Quota Check (`quota-check`)
- Record current session duration and weekly usage (from `/status` or notebook).
  Note the 5-hour rolling cap and weekly cap highlighted in the Claude community notice.
- Capture remaining budget and set a max token target for this task.

## Step 2 – Context Plan (`context-plan`)
- **Set a discovery read budget BEFORE reading any files.** Count each `Read` call
  and each content-mode `Grep` as one read. Glob and files-with-matches Grep are free.
  - Implement from spec/requirements: **max 8 reads**
  - Bug fix at known location: **max 5 reads**
  - Refactor with known scope: **max 1 read per file being changed**
  - Open exploration: **max 15 reads**
- **Read order** (most valuable first): spec/requirements, files to modify,
  imports/interfaces, then stop and start writing.
- **When budget is spent**: ask the user if more context is needed. Do NOT
  self-authorize additional reads. Only explicit user approval overrides the budget.
- Prefer `Read` with `offset`/`limit` params or `Grep` tool over loading whole files.
  A `Read` targeting <50 lines counts as 0.5 reads. Avoid `cat`/`sed`/`awk` via Bash:
  Claude Code 2.1.21+ steers toward native file tools (Read, Edit, Write, Grep, Glob).
- **PDFs (Claude Code 2.1.30+)**: Use `Read` with `pages: "1-5"` for targeted PDF reading
  instead of loading entire documents. Large PDFs (>10 pages) return a lightweight
  reference when @-mentioned, so use the `pages` parameter to read specific sections.
  Hard limits: **100 pages max, 20MB max per PDF**. Exceeding these previously locked
  sessions permanently (fixed in 2.1.31).
- Convert prose instructions into bullet lists before prompting so only essential
  info hits the model.

## Step 3 – Delegation Check (`delegation-check`)
- Evaluate whether compute-intensive tasks can go to Qwen MCP or other external
  tooling (use `qwen-delegation` skill if needed).
- For local work, favor deterministic scripts (formatters, analyzers) instead
  of LLM reasoning when possible.

## Step 4 – Compression Review (`compression-review`)
- Summarize prior steps/results before adding new context.
  Remove redundant history, collapse logs, and avoid reposting identical code.
- Use `prompt caching` ideas: reference prior outputs instead of restating them
  when the model has already processed the information (cite snippet IDs).
- Decide whether the current thread should be compacted:
  - If only recent context is stale, use **"Summarize from here"** (Claude Code 2.1.32+)
    via the message selector to partially summarize the conversation. This preserves
    recent context while compressing older portions
  - If the active workflow is finished and earlier context will not be reused,
    instruct the user to run `/new`
  - If progress requires the existing thread but the window is bloated,
    prompt them to run `/compact` before continuing
- **Automatic memory** (Claude Code 2.1.32+): Claude now records and recalls session
  memories automatically. This adds minor token overhead but improves cross-session
  continuity. No action needed: be aware it contributes to baseli

What is this skill?

Mandatory five-step TodoWrite workflow: quota-check, context-plan, delegation-check, compression-review, logging

Discovery read budgets before any Read: max 8 (spec build), 5 (bug fix), 1 (scoped refactor)

Treats each Read and content-mode Grep as a counted read; Glob and files-with-matches Grep are free

Tracks 5-hour rolling and weekly caps from session status before setting a max token target

alwaysApply progressive-loading skill for every session and before massive context loads

Adoption & trust: 1 installs on skills.sh; 304 GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

Journey fit

Primary fit

BuildAgent skills & templates

SKILL.md

READMESKILL.md - Token Conservation

# Token Conservation Workflow

## When To Use
- Run at the start of every session and whenever prompt sizes or tool calls begin to spike.
- Mandatory before launching long-running analyses, wide diffs, or massive context loads.

## When NOT To Use

- Context-optimization already handles the scenario
- Simple queries with minimal context

## Required TodoWrite Items
1. `token-conservation:quota-check`
2. `token-conservation:context-plan`
3. `token-conservation:delegation-check`
4. `token-conservation:compression-review`
5. `token-conservation:logging`

## Step 1 – Quota Check (`quota-check`)
- Record current session duration and weekly usage (from `/status` or notebook).
  Note the 5-hour rolling cap and weekly cap highlighted in the Claude community notice.
- Capture remaining budget and set a max token target for this task.

## Step 2 – Context Plan (`context-plan`)
- **Set a discovery read budget BEFORE reading any files.** Count each `Read` call
  and each content-mode `Grep` as one read. Glob and files-with-matches Grep are free.
  - Implement from spec/requirements: **max 8 reads**
  - Bug fix at known location: **max 5 reads**
  - Refactor with known scope: **max 1 read per file being changed**
  - Open exploration: **max 15 reads**
- **Read order** (most valuable first): spec/requirements, files to modify,
  imports/interfaces, then stop and start writing.
- **When budget is spent**: ask the user if more context is needed. Do NOT
  self-authorize additional reads. Only explicit user approval overrides the budget.
- Prefer `Read` with `offset`/`limit` params or `Grep` tool over loading whole files.
  A `Read` targeting <50 lines counts as 0.5 reads. Avoid `cat`/`sed`/`awk` via Bash:
  Claude Code 2.1.21+ steers toward native file tools (Read, Edit, Write, Grep, Glob).
- **PDFs (Claude Code 2.1.30+)**: Use `Read` with `pages: "1-5"` for targeted PDF reading
  instead of loading entire documents. Large PDFs (>10 pages) return a lightweight
  reference when @-mentioned, so use the `pages` parameter to read specific sections.
  Hard limits: **100 pages max, 20MB max per PDF**. Exceeding these previously locked
  sessions permanently (fixed in 2.1.31).
- Convert prose instructions into bullet lists before prompting so only essential
  info hits the model.

## Step 3 – Delegation Check (`delegation-check`)
- Evaluate whether compute-intensive tasks can go to Qwen MCP or other external
  tooling (use `qwen-delegation` skill if needed).
- For local work, favor deterministic scripts (formatters, analyzers) instead
  of LLM reasoning when possible.

## Step 4 – Compression Review (`compression-review`)
- Summarize prior steps/results before adding new context.
  Remove redundant history, collapse logs, and avoid reposting identical code.
- Use `prompt caching` ideas: reference prior outputs instead of restating them
  when the model has already processed the information (cite snippet IDs).
- Decide whether the current thread should be compacted:
  - If only recent context is stale, use **"Summarize from here"** (Claude Code 2.1.32+)
    via the message selector to partially summarize the conversation. This preserves
    recent context while compressing older portions
  - If the active workflow is finished and earlier context will not be reused,
    instruct the user to run `/new`
  - If progress requires the existing thread but the window is bloated,
    prompt them to run `/compact` before continuing
- **Automatic memory** (Claude Code 2.1.32+): Claude now records and recalls session
  memories automatically. This adds minor token overhead but improves cross-session
  continuity. No action needed: be aware it contributes to baseli

Install

What is this skill?

Recommended Skills

Journey fit

Is Token Conservation safe to install?

SKILL.md

This week for builders

Install

What is this skill?

Recommended Skills

Journey fit

Is Token Conservation safe to install?

SKILL.md