Get Token Insights

Name: Get Token Insights
Author: gupsammy

gupsammy/claudest

A solo builder uses get-token-insights to warn when prompt cache is about to expire due to idle time, avoiding costly token re-creation on the next API call.

Install

npx skills add https://github.com/gupsammy/claudest --skill get-token-insights

What is this skill?

Warns before 5-min prompt cache expiry
Prevents costly token re-creation
Blocks duplicate warnings per idle gap

Adoption & trust: 1 installs on skills.sh; 253 GitHub stars; 2/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

Recommended Skills

Azure Diagnosticsmicrosoft/azure-skills

Azure Diagnostics walks agents through systematic production troubleshooting on Azure—checking resource health, AppLens …374k installs·1.2k stars

Diagnosemattpocock/skills

Matt Pocock-style diagnose skill that prioritizes deterministic pass/fail signals then walks through structured debuggin…187k installs·121k stars

Systematic Debuggingobra/superpowers

Systematic Debugging is an agent skill that forces a root-cause-first workflow before any proposed fix for bugs, test fa…134k installs·221k stars

Safe Debuglllllllama/rigorpilot-skills

safe-debug implements Rigor Debug / Rigor Audit mode for deep-learning research repos: your agent reads the traceback or…32.3k installs·412 stars

Mastramastra-ai/skills

The mastra skill is a structured troubleshooting companion for solo builders shipping TypeScript agents on the Mastra fr…18.5k installs·57 stars

Insforge Debuginsforge/agent-skills

InsForge Debug guides solo builders through structured diagnosis on InsForge-backed projects when something breaks in pr…9.2k installs·27 stars

Journey fit

Primary fit

OperateMonitoring & observability

This tool operates in the "operate" stage because it monitors runtime token cache behavior and prevents performance degradation in active sessions. The monitoring subphase is appropriate because the tool actively tracks cache TTL and alerts users to state changes that impact token efficiency.

Common Questions / FAQ

Is Get Token Insights safe to install?

skills.sh reports 2 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.

SKILL.md

READMESKILL.md - Get Token Insights

#!/usr/bin/env python3
"""
UserPromptSubmit hook: warn once when prompt cache has likely expired.

Claude Code uses the 5-minute prompt cache TTL (ephemeral_5m). When the user
has been idle >5 minutes since Claude last responded, the cached context
will expire on this turn — costing full re-creation tokens. This hook fires
before the prompt is sent and blocks once per idle gap so the user can
compact before the expensive re-creation happens.

State: ~/.claude-memory/cache-warn/<session_id>.json (written by cache-warn-stop.py)
  - last_stop_time: ISO timestamp of last Claude response
  - warned_gaps: list of gap buckets already warned — prevents double-warn

Gap bucket = floor(last_stop_time_unix / 60) — stable per idle gap even if
the hook fires multiple times after the same idle period.
"""

from __future__ import annotations

import json
import math
import sys
from datetime import datetime, timezone
from pathlib import Path

CACHE_TTL_SECONDS = 300       # Claude Code uses 5-minute prompt cache TTL (ephemeral_5m, changed ~Apr 3 2026)
WARN_THRESHOLD_SECONDS = 300  # Warn at exactly 5 min (TTL expiry)
CACHE_WARN_DIR = Path.home() / ".claude-memory" / "cache-warn"


def _gap_bucket(last_stop_time_iso: str) -> str:
    """Stable ID for this idle gap — floor of last_stop_time in whole minutes."""
    dt = datetime.fromisoformat(last_stop_time_iso.replace("Z", "+00:00"))
    return str(math.floor(dt.timestamp() / 60))


def _format_tokens(n: int) -> str:
    if n >= 1000:
        return f"{n / 1000:.0f}k"
    return str(n)


def _safe_state_path(cache_dir: Path, prefix: str, session_id: str) -> Path | None:
    """Return resolved path only if it stays within cache_dir; else None."""
    try:
        candidate = (cache_dir / f"{prefix}{session_id}.json").resolve()
        candidate.relative_to(cache_dir.resolve())
        return candidate
    except (ValueError, OSError, RuntimeError):
        return None


def check_resume_warn(session_id: str) -> str | None:
    """If a resume-pending flag exists for this session, consume it and return a warning."""
    flag_path = _safe_state_path(CACHE_WARN_DIR, "resume-pending-", session_id)
    if flag_path is None or not flag_path.exists():
        return None

    try:
        flag = json.loads(flag_path.read_text())
        cached_tokens = flag.get("cached_tokens", 0)
    except (json.JSONDecodeError, OSError):
        cached_tokens = 0
    finally:
        try:
            flag_path.unlink()
        except OSError:
            pass

    token_str = f"~{_format_tokens(cached_tokens)} cached tokens" if cached_tokens else "cached context"
    return (
        f"Resumed session: this turn will re-process {token_str} from scratch "
        f"(5-minute cache TTL likely expired while session was inactive).\n"
        f"Run /compact to reduce re-creation cost, then re-send — or re-send now to proceed."
    )


def main() -> None:
    try:
        _main()
    except Exception:
        # Never hard-block a UserPromptSubmit hook — always let the prompt through
        print(json.dumps({"continue": True}))


def _main() -> None:
    try:
        hook_input = json.load(sys.stdin)
    except (json.JSONDecodeError, EOFError):
        hook_input = {}

    session_id = hook_input.get("session_id", "")
    if not session_id:
        print(json.dumps({"continue": True}))
        return

    # Skip cache warning for system-generated messages — these are not human prompts
    # and blocking them breaks background task result delivery.
    _prompt = hook_input.get("prompt", "").lstrip()
    _SYSTEM_PREFIXES = (
        "<task-notification>",
        "<local-command-caveat>",
        "<command-name>",
        "<command-message>",
    )
    if any(_prompt.startswith(p) for p in _SYSTEM_PREFIXES):
        print(json.dumps({"continue": True}))
        return

    # Check resume warning first — takes priority over idle-gap warning
    resume_warning = check_resume_warn(session_id)
    if resume_warning:
        # A

Journey fit

Primary fit

OperateMonitoring & observability

SKILL.md

READMESKILL.md - Get Token Insights

#!/usr/bin/env python3
"""
UserPromptSubmit hook: warn once when prompt cache has likely expired.

Claude Code uses the 5-minute prompt cache TTL (ephemeral_5m). When the user
has been idle >5 minutes since Claude last responded, the cached context
will expire on this turn — costing full re-creation tokens. This hook fires
before the prompt is sent and blocks once per idle gap so the user can
compact before the expensive re-creation happens.

State: ~/.claude-memory/cache-warn/<session_id>.json (written by cache-warn-stop.py)
  - last_stop_time: ISO timestamp of last Claude response
  - warned_gaps: list of gap buckets already warned — prevents double-warn

Gap bucket = floor(last_stop_time_unix / 60) — stable per idle gap even if
the hook fires multiple times after the same idle period.
"""

from __future__ import annotations

import json
import math
import sys
from datetime import datetime, timezone
from pathlib import Path

CACHE_TTL_SECONDS = 300       # Claude Code uses 5-minute prompt cache TTL (ephemeral_5m, changed ~Apr 3 2026)
WARN_THRESHOLD_SECONDS = 300  # Warn at exactly 5 min (TTL expiry)
CACHE_WARN_DIR = Path.home() / ".claude-memory" / "cache-warn"


def _gap_bucket(last_stop_time_iso: str) -> str:
    """Stable ID for this idle gap — floor of last_stop_time in whole minutes."""
    dt = datetime.fromisoformat(last_stop_time_iso.replace("Z", "+00:00"))
    return str(math.floor(dt.timestamp() / 60))


def _format_tokens(n: int) -> str:
    if n >= 1000:
        return f"{n / 1000:.0f}k"
    return str(n)


def _safe_state_path(cache_dir: Path, prefix: str, session_id: str) -> Path | None:
    """Return resolved path only if it stays within cache_dir; else None."""
    try:
        candidate = (cache_dir / f"{prefix}{session_id}.json").resolve()
        candidate.relative_to(cache_dir.resolve())
        return candidate
    except (ValueError, OSError, RuntimeError):
        return None


def check_resume_warn(session_id: str) -> str | None:
    """If a resume-pending flag exists for this session, consume it and return a warning."""
    flag_path = _safe_state_path(CACHE_WARN_DIR, "resume-pending-", session_id)
    if flag_path is None or not flag_path.exists():
        return None

    try:
        flag = json.loads(flag_path.read_text())
        cached_tokens = flag.get("cached_tokens", 0)
    except (json.JSONDecodeError, OSError):
        cached_tokens = 0
    finally:
        try:
            flag_path.unlink()
        except OSError:
            pass

    token_str = f"~{_format_tokens(cached_tokens)} cached tokens" if cached_tokens else "cached context"
    return (
        f"Resumed session: this turn will re-process {token_str} from scratch "
        f"(5-minute cache TTL likely expired while session was inactive).\n"
        f"Run /compact to reduce re-creation cost, then re-send — or re-send now to proceed."
    )


def main() -> None:
    try:
        _main()
    except Exception:
        # Never hard-block a UserPromptSubmit hook — always let the prompt through
        print(json.dumps({"continue": True}))


def _main() -> None:
    try:
        hook_input = json.load(sys.stdin)
    except (json.JSONDecodeError, EOFError):
        hook_input = {}

    session_id = hook_input.get("session_id", "")
    if not session_id:
        print(json.dumps({"continue": True}))
        return

    # Skip cache warning for system-generated messages — these are not human prompts
    # and blocking them breaks background task result delivery.
    _prompt = hook_input.get("prompt", "").lstrip()
    _SYSTEM_PREFIXES = (
        "<task-notification>",
        "<local-command-caveat>",
        "<command-name>",
        "<command-message>",
    )
    if any(_prompt.startswith(p) for p in _SYSTEM_PREFIXES):
        print(json.dumps({"continue": True}))
        return

    # Check resume warning first — takes priority over idle-gap warning
    resume_warning = check_resume_warn(session_id)
    if resume_warning:
        # A

Install

What is this skill?

Recommended Skills

Journey fit

Is Get Token Insights safe to install?

SKILL.md

This week for builders

Install

What is this skill?

Recommended Skills

Journey fit

Is Get Token Insights safe to install?

SKILL.md