Safe Debug

Name: Safe Debug
Author: lllllllama

lllllllama/rigorpilot-skills

176k installs
512 repo stars
Updated July 26, 2026
lllllllama/rigorpilot-skills

safe-debug is a Claude Code skill for conservative diagnosis and minimal patching of deep learning errors with explicit human approval.

About

A debugging skill for deep learning research. Use it when you have a traceback, training failure, or error symptom and want conservative diagnosis before any code changes.

Conservative diagnosis of tracebacks, CUDA OOM, and shape mismatches
Proposes minimal fixes with explicit human approval before patching
Separates debug fixes from research contributions clearly

Safe Debug by the numbers

175,924 all-time installs (skills.sh)
+25,325 installs in the week ending Jul 28, 2026 (Skillselion tracking)
Ranked #6 of 610 Debugging skills by installs in the Skillselion catalog
Security screen: LOW risk (skills.sh audit)
Data as of Jul 28, 2026 (Skillselion catalog sync)

At a glance

safe-debug capabilities & compatibility

Capabilities: error diagnosis · root cause narrowing · patch planning · evidence collection
Use cases: debugging · code review · research

npx skills add https://github.com/lllllllama/rigorpilot-skills --skill safe-debug

Add your badge

Show developers this skill is listed on Skillselion. Paste this into your README.

[![Listed on Skillselion](https://skillselion.com/badge/skills/lllllllama/rigorpilot-skills/safe-debug.svg)](https://skillselion.com/skills/lllllllama/rigorpilot-skills/safe-debug)

Installs	176k
repo stars	★ 512
Security audit	3 / 3 scanners passed
Last updated	July 26, 2026
Repository	lllllllama/rigorpilot-skills ↗

What it does

Diagnose deep learning training and inference errors with minimal patch suggestions.

Who is it for?

Diagnosing tracebacks,CUDA OOM analysis,NaN loss investigation,Training failure diagnosis

Skip if: Broad refactoring,Speculative adaptation,Automatic exploratory patching,General repository familiarization

When should I use this skill?

The user provides a traceback, terminal error, or concrete training/inference failure symptom and wants diagnosis with minimal patch suggestions.

What you get

DIAGNOSIS.md identifying root cause, PATCH_PLAN.md with minimal suggested fixes, and status.json documenting the issue.

debug_outputs/DIAGNOSIS.md
debug_outputs/PATCH_PLAN.md
debug_outputs/status.json

By the numbers

3 structured debug output files: DIAGNOSIS.md, PATCH_PLAN.md, status.json

Files

SKILL.mdMarkdownGitHub ↗

safe-debug

Use this as the Rigor Debug / Rigor Audit skill. The installed slug remains safe-debug for compatibility.

Use the shared operating principles in ../../references/agent-operating-principles.md; this skill should guide conservative diagnosis without blocking the model from finding the local root cause.

When to apply

The user provides a traceback, terminal error, or concrete training or inference failure symptom.
The user wants diagnosis, root-cause narrowing, and minimal patch suggestions before code is changed.
The user wants a safe debug flow with explicit human approval before mutation.

When not to apply

When the user wants a broad repository walkthrough without an active failure.
When the task is speculative experimentation or code adaptation.
When the user is asking for a large refactor or readability rewrite.

Clear boundaries

Diagnose first.
Do not modify repository code by default.
If a patch is needed, propose the smallest fix and require explicit approval first.
Escalate savepoint or branch creation before medium-risk or high-risk changes.
A debug fix is not automatically a research contribution; if it changes

experiment meaning or comparability, say so explicitly.

Output expectations

debug_outputs/DIAGNOSIS.md
debug_outputs/PATCH_PLAN.md
debug_outputs/status.json

Notes

Use references/debug-policy.md, ../../references/research-rigor-principles.md, and the shared references/research-pitfall-checklist.md.

display_name: Rigor Debug / Rigor Audit
short_description: Rigor Debug / Rigor Audit mode for conservative failure diagnosis before patching.
default_prompt: Diagnose this deep learning research error conservatively. Analyze the traceback or symptom first, explain the likely cause, suggest the smallest safe fix, and do not patch code unless explicitly authorized.

#!/usr/bin/env python3
"""Conservative research debugging without automatic patching."""

from __future__ import annotations

import argparse
import json
from pathlib import Path
from typing import Dict, List


CATEGORY_RULES = [
    ("cuda_oom", ["cuda out of memory", "outofmemoryerror", "oom"]),
    ("checkpoint_mismatch", ["size mismatch", "missing key", "unexpected key", "checkpoint"]),
    ("distributed_issue", ["nccl", "distributed", "ddp", "rank"]),
    ("device_mismatch", ["expected all tensors to be on the same device", "same device"]),
    ("shape_mismatch", ["shape", "dimension", "size mismatch"]),
    ("loss_nan", ["loss is nan", "nan", "not converging"]),
    ("file_missing", ["filenotfounderror", "no such file", "cannot find path"]),
]


def classify_error(text: str) -> str:
    lower = text.lower()
    for category, signals in CATEGORY_RULES:
        if any(signal in lower for signal in signals):
            return category
    if "traceback" in lower or "runtimeerror" in lower or "valueerror" in lower:
        return "runtime_failure"
    return "unknown"


def suggested_actions(category: str) -> List[str]:
    mapping = {
        "cuda_oom": [
            "Check effective batch size, input resolution, and mixed-precision settings before patching model code.",
            "Prefer a configuration-only reduction before touching architecture.",
        ],
        "checkpoint_mismatch": [
            "Verify checkpoint source, model variant, and load strictness assumptions.",
            "Confirm whether the mismatch is expected before introducing compatibility code.",
        ],
        "distributed_issue": [
            "Inspect launch command, world size, and environment variables before patching training logic.",
            "Reproduce with a single process when possible to narrow the issue safely.",
        ],
        "device_mismatch": [
            "Trace where tensors and modules move across CPU and GPU boundaries.",
            "Prefer a minimal device-placement fix over a broad refactor.",
        ],
        "shape_mismatch": [
            "Log tensor shapes at the failing boundary without changing unrelated code paths.",
            "Check config, dataset, and head dimensions before editing model internals.",
        ],
        "loss_nan": [
            "Inspect data ranges, loss inputs, mixed precision, and learning rate before changing architecture.",
            "Use a shorter controlled run to confirm whether NaNs appear at startup or later.",
        ],
        "file_missing": [
            "Validate dataset, checkpoint, and config paths before editing code.",
            "Prefer a path fix or documented setup correction over logic changes.",
        ],
        "runtime_failure": [
            "Trace the failing file and symbol before proposing any patch.",
            "Confirm whether the failure is environment-related, config-related, or code-related.",
        ],
        "unknown": [
            "Collect the full command, stack trace, and recent code change before patching anything.",
            "Narrow the failure surface with the smallest reproducible example available.",
        ],
    }
    return mapping[category]


def analyze_error(text: str) -> Dict[str, object]:
    category = classify_error(text)
    needs_savepoint = category in {"checkpoint_mismatch", "distributed_issue", "shape_mismatch", "loss_nan"}
    return {
        "category": category,
        "summary": f"Detected debug category: `{category}`.",
        "needs_explicit_patch_approval": True,
        "needs_savepoint_before_patch": needs_savepoint,
        "actions": suggested_actions(category),
        "error_excerpt": "\n".join(text.splitlines()[:12]) or text,
    }


def write_outputs(output_dir: Path, data: Dict[str, object]) -> None:
    output_dir.mkdir(parents=True, exist_ok=True)

    diagnosis = [
        "# Debug Diagnosis",
        "",
        f"- Category: `{data['category']}`",
        f"- Patch authorized: `False`",
        f"- Savepoint recommended before patching: `{data['needs_savepoint_before_patch']}`",
        "",
        "## Error excerpt",
        "",
        "```text",
        data["error_excerpt"],
        "```",
        "",
        "## Conservative analysis",
        "",
        data["summary"],
        "",
    ]
    (output_dir / "DIAGNOSIS.md").write_text("\n".join(diagnosis), encoding="utf-8")

    patch_plan = [
        "# Patch Plan",
        "",
        "- Do not modify repository code until the researcher approves the proposed fix.",
        "- Prefer the smallest configuration or path fix before touching core model logic.",
        f"- Savepoint recommended: `{data['needs_savepoint_before_patch']}`",
        "",
        "## Suggested actions",
        "",
        *[f"- {item}" for item in data["actions"]],
        "",
    ]
    (output_dir / "PATCH_PLAN.md").write_text("\n".join(patch_plan), encoding="utf-8")

    status = {
        "schema_version": "1.0",
        "status": "diagnosed",
        "category": data["category"],
        "patch_authorized": False,
        "needs_explicit_patch_approval": data["needs_explicit_patch_approval"],
        "needs_savepoint_before_patch": data["needs_savepoint_before_patch"],
        "suggested_actions": data["actions"],
        "outputs": {
            "diagnosis": "debug_outputs/DIAGNOSIS.md",
            "patch_plan": "debug_outputs/PATCH_PLAN.md",
            "status": "debug_outputs/status.json",
        },
    }
    (output_dir / "status.json").write_text(json.dumps(status, indent=2, ensure_ascii=False), encoding="utf-8")


def main() -> int:
    parser = argparse.ArgumentParser(description="Conservative deep learning research debugging.")
    parser.add_argument("--error-file", help="Path to a text file containing the error or symptom.")
    parser.add_argument("--error-text", help="Inline error or symptom text.")
    parser.add_argument("--output-dir", default="debug_outputs", help="Directory for debug outputs.")
    parser.add_argument("--json", action="store_true", help="Emit JSON to stdout instead of writing files.")
    args = parser.parse_args()

    if not args.error_file and not args.error_text:
        raise SystemExit("Provide --error-file or --error-text.")

    text = args.error_text or Path(args.error_file).read_text(encoding="utf-8", errors="ignore")
    data = analyze_error(text)
    if args.json:
        print(json.dumps(data, indent=2, ensure_ascii=False))
        return 0

    write_outputs(Path(args.output_dir).resolve(), data)
    print(json.dumps(data, indent=2, ensure_ascii=False))
    return 0


if __name__ == "__main__":
    raise SystemExit(main())

Related skills

Azure DiagnosticsSystematically diagnose and resolve production issues on Microsoft Azure using official Microsoft guidance.472k1.3k

Azure MessagingQuickly diagnose and fix connection, authentication, and message-processing failures when using Azure Event Hubs or Service Bus SDKs.460k1.3k

Use My BrowserWhen their agent task requires access to the live browser session, rendered DOM state, authenticated dashboards, localhost apps, or DevTools-selected elements instea269k72

Diagnosing BugsGet a systematic, step-by-step process that surfaces the real root cause instead of guessing at bugs.222k183k

Systematic DebuggingFollow a repeatable four-phase process that forces root-cause discovery before any code changes.197k260k

Sentry CliCapture errors, upload source maps, manage releases, and query events directly from the terminal or CI without leaving their workflow.111k

Forks & variants (3)

Safe Debug has 3 known copies in the catalog totaling 440 installs. They canonicalize to this original listing.

lllllllama - 401 installs
lllllllama - 29 installs
lllllllama - 10 installs

How it compares

Use safe-debug for read-only failure triage; use explore-code only after explicit authorization to apply exploratory fixes on an isolated branch.

FAQ

Does this modify code automatically?

No. Diagnosis is first; if a patch is needed, it's proposed and requires explicit approval.

Is a debug fix a research contribution?

Not automatically. If it changes experiment meaning or comparability, that's stated explicitly.

Is Safe Debug safe to install?

skills.sh reports 3 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.

Debuggingbackenddevops

Safe Debug

About

Safe Debug by the numbers

safe-debug capabilities & compatibility

Add your badge

What it does

Who is it for?

When should I use this skill?

What you get

By the numbers

Files

safe-debug

When to apply

When not to apply

Clear boundaries

Output expectations

Notes

Debug Policy

Default protocol

Required outputs

Forbidden behavior

Related skills

Forks & variants (3)

How it compares

FAQ

Does this modify code automatically?

Is a debug fix a research contribution?

Is Safe Debug safe to install?

About

Safe Debug by the numbers

safe-debug capabilities & compatibility

Add your badge

What it does

Who is it for?

When should I use this skill?

What you get

By the numbers

Files

safe-debug

When to apply

When not to apply

Clear boundaries

Output expectations

Notes

Debug Policy

Default protocol

Required outputs

Forbidden behavior

Related skills

Forks & variants (3)

How it compares

FAQ

Does this modify code automatically?

Is a debug fix a research contribution?

Is Safe Debug safe to install?

This week in AI coding