
Root Cause Tracing
Find root cause before proposing fixes when tests fail, builds break, or behavior is unexpected.
Install
npx skills add https://github.com/obra/superpowers-skills --skill root-cause-tracingWhat is this skill?
- Four-phase process: investigate root cause, analyze patterns, hypothesize, implement verified fix.
- Iron law: no fixes without completing Phase 1 investigation first.
- Multi-component instrumentation pattern for CI, build, and signing pipelines.
Adoption & trust: 76 installs on skills.sh; 692 GitHub stars; 3/3 security scanners passed (skills.sh audits).
Recommended Skills
Azure Diagnosticsmicrosoft/azure-skills
Diagnosemattpocock/skills
Systematic Debuggingobra/superpowers
Safe Debuglllllllama/rigorpilot-skills
Mastramastra-ai/skills
Insforge Debuginsforge/agent-skills
Journey fit
Primary fit
Systematic debugging is operate-phase work—diagnosing failures in running systems and CI. Errors subphase fits because the skill forbids symptom fixes until evidence from reproduction, logs, and multi-component tracing is gathered.
Common Questions / FAQ
Is Root Cause Tracing safe to install?
skills.sh reports 3 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Root Cause Tracing
# Root Cause Tracing ## Overview Bugs often manifest deep in the call stack (git init in wrong directory, file created in wrong location, database opened with wrong path). Your instinct is to fix where the error appears, but that's treating a symptom. **Core principle:** Trace backward through the call chain until you find the original trigger, then fix at the source. ## When to Use ```dot digraph when_to_use { "Bug appears deep in stack?" [shape=diamond]; "Can trace backwards?" [shape=diamond]; "Fix at symptom point" [shape=box]; "Trace to original trigger" [shape=box]; "BETTER: Also add defense-in-depth" [shape=box]; "Bug appears deep in stack?" -> "Can trace backwards?" [label="yes"]; "Can trace backwards?" -> "Trace to original trigger" [label="yes"]; "Can trace backwards?" -> "Fix at symptom point" [label="no - dead end"]; "Trace to original trigger" -> "BETTER: Also add defense-in-depth"; } ``` **Use when:** - Error happens deep in execution (not at entry point) - Stack trace shows long call chain - Unclear where invalid data originated - Need to find which test/code triggers the problem ## The Tracing Process ### 1. Observe the Symptom ``` Error: git init failed in /Users/jesse/project/packages/core ``` ### 2. Find Immediate Cause **What code directly causes this?** ```typescript await execFileAsync('git', ['init'], { cwd: projectDir }); ``` ### 3. Ask: What Called This? ```typescript WorktreeManager.createSessionWorktree(projectDir, sessionId) → called by Session.initializeWorkspace() → called by Session.create() → called by test at Project.create() ``` ### 4. Keep Tracing Up **What value was passed?** - `projectDir = ''` (empty string!) - Empty string as `cwd` resolves to `process.cwd()` - That's the source code directory! ### 5. Find Original Trigger **Where did empty string come from?** ```typescript const context = setupCoreTest(); // Returns { tempDir: '' } Project.create('name', context.tempDir); // Accessed before beforeEach! ``` ## Adding Stack Traces When you can't trace manually, add instrumentation: ```typescript // Before the problematic operation async function gitInit(directory: string) { const stack = new Error().stack; console.error('DEBUG git init:', { directory, cwd: process.cwd(), nodeEnv: process.env.NODE_ENV, stack, }); await execFileAsync('git', ['init'], { cwd: directory }); } ``` **Critical:** Use `console.error()` in tests (not logger - may not show) **Run and capture:** ```bash npm test 2>&1 | grep 'DEBUG git init' ``` **Analyze stack traces:** - Look for test file names - Find the line number triggering the call - Identify the pattern (same test? same parameter?) ## Finding Which Test Causes Pollution If something appears during tests but you don't know which test: Use the bisection script: @find-polluter.sh ```bash ./find-polluter.sh '.git' 'src/**/*.test.ts' ``` Runs tests one-by-one, stops at first polluter. See script for usage. ## Real Example: Empty projectDir **Symptom:** `.git` created in `packages/core/` (source code) **Trace chain:** 1. `git init` runs in `process.cwd()` ← empty cwd parameter 2. WorktreeManager called with empty projectDir 3. Session.create() passed empty string 4. Test accessed `context.tempDir` before beforeEach 5. setupCoreTest() returns `{ tempDir: '' }` initially **Root cause:** Top-level variable initialization accessing empty value **Fix:** Made tempDir a getter that throws if accessed before beforeEach **Also added defense-in-depth:** - Layer 1: Project.create() validates directory - Layer 2: WorkspaceManager validates not empty - Layer 3: NODE_ENV guard refuses git init outside tmpdir - Layer 4: Stack trac