Judgment Day

Name: Judgment Day
Author: fearovex

fearovex/claude-config·Apache-2.0

Run two parallel blind judge sub-agents on the same implementation, merge findings, fix issues, and re-judge until both pass or escalate after two iterations.

Overview

Judgment Day is an agent skill most often used in Ship (also Build) that runs parallel blind dual-judge reviews, synthesizes findings, fixes issues, and re-judges until both pass or escalates after two iterations.

Install

npx skills add https://github.com/fearovex/claude-config --skill judgment-day

What is this skill?

Launches two independent blind judge sub-agents in parallel on the same review target
Synthesizes conflicting findings, applies fixes, and re-judges with a two-iteration cap before escalation
Resolves applicable project skills via Skill Resolver Protocol before any judge runs
Explicit triggers: judgment day, dual review, review adversarial, and Spanish equivalents
Procedural format v1.1 under Apache-2.0 for repeatable high-confidence review
Maximum 2 review-fix-rejudge iterations before escalation
Metadata version 1.1

Compatible agents: Claude Code, Cursor, Codex, Windsurf

Adoption & trust: 1 installs on skills.sh; 1 GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

What problem does it solve?

A single reviewer—or you rushing a merge—can miss edge cases, architecture gaps, or correlated blind spots right before production.

Who is it for?

Solo builders shipping sizable implementations who want structured dual review with fix-and-rejudge loops before merging to main.

Skip if: Trivial one-line edits, tasks where you have not defined review scope, or situations where you cannot run parallel sub-agents or apply fixes in the repo.

When should I use this skill?

User says judgment day, judgment-day, review adversarial, dual review, doble review, juzgar, or que lo juzguen; or after significant implementations before merging when single-reviewer blind spots are costly.

What do I get? / Deliverables

You get a synthesized pass from two independent judges after applied fixes, or a clear escalation when two review iterations still fail—so merge decisions rest on adversarial consensus.

Synthesized dual-judge findings
Applied fixes from failed review rounds
Pass from both judges or escalation after iteration cap

Recommended Skills

Improve Codebase Architecturemattpocock/skills

Improve Codebase Architecture is an agent skill that teaches how to deepen a cluster of shallow modules without breaking…226k installs·121k stars

Zoom Outmattpocock/skills

Lightweight meta-prompt skill that tells the agent to zoom out and deliver a domain-aligned overview of modules and call…181k installs·121k stars

Caveman Reviewjuliusbrussee/caveman

Formats code review as single actionable lines: location, problem, fix, with minimal noise.139k installs·70k stars

Requesting Code Reviewobra/superpowers

Requesting Code Review is an agent skill from the Superpowers collection that gives solo and indie builders a copy-ready…119k installs·221k stars

Receiving Code Reviewobra/superpowers

Superpowers methodology for agents receiving code review: prioritize technical correctness over social comfort, verify e…96.2k installs·221k stars

Request Refactor Planmattpocock/skills

request-refactor-plan is a structured agent workflow for solo and small-team maintainers who want refactors filed as act…30.5k installs·121k stars

Journey fit

Spans multiple journey phases - primary shelf plus alternate fits below.

Primary fit

Judgment Day is shelved under Ship because its primary trigger is pre-merge, high-stakes review when a wrong blind spot is more expensive than extra review rounds. Review subphase matches adversarial dual-judge synthesis and fix loops aimed at merge readiness rather than initial implementation or deployment.

Also useful

BuildBackend, data & payments

Also useful

ShipTesting & QA

Where it fits

Example use

BuildBackend, data & payments

After finishing a large API or agent feature, run Judgment Day on the touched modules before opening the merge PR.

Example use

ShipCode review

Invoke when the user says que lo juzguen to force parallel judges on the PR diff scope.

Example use

ShipTesting & QA

Use findings from dual judges to harden tests or edge-case handling discovered only under adversarial review.

Example use

ValidatePrototype & spike

Stress-test a prototype architecture document or spike implementation before committing to full build.

How it compares

Use instead of a single generic code-review chat when you need blind parallel judges and an enforced re-review loop before merge.

Common Questions / FAQ

Who is judgment-day for?

Judgment Day is for indie developers and small teams using agentic IDEs who want adversarial, dual-blind review before merging significant code or architecture changes.

When should I use judgment-day?

Use it in Ship/review when you say judgment day or dual review; after large Build implementations; before merge when bug cost exceeds review time; whenever one reviewer might miss edge cases.

Is judgment-day safe to install?

It orchestrates sub-agents and may modify code during fix loops—review the Security Audits panel on this page and run it on branches with backups or PRs, not directly on untested production.

SKILL.md

READMESKILL.md - Judgment Day

**Triggers**: "judgment day", "judgment-day", "review adversarial", "dual review", "doble review", "juzgar", "que lo juzguen".

## When to Use

- User explicitly asks for "judgment day", "judgment-day", or equivalent trigger phrases
- After significant implementations before merging
- When high-confidence review of code, features, or architecture is needed
- When a single reviewer might miss edge cases or have blind spots
- When the cost of a production bug is higher than the cost of two review rounds

## Critical Patterns

### Pattern 0: Skill Resolution (BEFORE launching judges)

Follow the **Skill Resolver Protocol** (`_shared/skill-resolver.md`) before launching ANY sub-agent:

1. Obtain the skill registry: search engram (`mem_search(query: "skill-registry", project: "{project}")`) → fallback to `.agent-config/skill-registry.md` from the project root → skip if none
2. Identify the target files/scope — what code will the judges review?
3. Match relevant skills from the registry's **Compact Rules** by:
   - **Code context**: file extensions/paths of the target (e.g., `.go` → go-testing; `.tsx` → react-19, typescript)
   - **Task context**: "review code" → framework/language skills; "create PR" → branch-pr skill
4. Build a `## Project Standards (auto-resolved)` block with the matching compact rules
5. Inject this block into BOTH Judge prompts AND the Fix Agent prompt (identical for all)

This ensures judges review against project-specific standards, not just generic best practices.

**If no registry exists**: warn the user ("No skill registry found — judges will review without project-specific standards. Run `skill-registry` to fix this.") and proceed with generic review only.

### Pattern 1: Parallel Blind Review

- Launch **TWO** sub-agents via `delegate` (async, parallel — never sequential)
- Each agent receives the **same target** but works **independently**
- **Neither agent knows about the other** — no cross-contamination
- Both use identical review criteria but may find different issues
- NEVER do the review yourself as the orchestrator — your job is coordination only

### Pattern 2: Verdict Synthesis

The **orchestrator** (NOT a sub-agent) compares results after both `delegation_read` calls return:

```
Confirmed   → found by BOTH agents          → high confidence, fix immediately
Suspect A   → found ONLY by Judge A         → needs triage
Suspect B   → found ONLY by Judge B         → needs triage
Contradiction → agents DISAGREE on the same thing → flag for manual decision
```

Present findings as a structured verdict table (see Output Format).

### Pattern 3: Fix and Re-judge

1. If **confirmed issues** exist → delegate a **Fix Agent** (separate delegation)
2. After Fix Agent completes → re-launch **both judges in parallel** (same blind protocol, fresh delegates)
3. **Max 2 fix iterations.** If still failing → JUDGMENT: ESCALATED — report to user with full history
4. If both judges return clean → JUDGMENT: APPROVED ✅

---

## Decision Tree

```
User asks for "judgment day"
│
├── Target is specific files/feature/component?
│   ├── YES → continue
│   └── NO → ask user to specify scope before proceeding
│
▼
Resolve skills (Pattern 0): read registry → match by code + task context → build Project Standards block
▼
Launch Judge A + Judge B in parallel (delegate, async) — with Project Standards injected
▼
Wait for both to complete (delegation_read both)
▼
Synthesize verdict
│
├── No issues found?
│   └── JUDGMENT: APPROVED ✅ (stop here)
│
├

What is this skill?

Launches two independent blind judge sub-agents in parallel on the same review target

Synthesizes conflicting findings, applies fixes, and re-judges with a two-iteration cap before escalation

Resolves applicable project skills via Skill Resolver Protocol before any judge runs

Explicit triggers: judgment day, dual review, review adversarial, and Spanish equivalents

Procedural format v1.1 under Apache-2.0 for repeatable high-confidence review

Maximum 2 review-fix-rejudge iterations before escalation

Metadata version 1.1

Compatible agents: Claude Code, Cursor, Codex, Windsurf

Adoption & trust: 1 installs on skills.sh; 1 GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

What do I get? / Deliverables

You get a synthesized pass from two independent judges after applied fixes, or a clear escalation when two review iterations still fail—so merge decisions rest on adversarial consensus.

Synthesized dual-judge findings

Applied fixes from failed review rounds

Pass from both judges or escalation after iteration cap

Journey fit

Spans multiple journey phases - primary shelf plus alternate fits below.

Primary fit

Also useful

BuildBackend, data & payments

Also useful

ShipTesting & QA

Where it fits

Example use

BuildBackend, data & payments

After finishing a large API or agent feature, run Judgment Day on the touched modules before opening the merge PR.

Example use

ShipCode review

Invoke when the user says que lo juzguen to force parallel judges on the PR diff scope.

Example use

ShipTesting & QA

Use findings from dual judges to harden tests or edge-case handling discovered only under adversarial review.

Example use

ValidatePrototype & spike

Stress-test a prototype architecture document or spike implementation before committing to full build.

SKILL.md

READMESKILL.md - Judgment Day

**Triggers**: "judgment day", "judgment-day", "review adversarial", "dual review", "doble review", "juzgar", "que lo juzguen".

## When to Use

- User explicitly asks for "judgment day", "judgment-day", or equivalent trigger phrases
- After significant implementations before merging
- When high-confidence review of code, features, or architecture is needed
- When a single reviewer might miss edge cases or have blind spots
- When the cost of a production bug is higher than the cost of two review rounds

## Critical Patterns

### Pattern 0: Skill Resolution (BEFORE launching judges)

Follow the **Skill Resolver Protocol** (`_shared/skill-resolver.md`) before launching ANY sub-agent:

1. Obtain the skill registry: search engram (`mem_search(query: "skill-registry", project: "{project}")`) → fallback to `.agent-config/skill-registry.md` from the project root → skip if none
2. Identify the target files/scope — what code will the judges review?
3. Match relevant skills from the registry's **Compact Rules** by:
   - **Code context**: file extensions/paths of the target (e.g., `.go` → go-testing; `.tsx` → react-19, typescript)
   - **Task context**: "review code" → framework/language skills; "create PR" → branch-pr skill
4. Build a `## Project Standards (auto-resolved)` block with the matching compact rules
5. Inject this block into BOTH Judge prompts AND the Fix Agent prompt (identical for all)

This ensures judges review against project-specific standards, not just generic best practices.

**If no registry exists**: warn the user ("No skill registry found — judges will review without project-specific standards. Run `skill-registry` to fix this.") and proceed with generic review only.

### Pattern 1: Parallel Blind Review

- Launch **TWO** sub-agents via `delegate` (async, parallel — never sequential)
- Each agent receives the **same target** but works **independently**
- **Neither agent knows about the other** — no cross-contamination
- Both use identical review criteria but may find different issues
- NEVER do the review yourself as the orchestrator — your job is coordination only

### Pattern 2: Verdict Synthesis

The **orchestrator** (NOT a sub-agent) compares results after both `delegation_read` calls return:

```
Confirmed   → found by BOTH agents          → high confidence, fix immediately
Suspect A   → found ONLY by Judge A         → needs triage
Suspect B   → found ONLY by Judge B         → needs triage
Contradiction → agents DISAGREE on the same thing → flag for manual decision
```

Present findings as a structured verdict table (see Output Format).

### Pattern 3: Fix and Re-judge

1. If **confirmed issues** exist → delegate a **Fix Agent** (separate delegation)
2. After Fix Agent completes → re-launch **both judges in parallel** (same blind protocol, fresh delegates)
3. **Max 2 fix iterations.** If still failing → JUDGMENT: ESCALATED — report to user with full history
4. If both judges return clean → JUDGMENT: APPROVED ✅

---

## Decision Tree

```
User asks for "judgment day"
│
├── Target is specific files/feature/component?
│   ├── YES → continue
│   └── NO → ask user to specify scope before proceeding
│
▼
Resolve skills (Pattern 0): read registry → match by code + task context → build Project Standards block
▼
Launch Judge A + Judge B in parallel (delegate, async) — with Project Standards injected
▼
Wait for both to complete (delegation_read both)
▼
Synthesize verdict
│
├── No issues found?
│   └── JUDGMENT: APPROVED ✅ (stop here)
│
├

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Where it fits

Who is judgment-day for?

When should I use judgment-day?

Is judgment-day safe to install?

SKILL.md

This week for builders

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Where it fits

Who is judgment-day for?

When should I use judgment-day?

Is judgment-day safe to install?

SKILL.md