Codex Review

Name: Codex Review
Author: hyperb1iss

hyperb1iss/hyperskills

Run structured, multi-pass AI code reviews on branch diffs before merge using copy-paste prompts for Codex, Claude, or any reviewer CLI.

Overview

Codex Review Prompt Templates is an agent skill most often used in Ship (also Build, Operate) that supplies model-agnostic diff-review prompts for Codex, Claude, and other reviewer CLIs.

Install

npx skills add https://github.com/hyperb1iss/hyperskills --skill codex-review

What is this skill?

Model-agnostic prompt pack: General Review plus Security Deep-Dive and other focused passes
General pass prioritizes correctness, security (OWASP Top 10:2025), performance, and maintainability with explicit skip
Each finding requires file/line citation, concrete risk, specific fix, and 0.0–1.0 confidence
Works via `codex exec`, `claude -p`, or piping `git diff main...HEAD` into either reviewer
Documents when native `codex review` structured mode makes custom prompts unnecessary
4 review dimensions in General Review (correctness, security, performance, maintainability)
Confidence rated 0.0–1.0 per finding

Compatible agents: Claude Code, Codex, Cursor, any compatible agent

Adoption & trust: 563 installs on skills.sh; 13 GitHub stars; 3/3 security scanners passed (skills.sh audits).

What problem does it solve?

You are about to merge a branch but your agent reviews are vague, skip security, or argue about style instead of real regressions.

Who is it for?

Solo builders shipping from feature branches who want stacked general + security review passes without custom prompt engineering each time.

Skip if: Teams that only need `codex review`’s built-in structured output and never customize reviewer instructions.

When should I use this skill?

You need thorough branch diff reviews via Codex, Claude `-p`, or piped `git diff` and want dedicated General or Security Deep-Dive prompt text.

What do I get? / Deliverables

You run repeatable diff reviews with cited findings, confidence scores, and a clear correct/incorrect verdict before merge or hotfix deploy.

Executed review with cited findings and confidence scores
Overall patch correct or incorrect verdict

Recommended Skills

Improve Codebase Architecturemattpocock/skills

Improve Codebase Architecture is an agent skill that teaches how to deepen a cluster of shallow modules without breaking…226k installs·121k stars

Zoom Outmattpocock/skills

Lightweight meta-prompt skill that tells the agent to zoom out and deliver a domain-aligned overview of modules and call…181k installs·121k stars

Caveman Reviewjuliusbrussee/caveman

Formats code review as single actionable lines: location, problem, fix, with minimal noise.139k installs·70k stars

Requesting Code Reviewobra/superpowers

Requesting Code Review is an agent skill from the Superpowers collection that gives solo and indie builders a copy-ready…119k installs·221k stars

Receiving Code Reviewobra/superpowers

Superpowers methodology for agents receiving code review: prioritize technical correctness over social comfort, verify e…96.2k installs·221k stars

Request Refactor Planmattpocock/skills

request-refactor-plan is a structured agent workflow for solo and small-team maintainers who want refactors filed as act…30.5k installs·121k stars

Journey fit

Spans multiple journey phases - primary shelf plus alternate fits below.

Primary fit

Ship/review is the canonical shelf because the skill is prompt templates for reviewing diffs between main and HEAD before release. Review subphase matches diff-based correctness, security, performance, and maintainability passes—not one-off lint fixes.

Also useful

BuildBackend, data & payments

Also useful

OperateIteration & experiments

Where it fits

Example use

ShipCode review

Pipe `git diff main...HEAD` into `codex exec` with the General Review template before merging a payments refactor.

Example use

ShipSecurity

Run the Security Deep-Dive prompt after the general pass when the diff touches auth or user input.

Example use

BuildIntegrations & version control

Review a third-party webhook handler PR with maintainability and race-condition checks called out in the template.

Example use

OperateIteration & experiments

Validate a production hotfix branch with the verdict-style general prompt before emergency release.

How it compares

Use instead of ad-hoc “review my PR” chat prompts that lack severity dimensions and verdict structure.

Common Questions / FAQ

Who is codex-review for?

Indie and solo developers using Claude Code, Codex, or Cursor who review git diffs via CLI and want consistent, thorough prompts across agents.

When should I use codex-review?

In Ship/review before merge; during Build when validating a large integration PR; in Operate when reviewing hotfix diffs—especially as a first General pass then a Security Deep-Dive.

Is codex-review safe to install?

It is prompt text only—no network or shell by itself—but piping diffs may expose code to your reviewer provider; review the Security Audits panel on this page before trusting the package.

SKILL.md

READMESKILL.md - Codex Review

# Codex Review Prompt Templates

Ready-to-use prompts for each review pass. These are model-agnostic — they work with any reviewer CLI.

## Usage

Pass prompts as the final argument to the reviewer CLI:

```bash
# From Claude (Codex reviews)
codex exec "PROMPT_TEXT_HERE"

# From Codex (Claude reviews)
claude -p "PROMPT_TEXT_HERE"

# Or pipe a diff into either
git diff main...HEAD | codex exec "PROMPT_TEXT_HERE"
git diff main...HEAD | claude -p "PROMPT_TEXT_HERE"
```

For Codex's structured `codex review` command, prompts aren't needed — it has its own review format.

## General Review

Best as the first pass. Broad coverage across all dimensions.

```
Review the changes between main and HEAD with extreme thoroughness. Prioritize:
1. Correctness — logic errors, edge cases, null handling, race conditions
2. Security — injection, auth gaps, secrets exposure, OWASP Top 10:2025
3. Performance — algorithmic complexity, N+1 queries, memory leaks
4. Maintainability — coupling, abstraction leaks, API consistency

For each finding:
- Cite exact file and line range
- Explain the bug/risk concretely (not "this could be improved")
- Suggest a specific fix
- Rate confidence 0.0-1.0

Skip: formatting, naming style, minor documentation gaps.
Overall verdict: "patch is correct" or "patch is incorrect" with justification.
```

## Security Deep-Dive

```
You are a senior application security engineer reviewing a code change.

Analyze the diff between the current branch and main for:
1. Injection vulnerabilities (SQL, XSS, command, LDAP, template)
2. Authentication & authorization flaws
3. Secrets / credential exposure (hardcoded keys, tokens in logs)
4. Insecure deserialization or data handling
5. SSRF, path traversal, open redirects
6. Cryptographic misuse (weak algorithms, improper randomness)
7. Dependency risks (known CVEs, typosquatting)
8. Error handling that leaks internal state

For each finding, provide:
- Severity: critical / high / medium / low
- Attack vector description
- Affected file and line range
- Concrete remediation with code example
- Confidence: 0.0-1.0

If no security issues found, state that explicitly with your confidence level.
Do NOT flag style or non-security concerns.
```

## Architecture Review

```
You are a principal software architect reviewing a code change for design quality.

Evaluate the diff between current branch and main:
1. Does this change respect existing architectural boundaries?
2. Are abstractions at the right level — not too leaky, not over-engineered?
3. Does coupling increase or decrease? Quantify if possible.
4. Is the API surface consistent with existing patterns in the codebase?
5. Does this change make the system harder to test, extend, or maintain?
6. Are there backwards compatibility concerns?
7. Would a different design achieve the same goal more cleanly?

For each concern:
- Reference specific files and patterns
- Explain the architectural principle being violated
- Suggest a concrete alternative approach
- Rate impact: blocks-merge / should-fix / nice-to-have
- Confidence: 0.0-1.0

Skip: implementation details, performance micro-optimizations, style.
```

## Performance Review

```
You are a performance engineer reviewing a code change for efficiency.

Analyze the diff between current branch and main:
1. Algorithmic complexity — is there O(n^2) where O(n) or O(n log n) suffices?
2. Database queries — N+1 patterns, missing indexes, unnecessary JOINs
3. Memory — leaks, unnecessary copies, unbounded growth
4. I/O — blocking calls on hot paths, missing async/streaming
5. Caching — missed opportunities, cache invalidation bugs
6. Bundle/binary size — unnecessary dependencies, tree-shaking failures
7. Concurrency — lock contention, thread-safety, deadlock potential

For each finding:
- Estimated impact magnitude (minor / moderate / severe)
- Affected hot path or user-facing scenario
- Concrete optimization with before/after code
- Whether a benchmark is warranted
- Confidence: 0.0-1.0

Skip: pr

What is this skill?

Model-agnostic prompt pack: General Review plus Security Deep-Dive and other focused passes

General pass prioritizes correctness, security (OWASP Top 10:2025), performance, and maintainability with explicit skip

Each finding requires file/line citation, concrete risk, specific fix, and 0.0–1.0 confidence

Works via `codex exec`, `claude -p`, or piping `git diff main...HEAD` into either reviewer

Documents when native `codex review` structured mode makes custom prompts unnecessary

4 review dimensions in General Review (correctness, security, performance, maintainability)

Confidence rated 0.0–1.0 per finding

Compatible agents: Claude Code, Codex, Cursor, any compatible agent

Adoption & trust: 563 installs on skills.sh; 13 GitHub stars; 3/3 security scanners passed (skills.sh audits).

Journey fit

Spans multiple journey phases - primary shelf plus alternate fits below.

Primary fit

Also useful

BuildBackend, data & payments

Also useful

OperateIteration & experiments

Where it fits

Example use

ShipCode review

Pipe `git diff main...HEAD` into `codex exec` with the General Review template before merging a payments refactor.

Example use

ShipSecurity

Run the Security Deep-Dive prompt after the general pass when the diff touches auth or user input.

Example use

BuildIntegrations & version control

Review a third-party webhook handler PR with maintainability and race-condition checks called out in the template.

Example use

OperateIteration & experiments

Validate a production hotfix branch with the verdict-style general prompt before emergency release.

SKILL.md

READMESKILL.md - Codex Review

# Codex Review Prompt Templates

Ready-to-use prompts for each review pass. These are model-agnostic — they work with any reviewer CLI.

## Usage

Pass prompts as the final argument to the reviewer CLI:

```bash
# From Claude (Codex reviews)
codex exec "PROMPT_TEXT_HERE"

# From Codex (Claude reviews)
claude -p "PROMPT_TEXT_HERE"

# Or pipe a diff into either
git diff main...HEAD | codex exec "PROMPT_TEXT_HERE"
git diff main...HEAD | claude -p "PROMPT_TEXT_HERE"
```

For Codex's structured `codex review` command, prompts aren't needed — it has its own review format.

## General Review

Best as the first pass. Broad coverage across all dimensions.

```
Review the changes between main and HEAD with extreme thoroughness. Prioritize:
1. Correctness — logic errors, edge cases, null handling, race conditions
2. Security — injection, auth gaps, secrets exposure, OWASP Top 10:2025
3. Performance — algorithmic complexity, N+1 queries, memory leaks
4. Maintainability — coupling, abstraction leaks, API consistency

For each finding:
- Cite exact file and line range
- Explain the bug/risk concretely (not "this could be improved")
- Suggest a specific fix
- Rate confidence 0.0-1.0

Skip: formatting, naming style, minor documentation gaps.
Overall verdict: "patch is correct" or "patch is incorrect" with justification.
```

## Security Deep-Dive

```
You are a senior application security engineer reviewing a code change.

Analyze the diff between the current branch and main for:
1. Injection vulnerabilities (SQL, XSS, command, LDAP, template)
2. Authentication & authorization flaws
3. Secrets / credential exposure (hardcoded keys, tokens in logs)
4. Insecure deserialization or data handling
5. SSRF, path traversal, open redirects
6. Cryptographic misuse (weak algorithms, improper randomness)
7. Dependency risks (known CVEs, typosquatting)
8. Error handling that leaks internal state

For each finding, provide:
- Severity: critical / high / medium / low
- Attack vector description
- Affected file and line range
- Concrete remediation with code example
- Confidence: 0.0-1.0

If no security issues found, state that explicitly with your confidence level.
Do NOT flag style or non-security concerns.
```

## Architecture Review

```
You are a principal software architect reviewing a code change for design quality.

Evaluate the diff between current branch and main:
1. Does this change respect existing architectural boundaries?
2. Are abstractions at the right level — not too leaky, not over-engineered?
3. Does coupling increase or decrease? Quantify if possible.
4. Is the API surface consistent with existing patterns in the codebase?
5. Does this change make the system harder to test, extend, or maintain?
6. Are there backwards compatibility concerns?
7. Would a different design achieve the same goal more cleanly?

For each concern:
- Reference specific files and patterns
- Explain the architectural principle being violated
- Suggest a concrete alternative approach
- Rate impact: blocks-merge / should-fix / nice-to-have
- Confidence: 0.0-1.0

Skip: implementation details, performance micro-optimizations, style.
```

## Performance Review

```
You are a performance engineer reviewing a code change for efficiency.

Analyze the diff between current branch and main:
1. Algorithmic complexity — is there O(n^2) where O(n) or O(n log n) suffices?
2. Database queries — N+1 patterns, missing indexes, unnecessary JOINs
3. Memory — leaks, unnecessary copies, unbounded growth
4. I/O — blocking calls on hot paths, missing async/streaming
5. Caching — missed opportunities, cache invalidation bugs
6. Bundle/binary size — unnecessary dependencies, tree-shaking failures
7. Concurrency — lock contention, thread-safety, deadlock potential

For each finding:
- Estimated impact magnitude (minor / moderate / severe)
- Affected hot path or user-facing scenario
- Concrete optimization with before/after code
- Whether a benchmark is warranted
- Confidence: 0.0-1.0

Skip: pr

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Where it fits

Who is codex-review for?

When should I use codex-review?

Is codex-review safe to install?

SKILL.md

This week for builders

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Where it fits

Who is codex-review for?

When should I use codex-review?

Is codex-review safe to install?

SKILL.md