
Skill Review
Load structured SKILL.md compliance and description-quality rubrics when you are authoring, refactoring, or auditing agent skills before publishing to a catalog.
Overview
Skill-review is an agent skill most often used in Build (also Ship review) that supplies detailed rubrics for SKILL.md structure, assets layout, and description triggers when auditing agent skills.
Install
npx skills add https://github.com/sanyuan0704/code-review-expert --skill skill-reviewWhat is this skill?
- Structure compliance checks: required frontmatter, 500-line SKILL.md hard limit, orphan file detection
- Description quality rubric with keyword-bombing patterns and anti-patterns for weak triggers
- Explicit red flags for deep reference nesting and scripts missing invocation docs
- Designed as Step 2 reference material for a broader code-review-expert workflow
- Separates assets/ (output-only) from references/ (context-loaded) expectations
- Flags SKILL.md approaching limit at >400 lines with hard cap at 500 lines
- Structure section defines Required, Expected, and Red Flags compliance tiers
Adoption & trust: 1 installs on skills.sh; 3.6k GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).
What problem does it solve?
You wrote a SKILL.md but lack a consistent checklist for frontmatter, file layout, line limits, and discovery-friendly descriptions before sharing the package.
Who is it for?
Solo builders curating or open-sourcing Claude/Cursor skills who want the same compliance lens a marketplace editor would use.
Skip if: Application security pentests, runtime performance profiling, or teams that only review TypeScript/React PRs without SKILL.md packaging concerns.
When should I use this skill?
You are analyzing or editing an agent skill package and need the detailed Review Criteria Reference during structured review Step 2.
What do I get? / Deliverables
Your review pass produces actionable flags on structure, description quality, and orphaned artifacts so you can refactor the skill into catalog-ready shape.
- Structured compliance findings on SKILL.md layout and description quality
- Action list for extracting references, fixing orphans, and improving discovery copy
Recommended Skills
Journey fit
Spans multiple journey phases - primary shelf plus alternate fits below.
Skill packages are built artifacts; the first time most solo builders need this rubric is while crafting agent-tooling under Build, even though the same checks apply before ship-quality releases. Agent-tooling is the canonical shelf because the criteria target SKILL.md structure, references/, scripts/, and discovery-oriented descriptions—not application runtime code review alone.
Where it fits
Split an 600-line SKILL.md into references/ after the rubric flags the hard 500-line limit.
Run Step 2 description-quality checks before tagging a skill release your users will install.
Rewrite a thin description with natural question triggers so Prism and skills.sh search surfaces the skill.
How it compares
Specialized skill-package lint rubric—not a replacement for production code-review skills focused on app logic and CVEs.
Common Questions / FAQ
Who is skill-review for?
Indie agent authors, skill maintainers, and catalog contributors who need a detailed reference while reviewing SKILL.md packages for structure and discoverability.
When should I use skill-review?
During Build (agent-tooling) while drafting or refactoring a skill, during Ship (review) before tagging a release, and during Grow (content) when refreshing descriptions for SEO/AEO—whenever a skill needs an editorial compliance pass.
Is skill-review safe to install?
It is documentation-only review criteria with no shell side effects; still review the Security Audits panel on this Prism page before trusting any sibling scripts in the parent repo.
SKILL.md
READMESKILL.md - Skill Review
# Review Criteria Reference Detailed criteria for each review dimension. Load this file during Step 2 analysis. ## Structure Compliance — Detailed Checks ### Required - `SKILL.md` exists at root - Frontmatter has `name` field (matches directory name) - Frontmatter has `description` field (non-empty, >50 chars) ### Expected - `SKILL.md` ≤ 500 lines (hard limit; flag if >400 as approaching) - No files that serve no purpose: README.md (unless skill IS about READMEs), CHANGELOG.md, CONTRIBUTING.md - `scripts/` — if present, all files are executable and tested - `references/` — if present, each file is referenced from SKILL.md with load instructions - `assets/` — if present, files are used in output, not loaded into context ### Red Flags - SKILL.md > 500 lines → must extract to references - Deeply nested references (2+ levels) → flatten - Orphaned files (not referenced from SKILL.md) - Scripts without clear invocation instructions in SKILL.md ## Description Quality — Detailed Checks ### Keyword Bombing Indicators (Good) - 3+ verb variations for the same action (create/build/write/make) - Noun variations (skill/agent/tool/plugin) - Natural phrases a user would type - Both imperative ("create a skill") and question form ("how do I build a skill") ### Description Anti-Patterns (Bad) - Single-sentence description under 100 characters - Only technical terms, no natural language triggers - Describes implementation ("Uses markdown frontmatter...") instead of user intent - Contains instructions meant for the model body ### Scoring Heuristic - Count unique trigger phrases → <3 is weak, 5-8 is good, >10 is excellent - Check if casual user language would match → try 3 hypothetical queries mentally ## Workflow Design — Detailed Checks ### Checklist Quality - Uses `- [ ]` format (copy-paste friendly) - Steps are numbered and sequential - Sub-steps use indentation - ⚠️ marks appear on steps that must not be skipped - ⛔ marks appear on prerequisites/blockers ### Confirmation Gates Must exist before: - File creation/deletion/overwrite - External API calls with side effects - Long-running generative operations - Applying analysis results to code ### Flow Quality - Can a reader understand the full process from the checklist alone? - Are conditional branches clearly marked? - Is there a clear start and end state? ## Token Efficiency — Detailed Checks ### Iron Law - Present: yes/no - Placed at top (before workflow): yes/no - Specific and falsifiable: yes/no (bad: "write good code"; good: "never use placeholder text") - Addresses the #1 likely failure mode: yes/no ### Progressive Loading Signals (Good) - "Load references/X.md for..." with clear trigger conditions - References loaded at specific workflow steps, not all in Step 1 - Large context only loaded when that branch is taken ### Bloat Signals (Bad) - Explaining things Claude knows (how to write markdown, what JSON is) - "You should" / "You will" / "Remember to" (wasted tokens on filler) - Duplicated instructions (same thing said in workflow AND in later section) - Comments/explanations aimed at human readers, not model behavior ## Anti-Pattern Detection — Detailed Examples ### Vague Directives (rewrite as questions) - ❌ "Ensure the output is high quality" - ✅ "Does every heading have ≥2 sentences of content beneath it?" ### Over-Specification - ❌ "Use markdown headers with # for h1, ## for h2..." (Claude knows this) - ✅ "Use exactly 2 levels of headers: # for sections, ## for subsections" (this constrains) ### Missing Guardrails - No anti-patterns section = model will take lazy defaults - No pre-delivery checklist = no verification step - No Iron Law = no north star for quality decisions ### Monolithic SKILL.md - If SKILL.md > 300 lines, check: could any section be a reference loaded on-demand? - Domain knowledge that applies to only one step → extract to reference - Examples longer than 10 lines → extract to reference --- name: skill-review description: "Quality review an