
Printing Press Output Review
Run an internal forked agent review of sampled printed-CLI command output to catch plausibility bugs rule-based ship checks miss.
Install
npx skills add https://github.com/mvanhorn/cli-printing-press --skill printing-press-output-reviewWhat is this skill?
- Internal-only sub-skill (`user-invocable: false`) invoked via Skill tool from printing-press and printing-press-polish
- Uses `context: fork` so reviewer chatter stays out of the parent skill context
- Wave B policy: surfaces plausibility issues as warnings, never errors
- Targets substring-match relevance, format bugs, silent source drops, and ranking failures
- Expects `$CLI_DIR` absolute path to the printed CLI working directory
Adoption & trust: 1.9k installs on skills.sh; 3.1k GitHub stars; 2/3 security scanners passed (skills.sh audits).
Recommended Skills
Agent Browservercel-labs/open-agents
Tddmattpocock/skills
Use My Browserxixu-me/skills
Test Driven Developmentobra/superpowers
Verification Before Completionobra/superpowers
Webapp Testinganthropics/skills
Journey fit
Primary fit
Output review runs at shipcheck Phase 4.85 and inside polish diagnostics—canonical placement is Ship where pre-release CLI quality gates live. It is an agentic review pass over sampled outputs, complementing verify and scorecard live-check rules on the review shelf.
Common Questions / FAQ
Is Printing Press Output Review safe to install?
skills.sh reports 2 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Printing Press Output Review
# printing-press-output-review (internal) Review the sampled outputs from a printed CLI for plausibility bugs that dogfood, verify, and the rule-based `scorecard --live-check` rules can't catch. Wave B policy: all findings surface as warnings, never errors. This skill is **internal-only** (`user-invocable: false`). It's invoked by parents — main printing-press skill at shipcheck Phase 4.85, polish skill during its diagnostic loop. Running it standalone would produce floating findings text with no ship verdict, no fixes applied, no publish offer; the actionable wrappers are `/printing-press` and `/printing-press-polish`. The skill carries `context: fork` so the reviewer agent's diagnostic chatter stays isolated from the calling skill's context. ## Input The caller passes `$CLI_DIR` as the argument: an absolute path to the printed CLI's working directory. ## What this catches Bugs that rule-based checks miss, typically surfaced by 5 minutes of hands-on testing but slipping past dogfood, verify, and `scorecard --live-check` rules: - Substring-match results that coincidentally contain the query but don't match semantically (e.g., a query matches a substring of a larger unrelated term) - Aggregation commands silently dropping sources when only some of the requested N come back - Ranking or sort commands returning top-N results that aren't plausibly the best for the query (broken weights, extractor fallbacks) - URLs in output pointing at category index pages, feed endpoints, or random-selector routes rather than canonical content permalinks - Format bugs the rule-based layer doesn't catch (mojibake, inconsistent pluralization, truncated/wrapped cell content) ## Procedure ### Step 1: Gather sample data ```bash # Locate research.json. Adjacent to the binary covers the post-promote # layout (standalone polish, shipcheck against the library copy). The # grandparent fallback covers mid-pipeline invocations where $CLI_DIR is # $PRESS_RUNSTATE/runs/<id>/working/<cli> and research.json lives at # $PRESS_RUNSTATE/runs/<id>/research.json. Without the fallback, scorecard # reports `unable: true` mid-pipeline and we SKIP the most informative review. # Use a bash array so the flag survives paths with spaces. RESEARCH_ARGS=() if [ ! -f "$CLI_DIR/research.json" ]; then _grandparent="$(dirname "$(dirname "$CLI_DIR")")" if [ -f "$_grandparent/research.json" ]; then RESEARCH_ARGS=(--research-dir "$_grandparent") fi fi cli-printing-press scorecard --dir "$CLI_DIR" "${RESEARCH_ARGS[@]}" --live-check --json > /tmp/output-review-livecheck.json 2>&1 || true ``` If the scorecard call fails or `/tmp/output-review-livecheck.json` is empty, return the SKIP result (Step 3) without dispatching the reviewer. ### Step 2: Dispatch the reviewer agent Use the Agent tool (general-purpose) with this prompt contract: > Review the sampled outputs from the shipped CLI at `$CLI_DIR`. You have these ground-truth sources: > > - Sampled command output: read `/tmp/output-review-livecheck.json` and inspect the `live_check.features[]` array. Each entry has the command, example invocation, redacted stdout evidence (in `output_sample`, bounded to ~4 KiB), the redacted pass/fail reason, and a `warnings` array (populated by rule-based checks like the raw-HTML-entity detector). Treat `<redacted>` markers as privacy scrubbed values, not format bugs. > - **Review only `sta