
Harness Creator
Install when you want a small, followable agent harness—AGENTS.md, state files, verification commands, and session handoff—so coding agents stay in scope and can resume work.
Install
npx skills add https://github.com/walkinglabs/learn-harness-engineering --skill harness-creatorWhat is this skill?
- Five-subsystem model: Instructions, State, Verification, Scope, Lifecycle
- Audits existing instruction files, state, and verification commands before adding structure
- Produces minimal artifacts: AGENTS.md or CLAUDE.md, feature_list.json, progress.md, session-handoff.md
- Emphasizes harnesses small enough that agents actually follow them
- Covers memory persistence, context control, tool safety, and multi-agent coordination
Adoption & trust: 648 installs on skills.sh; 8k GitHub stars; 2/3 security scanners passed (skills.sh audits).
Recommended Skills
Journey fit
Canonical shelf is Build → agent-tooling because the primary artifact is making the repo agent-ready (instructions, state, verification, scope, lifecycle). Agent-tooling is where AGENTS.md/CLAUDE.md, feature_list.json, and init.sh-style verification belong—not generic frontend or backend feature code.
Common Questions / FAQ
Is Harness Creator safe to install?
skills.sh reports 2 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Harness Creator
# Harness Creator Use this skill to make a repository easier for coding agents to start, stay in scope, verify work, and resume across sessions. Keep the harness small enough that agents actually follow it. Not for model selection, prompt tuning in isolation, chat UI design, or general app architecture. ## Core Model Every useful coding-agent harness has five subsystems: | Subsystem | Minimal artifact | Purpose | |---|---|---| | Instructions | `AGENTS.md` or `CLAUDE.md` | Startup path, working rules, definition of done | | State | `feature_list.json`, `progress.md` | Current feature, status, evidence, next step | | Verification | `init.sh` or documented commands | Tests/checks the agent must run before claiming done | | Scope | Feature dependencies and done criteria | Prevents overreach and half-finished work | | Lifecycle | `session-handoff.md`, end-of-session routine | Makes the next session restartable | ## First Move 1. Inspect what already exists: instruction files, feature/state files, verification commands, docs, package manifests. 2. Ask only for missing context that cannot be inferred safely: target agent, desired file name, tolerance for structure, and whether overwriting is allowed. 3. Prefer a minimal harness first. Add memory, tool safety, multi-agent, or benchmark details only when the user's problem calls for them. ## Common Tasks ### Create a harness Use the bundled script when working on a local repository: ```bash node skills/harness-creator/scripts/create-harness.mjs --target /path/to/project ``` Options: - `--agent-file CLAUDE.md` for Claude-oriented projects. - `--package-manager npm|pnpm|yarn|bun` when detection is wrong. - `--commands "cmd one,cmd two"` for custom verification. - `--force` only after confirming overwrites are acceptable. Then explain what was created and how the user should replace placeholder feature entries. ### Audit an existing harness Run: ```bash node skills/harness-creator/scripts/validate-harness.mjs --target /path/to/project ``` Report the five subsystem scores, the lowest-scoring area, and the first 2-3 changes that would improve reliability. Treat the lowest score as a candidate bottleneck; confirm with failures, logs, or task outcomes before claiming causality. ### Produce a report Use when the user wants a shareable assessment: ```bash node skills/harness-creator/scripts/render-assessment-html.mjs --target /path/to/project node skills/harness-creator/scripts/run-benchmark.mjs --target /path/to/project --html /path/to/report.html ``` Be clear that this is a structural benchmark. Real effectiveness still needs before/after agent sessions on representative tasks. ## When to Read References Load only the reference needed for the user's problem: - Memory across sessions: [Memory Persistence](references/memory-persistence-pattern.md) - Reusable workflows as skills: [Skill Runtime](references/skill-runtime-pattern.md) - Permissions, tools, concurrency: [Tool Registry & Safety](references/tool-registry-pattern.md) - Context budget and progressive disclosure: [Context Engineering](references/context-engineering-pattern.md) - Delegation and parallel agents: [Multi-Agent Coordination](references/multi-agent-pattern.md) - Hooks, startup, long-running work: [Lifecycle & Bootstrap](references/lifecycle-bootstrap-pattern.md) - Non-obvious failure modes: [Gotchas](references/gotchas.md) ## Design Rules - Keep the root instruction file short: routing and invariants, not a full manual. - Put project facts in project docs, not in the skill. - Make verification commands explicit and runnable. - Require evidence before marking a feature done. - Use one active fe