
Comfyui Workflow Builder
Generate valid ComfyUI workflow JSON with correct node wiring, resolutions, and model references for image pipelines.
Install
npx skills add https://github.com/mckruz/comfyui-expert --skill comfyui-workflow-builderWhat is this skill?
- Eval rubric spans 6 weighted criteria including real class_types, output indices, and dangling-input checks
- Targets 85% with-skill score vs 30% baseline with documented token overhead and manual correction reduction goals
- Enforces CheckpointLoader output index semantics MODEL, CLIP, and VAE slots
- Includes 7 test cases (5 happy path, 1 failure mode) for workflow validation
- Requires resolution to match the selected model training resolution and reasonable VRAM estimates
Adoption & trust: 987 installs on skills.sh; 69 GitHub stars; 2/3 security scanners passed (skills.sh audits).
Recommended Skills
Video Editagentspace-so/runcomfy-agent-skills
Image To Videoagentspace-so/runcomfy-agent-skills
Image Editagentspace-so/runcomfy-agent-skills
Flux Kontextagentspace-so/runcomfy-agent-skills
Nano Banana 2agentspace-so/runcomfy-agent-skills
Nano Banana Editagentspace-so/runcomfy-agent-skills
Journey fit
Primary fit
Build is where generative pipelines are assembled; the skill produces workflow graphs rather than shipping or marketing them. Integrations subphase matches wiring ComfyUI nodes, checkpoints, and graph semantics into a runnable workflow file.
Common Questions / FAQ
Is Comfyui Workflow Builder safe to install?
skills.sh reports 2 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Comfyui Workflow Builder
skill_type: capability_uplift baseline_expected_score: "30%" with_skill_target_score: "85%" token_overhead_acceptable: "25%" manual_correction_reduction: "70%" test_case_count: 7 happy_path_cases: 5 failure_mode_cases: 1 comparison_criteria: - criterion: "All class_types reference real ComfyUI nodes" weight: 0.25 assertion_types: [contains, json_valid] anchor: "node name string match and JSON validity" - criterion: "Output indices are correct for each node type (e.g., CheckpointLoader [0]=MODEL, [1]=CLIP, [2]=VAE)" weight: 0.25 assertion_types: [structure_check, not_contains] anchor: "semantic — requires JSON graph analysis; anti-pattern exclusion anchored" - criterion: "No dangling inputs — every required connection is wired" weight: 0.20 assertion_types: [structure_check] anchor: "semantic — requires full graph traversal" - criterion: "Resolution matches the selected model's training resolution" weight: 0.15 assertion_types: [regex, contains, not_contains] anchor: "resolution numeric value and anti-pattern exclusion" - criterion: "VRAM estimate is reasonable for the workflow complexity" weight: 0.10 assertion_types: [structure_check] anchor: "semantic — requires understanding workflow complexity" - criterion: "Model filenames reference known/real checkpoint files" weight: 0.05 assertion_types: [structure_check] anchor: "semantic — requires knowledge of real model filenames" # comfyui-workflow-builder — Eval Configuration ## Classification - **Type**: Capability Uplift - **Category**: Structured workflow generation from natural language with hardware-aware validation ## What "Good" Looks Like 1. Output is valid ComfyUI workflow JSON with correct node IDs, class_types, and connections 2. All class_types reference nodes that actually exist in ComfyUI (no hallucinated node names) 3. Model filenames match real checkpoint/LoRA files (e.g., `juggernautXL_v9.safetensors`, not invented names) 4. Connections are correct — no dangling inputs, output indices match node output slots (e.g., CheckpointLoader outputs [MODEL, CLIP, VAE] at indices [0, 1, 2]) 5. Resolution matches the selected model's training resolution (1024x1024 for SDXL, 512x512 for SD1.5, etc.) and VRAM estimate is reasonable for the hardware ## Known Limitations - Cannot verify at generation time whether the user actually has a specific model installed - Custom node availability varies per installation — skill uses common nodes but can't guarantee all are present - VRAM estimates are approximations based on typical configurations ## Benchmark Strategy - **Without skill**: Base Claude produces plausible-looking JSON but frequently hallucinates node class_types, uses wrong output indices, and ignores resolution/model compatibility - **With skill**: Generates validated workflows with correct node names, proper output slot indices, model-appropriate resolutions, and VRAM estimates - **Key differentiator**: Output index correctness and node class_type accuracy — the difference between a workflow that loads vs. one that errors immediately ## Security — Eval Sandboxing Eval runs use real tool access and may expose secrets in output. Results are gitignored. Use `--allowedTools "Read,Glob,Grep"` to prevent modification during eval runs. ## Running Evals ```bash bash eval/run-eval.sh # Full run (with-skill + baseline) bash eval/run-eval.sh --skill-only # With-skill only bash eval/run-eval.sh --case TC-001 # Single test case ``` ## Retirement Signal When base Claude consistently produces ComfyUI workflows with correct output indices, valid class_types from the actual node registry, and model-appropriate resolutions without needing the skill's node/model reference data. # Eval results may contain secrets from real config files. * !.gitignore !.gitkeep #!/usr/bin/env bash # ═══════════════════════════════════════════════════════════════════════ # Skill Eval Runn