
Wan 2 7
Generate text-to-video with Wan 2.7 on RunComfy including multi-reference conditioning and audio lip-sync.
Install
npx skills add https://github.com/agentspace-so/runcomfy-agent-skills --skill wan-2-7What is this skill?
- Documents multi-reference conditioning and audio_url lip-sync patterns.
- Covers duration, resolution, aspect ratio, and prompt expansion options.
- Routes to HappyHorse, Seedance, Kling, or LTX 2 when another model fits better.
Adoption & trust: 209k installs on skills.sh; 15 GitHub stars; 1/3 security scanners passed (skills.sh audits).
Recommended Skills
Video Editagentspace-so/runcomfy-agent-skills
Image To Videoagentspace-so/runcomfy-agent-skills
Image Editagentspace-so/runcomfy-agent-skills
Flux Kontextagentspace-so/runcomfy-agent-skills
Nano Banana 2agentspace-so/runcomfy-agent-skills
Nano Banana Editagentspace-so/runcomfy-agent-skills
Journey fit
Common Questions / FAQ
Is Wan 2 7 safe to install?
skills.sh reports 1 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Wan 2 7
# Wan 2.7 — Pro Pack on RunComfy [runcomfy.com](https://www.runcomfy.com/?utm_source=skills.sh&utm_medium=skill&utm_campaign=wan-2-7) · [Text-to-video](https://www.runcomfy.com/models/wan-ai/wan-2-7/text-to-video?utm_source=skills.sh&utm_medium=skill&utm_campaign=wan-2-7) · [GitHub](https://github.com/agentspace-so/runcomfy-skills/tree/main/wan-2-7) Wan-AI's **Wan 2.7** — flagship video model with multi-reference conditioning and audio-driven lip-sync — hosted on the **RunComfy Model API**. ```bash npx skills add agentspace-so/runcomfy-skills --skill wan-2-7 -g ``` ## When to pick this model (vs siblings) | You want | Use | |---|---| | Lip-sync video to an audio track you supply | **Wan 2.7** (`audio_url`) | | Multi-reference fine motion control | **Wan 2.7** | | Smooth transitions, accurate motion physics | **Wan 2.7** | | Currently-#1 blind-vote video model | HappyHorse 1.0 | | Multi-modal cinematic with image+video+audio refs + in-pass voice generation | Seedance 2.0 Pro | | Cinematic motion editing on existing footage | Kling Video O1 | | Ultra-fast iteration | LTX 2 | If the user said "Wan" / "Wan 2.7" / "wan-ai" / "alibaba video" explicitly, route here regardless. ## Prerequisites 1. **RunComfy CLI** — `npm i -g @runcomfy/cli` 2. **RunComfy account** — `runcomfy login` opens a browser device-code flow. 3. **CI / containers** — set `RUNCOMFY_TOKEN=<token>` instead of `runcomfy login`. ## Endpoints + input schema ### `wan-ai/wan-2-7/text-to-video` | Field | Type | Required | Default | Notes | |---|---|---|---|---| | `prompt` | string | yes | — | Up to ~5000 chars / ~1500 tokens. | | `audio_url` | string | no | — | WAV/MP3, 3–30s, ≤15MB. **Drives lip-sync.** Omit → background music auto-generated. | | `aspect_ratio` | enum | no | `16:9` | `16:9`, `9:16`, `1:1`, `4:3`, `3:4`. | | `resolution` | enum | no | `1080p` | `720p` or `1080p`. | | `duration` | enum | no | `5` | 2–15 (whole seconds). | | `negative_prompt` | string | no | — | Up to 500 chars. Concrete issues to avoid. | | `enable_prompt_expansion` | bool | no | true | Auto-rewrites short prompts. Disable for literal control. | | `seed` | int | no | — | 0..2^31-1. Reuse for variants. | ## How to invoke **Default (5s 1080p 16:9, prompt-expanded):** ```bash runcomfy run wan-ai/wan-2-7/text-to-video \ --input '{"prompt": "<user prompt>"}' \ --output-dir <absolute/path> ``` **Audio-driven lip-sync (your own track):** ```bash runcomfy run wan-ai/wan-2-7/text-to-video \ --input '{ "prompt": "Medium close-up of the spokesperson, warm key light, locked tripod, slight breathing motion.", "audio_url": "https://.../voiceover.mp3", "duration": 12, "aspect_ratio": "9:16" }' \ --output-dir <absolute/path> ``` **Literal control (no auto-expansion):** ```bash runcomfy run wan-ai/wan-2-7/text-to-video \ --input '{ "prompt": "<exactly what you want, verbatim>", "enable_prompt_expansion": false, "negative_prompt": "no subtitles, no flicker, no distorted hands" }' \ --output-dir <absolute/path> ``` ## Prompting — what actually works **Camera + motion in plain English.** "Slow dolly in", "locked tripod, low angle", "handheld follow", "crane move from above". Front-load the shot. **One primary action per clip.** Don't pile up multiple competing actions.