
Ai Avatar Video
Generate talking-head, lip-sync, and avatar videos with RunComfy CLI across OmniHuman, Wan, HappyHorse, and Seedance.
Install
npx skills add https://github.com/agentspace-so/runcomfy-agent-skills --skill ai-avatar-videoWhat is this skill?
- Routes intent across OmniHuman, Wan 2-7 audio_url, HappyHorse 1.0, and Seedance v2 Pro
- Documents per-model prompting patterns and minimal runcomfy run invokes
- Covers lip sync, audio-driven avatars, reference audio, and portrait-to-speech workflows
Adoption & trust: 153k installs on skills.sh; 15 GitHub stars; 2/3 security scanners passed (skills.sh audits).
Recommended Skills
Video Editagentspace-so/runcomfy-agent-skills
Image To Videoagentspace-so/runcomfy-agent-skills
Image Editagentspace-so/runcomfy-agent-skills
Flux Kontextagentspace-so/runcomfy-agent-skills
Nano Banana 2agentspace-so/runcomfy-agent-skills
Nano Banana Editagentspace-so/runcomfy-agent-skills
Journey fit
Primary fit
Avatar and UGC-style video targets marketing, spokespeople, and dubbed demos rather than core app implementation. Output is publishable video content aligned with growth motions like UGC, virtual presenters, and HeyGen-style alternatives.
Common Questions / FAQ
Is Ai Avatar Video safe to install?
skills.sh reports 2 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Ai Avatar Video
# AI Avatar & Talking Head Video Put words in a face. This skill routes across RunComfy's audio-driven avatar models — OmniHuman, Wan 2-7 with audio_url, HappyHorse, Seedance v2 — picking the right path for the user's intent and shipping the documented prompts + the exact `runcomfy run` invoke for each. [runcomfy.com](https://www.runcomfy.com/?utm_source=skills.sh&utm_medium=skill&utm_campaign=ai-avatar-video) · [Lip-sync feature](https://www.runcomfy.com/models/feature/lip-sync?utm_source=skills.sh&utm_medium=skill&utm_campaign=ai-avatar-video) · [CLI docs](https://docs.runcomfy.com/cli/introduction?utm_source=skills.sh&utm_medium=skill&utm_campaign=ai-avatar-video) ## Powered by the RunComfy CLI ```bash # 1. Install (see runcomfy-cli skill for details) npm i -g @runcomfy/cli # or: npx -y @runcomfy/cli --version # 2. Sign in runcomfy login # or in CI: export RUNCOMFY_TOKEN=<token> # 3. Generate an avatar video runcomfy run <vendor>/<model>/<endpoint> \ --input '{"prompt": "...", "audio_url": "https://...", "image_url": "https://..."}' \ --output-dir ./out ``` CLI deep dive: [`runcomfy-cli`](https://www.skills.sh/agentspace-so/runcomfy-agent-skills/runcomfy-cli) skill. ## Install this skill ```bash npx skills add agentspace-so/runcomfy-agent-skills --skill ai-avatar-video -g ``` --- ## Pick the right model for the user's intent Listed newest first. The agent classifies user intent — pre-recorded audio file or just a script? Photoreal portrait or stylized character? Single shot or cinematic composition? — and picks one route below. **OmniHuman** — `bytedance/omnihuman/api` *(default)* > ByteDance audio-driven full-body avatar. Feed one portrait + one audio file, get back a video where the subject speaks / sings / gestures naturally. Listed on RunComfy's `/feature/lip-sync` as the curated default. > Pick for: UGC voiceover, virtual presenter, dubbed product demo, multi-language clips from same portrait. > Avoid for: no audio file available (need to generate speech from a script) — use **HappyHorse 1.0**. **HappyHorse 1.0** — `happyhorse/happyhorse-1-0/text-to-video` (t2v) · `happyhorse/happyhorse-1-0/image-to-video` (i2v) > Arena #1 t2v / i2v with in-pass audio generated from prompt. No external audio file required — quote the spoken line inside the prompt. > Pick for: written script with no audio file, "write a script → get a video", concept clips, i2v talking-head from an existing portrait. > Avoid for: precise lip-sync to a specific MP3 — audio is regenerated each call, not locked. **Seedance v2 Pro** — `bytedance/seedance-v2/pro` > ByteDance multi-modal flagship — up to 9 reference images, 3 reference videos, 3 reference audio tracks composed in one pass with cinematic motion / lens / lighting control. > Pick for: cinematic monologue with reference subject +