
Ai Avatar Video
Create talking-head, lip-sync, UGC, and virtual presenter videos by routing to the right RunComfy avatar model and running the documented runcomfy CLI commands.
Install
npx skills add https://github.com/runcomfy-com/skills --skill ai-avatar-video
What is this skill?
- Routes across ByteDance OmniHuman, Wan 2-7 audio_url, HappyHorse 1.0, and Seedance v2 Pro by intent
- Triggers on talking head, lip sync, HeyGen/Synthesia alternatives, and make-portrait-speak workflows
- Ships per-model prompting patterns plus minimal runcomfy run commands
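The intent-based routing summarized in the list above can be sketched as a small shell helper. This is a minimal illustration, not the skill's actual logic: the function name `pick_avatar_endpoint` and the intent labels (`audio`, `script`, `script-with-portrait`, `cinematic`) are hypothetical, while the endpoint IDs are the ones listed in the SKILL.md routing section. Wan 2-7 is left out because its endpoint ID is not shown in this part of the document. The script only prints a command, so it does not need the CLI installed.

```bash
#!/usr/bin/env bash
# Hypothetical helper: map a coarse user intent to a RunComfy endpoint ID.
# Intent labels are assumptions; endpoint IDs come from the SKILL.md routing section.
pick_avatar_endpoint() {
  case "$1" in
    audio)                echo "bytedance/omnihuman/api" ;;                  # portrait + existing audio file
    script)               echo "happyhorse/happyhorse-1-0/text-to-video" ;;  # script only, no audio file
    script-with-portrait) echo "happyhorse/happyhorse-1-0/image-to-video" ;; # script + existing portrait
    cinematic)            echo "bytedance/seedance-v2/pro" ;;                # multi-reference cinematic shot
    *) echo "unknown intent: $1" >&2; return 1 ;;
  esac
}

# Dry run: print the invocation instead of executing it.
# The JSON payload is left as a placeholder; which fields each model accepts is not assumed here.
endpoint="$(pick_avatar_endpoint audio)"
echo "runcomfy run ${endpoint} --input '<json>' --output-dir ./out"
```

Printing the command rather than running it keeps the sketch safe to try before signing in; replace `<json>` with the input documented for the chosen model.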
Adoption & trust: 113k installs on skills.sh; 1 GitHub star; 2/3 security scanners passed (skills.sh audits).
Journey fit
Primary fit
Avatar and dubbed video output supports distribution, demos, and lifecycle content rather than core app implementation. Content covers scripted video, spokespeople, and audio-driven creative for channels and lifecycle touchpoints.
Common Questions / FAQ
Is Ai Avatar Video safe to install?
skills.sh reports 2 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
# AI Avatar & Talking Head Video

Put words in a face. This skill routes across RunComfy's audio-driven avatar models — OmniHuman, Wan 2-7 with audio_url, HappyHorse, Seedance v2 — picking the right path for the user's intent and shipping the documented prompts + the exact `runcomfy run` invoke for each.

[runcomfy.com](https://www.runcomfy.com/?utm_source=skills.sh&utm_medium=skill&utm_campaign=ai-avatar-video) · [Lip-sync feature](https://www.runcomfy.com/models/feature/lip-sync?utm_source=skills.sh&utm_medium=skill&utm_campaign=ai-avatar-video) · [CLI docs](https://docs.runcomfy.com/cli/introduction?utm_source=skills.sh&utm_medium=skill&utm_campaign=ai-avatar-video)

## Powered by the RunComfy CLI

```bash
# 1. Install (see runcomfy-cli skill for details)
npm i -g @runcomfy/cli      # or: npx -y @runcomfy/cli --version

# 2. Sign in
runcomfy login              # or in CI: export RUNCOMFY_TOKEN=<token>

# 3. Generate an avatar video
runcomfy run <vendor>/<model>/<endpoint> \
  --input '{"prompt": "...", "audio_url": "https://...", "image_url": "https://..."}' \
  --output-dir ./out
```

CLI deep dive: [`runcomfy-cli`](https://www.skills.sh/agentspace-so/runcomfy-agent-skills/runcomfy-cli) skill.

## Install this skill

```bash
npx skills add agentspace-so/runcomfy-agent-skills --skill ai-avatar-video -g
```

---

## Pick the right model for the user's intent

Listed newest first. The agent classifies user intent — pre-recorded audio file or just a script? Photoreal portrait or stylized character? Single shot or cinematic composition? — and picks one route below.

**OmniHuman** — `bytedance/omnihuman/api` *(default)*
> ByteDance audio-driven full-body avatar. Feed one portrait + one audio file, get back a video where the subject speaks / sings / gestures naturally. Listed on RunComfy's `/feature/lip-sync` as the curated default.
> Pick for: UGC voiceover, virtual presenter, dubbed product demo, multi-language clips from same portrait.
> Avoid for: no audio file available (need to generate speech from a script) — use **HappyHorse 1.0**.

**HappyHorse 1.0** — `happyhorse/happyhorse-1-0/text-to-video` (t2v) · `happyhorse/happyhorse-1-0/image-to-video` (i2v)
> Arena #1 t2v / i2v with in-pass audio generated from prompt. No external audio file required — quote the spoken line inside the prompt.
> Pick for: written script with no audio file, "write a script → get a video", concept clips, i2v talking-head from an existing portrait.
> Avoid for: precise lip-sync to a specific MP3 — audio is regenerated each call, not locked.

**Seedance v2 Pro** — `bytedance/seedance-v2/pro`
> ByteDance multi-modal flagship — up to 9 reference images, 3 reference videos, 3 reference audio tracks composed in one pass with cinematic motion / lens / lighting control.
> Pick for: cinematic monologue with reference subject +