Web Video Presentation

Distribution is where narrated visual content ships; this skill produces the recordable surface, not just slide bullets in a doc. Distribution covers product demos, lessons, and channel-ready video collateral built from written source material.

Also useful

Also useful

Where it fits

Example use

A product changelog article becomes a click-paced demo you record for a launch-week YouTube upload.

Example use

You scaffold the Vite + React 1920×1080 stage and chapter cursor before writing custom animation components.

Example use

An onboarding lesson script is rebuilt as hover-hidden progress UI suitable for Bilibili narration.

How it compares

Method-driven codegen for recordable web presentations—not a hosted slide SaaS or a single FFmpeg script.

Common Questions / FAQ

Who is web-video-presentation for?

Solo builders and educators using coding agents who want article-to-video workflows with a React presentation shell they control and record.

When should I use web-video-presentation?

At launch when packaging demos for YouTube or Bilibili; during build/frontend when scaffolding the presentation repo; at grow/content when turning lessons into channel assets.

Is web-video-presentation safe to install?

It generates a standard frontend project; review the Security Audits panel on this Prism page and audit dependencies in the scaffolded Vite app like any new repo.

SKILL.md

READMESKILL.md - Web Video Presentation

{
  "name": "web-video-presentation",
  "version": "1.2.1",
  "category": "Web Video / Presentation",
  "description": "Turn scripts, articles, lessons, product demos, and talks into click-driven 16:9 web presentations that can be screen-recorded as cinematic videos. Ships a Vite + React + TypeScript scaffold, a (chapter, step) cursor model, hard collaboration checkpoints, and a theme-token architecture.",
  "homepage": "https://github.com/ConardLi/garden-skills/tree/main/skills/web-video-presentation",
  "compat": [
    "claude-code",
    "claude-ai",
    "cursor",
    "codex-cli",
    "gemini-cli",
    "opencode"
  ]
}


# Web Video Presentation Skill

**A method-driven agent skill for turning scripts and articles into click-driven 16:9 web presentations that can be screen-recorded as cinematic videos.**

[中文文档](./README.zh-CN.md) · [Back to collection root](../../README.md)

![Web Video Presentation Skill](https://cdn.jsdelivr.net/gh/ConardLi/assets@main/imgs/web-video-presentation-skill.webp)

---

## What Is This?

`web-video-presentation` helps an agent build a Vite + React + TypeScript presentation that behaves like a video production surface rather than a slide deck. Each click advances one narration beat, each step owns the whole 1920×1080 stage, and the progress UI stays hidden unless hovered so the output is clean for screen recording.

It is designed for:

- Turning a written article into a Bilibili / YouTube / video-channel narration script
- Turning an existing voiceover script into a cinematic web presentation
- Building product demos, tutorials, keynote-style explainers, and visual talks
- Creating “dynamic PPT, but not PPT” experiences with strong motion and pacing
- Optionally synthesizing narration audio after the visual outline is approved

The skill is primarily a **methodology and collaboration workflow**. The scaffold supplies reusable tokens, stage primitives, themes, and examples, but each project should still choose a visual language that fits the topic.

---

## Core Ideas

- **Fixed 16:9 stage** — content is authored in a stable 1920×1080 coordinate system and scaled to the viewport.
- **One global step cursor** — click or keyboard advances `(chapter, step)`, with the cursor persisted locally.
- **One step, one idea** — every beat gets a focused full-screen scene instead of accumulating slide bullets.
- **Script beats drive structure** — narration rhythm maps directly to visual steps.
- **Hidden chrome** — progress controls are hover-only, keeping recordings clean.
- **Motion first** — each scene needs a moving visual anchor; static paragraphs are treated as a smell.
- **Theme tokens** — visual decisions flow through semantic tokens so themes can change the whole feel.
- **Pluggable TTS** — provider-agnostic audio runner ships **two built-in providers** (MiniMax `mmx-cli` and OpenAI TTS via curl); swap to ElevenLabs / edge-tts / Azure / Google Cloud / macOS `say` / any self-hosted TTS by dropping a single shell file into `tts-providers/`.
- **Hard checkpoints** — the agent pauses after script/theme alignment, after outline approval, and before optional audio synthesis.

---

## Workflow

```text
Phase 1.1  Identify input
Phase 1.2  Article -> narration script
   |
Checkpoint A1  Script, theme, and rough asset plan
   |
Phase 1.3  Script + article -> outline.md
   |
Checkpoint A2  Outline approval + development mode
   |
Phase 2    Build the Vite / React / TS presentation
   |
Checkpoint B   Ask whether to synthesize audio
   |
Phase 3    Optional audio synthesis
Phase 4    Recording and post-production
```

The checkpoints are part of the skill contract: the agent should not silently rush from raw article to finished code. Theme choice influences motion design, and outline approval keeps chapter pacing from drifting.

---

## What It Ships

```text
skills/web-video-presentation/
├── SKILL.md
├── README.md / README.zh-CN.md
├── references/
│   ├── PRINCIPLES.md
│   ├── CHAPTER-CRAFT.md
│   ├── OUTLIN

What is this skill?

Vite + React + TypeScript scaffold for 1920×1080 stage-native steps

(chapter, step) cursor model—one click advances one narration beat

Hard collaboration checkpoints in the agent workflow

Theme-token architecture for consistent cinematic styling

Progress UI hidden unless hovered for clean screen recording

Skill version 1.2.1

1920×1080 full-stage steps per narration beat

Compat list includes six agent CLIs including Claude Code and Cursor

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Adoption & trust: 1.8k installs on skills.sh; 7.6k GitHub stars; 3/3 security scanners passed (skills.sh audits).

Who is it for?

Creators and founders repurposing long-form writing into narrated video, course segments, or polished product demos without a separate video editor timeline.

Skip if: Teams that only need PowerPoint-style decks with no codegen or who want automated AI voice/video rendering without building a React app.

Journey fit

Spans multiple journey phases - primary shelf plus alternate fits below.

Primary fit

Also useful

Also useful

Where it fits

Example use

A product changelog article becomes a click-paced demo you record for a launch-week YouTube upload.

Example use

You scaffold the Vite + React 1920×1080 stage and chapter cursor before writing custom animation components.

Example use