AI & Agents · Generative Media

Generative Media tools

Every Generative Media tool worth a solo builder's time - the agent skills, MCP servers and marketplaces tagged Generative Media, ranked by community signal. A focused slice of the broader AI & Agents category.

What's in Generative Media

Generative Media collects 532 curated tools across agent skills, a focused part of the broader AI & Agents category. Every one is screened against a single quality bar and ranked by real community signal.

These tools span Idea, Validate, Build, Ship, Launch and Grow of the build journey.

514 shown of 4,864

Description

1Remotion RenderAI & Agentsqu-skills/skillsTurn Remotion/React TSX components into MP4s through belt so your agent can ship motion graphics and data-driven videos from code.

235k512

2Ai Video GenerationAI & Agentsqu-skills/skillsGenerate short-form and marketing videos from prompts or reference images without leaving your agent, using inference.sh belt and 40+ models.

235k512

3Ai Image GenerationAI & Agentsqu-skills/skillsGenerate product mockups, marketing visuals, and social graphics from text prompts without leaving your agent workflow.

235k512

4Ai Avatar VideoAI & Agentsqu-skills/skillsProduce talking-head and UGC-style avatar videos from text or audio for explainers, ads, and virtual presenters via belt CLI.

235k512

5Video EditAI & Agentsagentspace-so/runcomfy-agent-skillsRoute natural-language video edit requests to the right RunComfy edit model and run jobs through the local RunComfy CLI without guessing which endpoint fits each transform.

234k15

6Image To VideoAI & Agentsagentspace-so/runcomfy-agent-skillsAnimate still images via RunComfy by routing to HappyHorse, Wan 2.7, or Seedance i2v models.

234k15

7Image EditAI & Agentsagentspace-so/runcomfy-agent-skillsRoute natural-language image edits through the RunComfy CLI to the right catalog model for backgrounds, objects, inpainting, or in-image text without trial-and-error model picking.

234k15

8Flux KontextAI & Agentsagentspace-so/runcomfy-agent-skillsRun Flux 1 Kontext Pro local image edits on RunComfy with BFL prompting patterns for high-fidelity changes.

234k15

9Nano Banana 2AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate images with Google Nano Banana 2 text-to-image on RunComfy for fast iteration and in-image typography.

233k15

10Nano Banana EditAI & Agentsagentspace-so/runcomfy-agent-skillsEdit images with Nano Banana 2 on RunComfy preserving identity across batch edits up to 20 inputs.

233k15

11Seedance V2AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate cinematic short-form video with Seedance 2.0 Pro multimodal references on RunComfy.

233k15

12Happyhorse 1 0AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate text-to-video with HappyHorse 1.0 on RunComfy including native 1080p synchronized audio.

233k15

13Gpt Image EditAI & Agentsagentspace-so/runcomfy-agent-skillsEdit images with OpenAI GPT Image 2 on RunComfy for multilingual in-image text and multi-reference layout edits.

233k15

14Wan 2 7AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate text-to-video with Wan 2.7 on RunComfy including multi-reference conditioning and audio lip-sync.

233k15

15Flux 2 KleinAI & Agentsagentspace-so/runcomfy-agent-skillsGenerate images fast with Flux 2 Klein on RunComfy using sub-second latency and multi-reference brand styling.

233k15

16Kling 3 0AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate Kling 3.0 multi-shot video (text-to-video and image-to-video) via RunComfy CLI across Standard, Pro, and 4K tiers.

210k15

17Gpt Image 2AI & Agentsagentspace-so/agent-skillsGenerate images with OpenAI GPT Image 2 text-to-image on RunComfy using ChatGPT Images prompting patterns.

194k9

18Ai Video GenerationAI & Agentsagentspace-so/runcomfy-agent-skillsRoute text-to-video, image-to-video, and extend flows across RunComfy models via runcomfy CLI with intent-based model selection.

178k15

19Ai Image GenerationAI & Agentsagentspace-so/runcomfy-agent-skillsRoute text-to-image and image-to-image jobs to the right RunComfy model (FLUX, Nano Banana, GPT Image, Seedream, Qwen, Wan) with tuned prompts and `runcomfy run` invokes.

178k15

20Face SwapAI & Agentsagentspace-so/runcomfy-agent-skillsSwap faces or characters in stills or video via RunComfy—routing to Wan Animate, GPT Image Edit, Nano Banana, Flux Kontext, or Kling Motion Control by scenario.

177k15

21Ai Avatar VideoAI & Agentsagentspace-so/runcomfy-agent-skillsGenerate talking-head, lip-sync, and avatar videos with RunComfy CLI across OmniHuman, Wan, HappyHorse, and Seedance.

176k15

22Video InpaintingAI & Agentsagentspace-so/runcomfy-agent-skillsLet agents remove objects, clean watermarks, or patch masked regions across video frames using the RunComfy CLI with the right Wan, Lucy Edit, or Seedream route.

175k15

23Controlnet PoseAI & Agentsagentspace-so/runcomfy-agent-skillsTransfer motion from a reference video onto a target character (Kling motion control),Generate images conditioned on OpenPose, DWPose, depth, or canny references,Pick video vs still and photoreal vs s

174k15

24Image InpaintingAI & Agentsagentspace-so/runcomfy-agent-skillsRemove objects or watermarks from still images using a mask,Fill or replace a masked region with Z-Image Turbo or edit models,Run inpainting jobs through the runcomfy CLI from an agent workflow

174k15

25LipsyncAI & Agentsagentspace-so/runcomfy-agent-skillsAnimate a portrait still plus audio into a speaking avatar (OmniHuman),Re-sync mouth movement on existing video to a new audio track (Sync Labs, Kling, Creatify),Generate video with synced speech from

174k15

26Video ExtendAI & Agentsagentspace-so/runcomfy-agent-skillsExtend short Veo clips into longer sequences or chained shots via RunComfy CLI extend-video endpoints.

174k15

27Image OutpaintingAI & Agentsagentspace-so/runcomfy-agent-skillsUncrop stills, change aspect ratio, and expand canvas while keeping the original subject intact via RunComfy edit routes.

174k15

28Video OutpaintingAI & Agentsagentspace-so/runcomfy-agent-skillsSpatially widen or reframe video (e.g. 9:16 to 16:9) while keeping central action consistent.

174k15

29RelightAI & Agentsagentspace-so/runcomfy-agent-skillsRelight product shots or portraits via RunComfy CLI without reshooting—studio, golden hour, rim light, and color-temperature changes.

173k15

30Elevenlabs Music GenerationAI & Agentsagentspace-so/runcomfy-agent-skillsGenerate vocal songs, instrumentals, jingles, podcast intros, or game loops from prompts via ElevenLabs Music on RunComfy.

173k15

31Ace StepAI & Agentsagentspace-so/runcomfy-agent-skillsGenerate, inpaint, or extend stereo music tracks with StepFun ACE Step through the RunComfy CLI when you need cheap, tag-driven audio for demos, apps, or ads.

165k15

32Ai MusicAI & Agentsagentspace-so/runcomfy-agent-skillsLet the agent pick RunComfy’s ElevenLabs or ACE Step music stack from plain-language intent—vocals, cheap beds, multilingual lyrics, or inpaint/outpaint edits—and run the right `runcomfy` command.

165k15

33Seedance V2AI & Agentsruncomfy-com/skillscinematic-short-form-video

134k1

34Face SwapAI & Agentsruncomfy-com/skillsSwap faces or characters in stills or video by choosing the right RunComfy model (Wan Animate, GPT Image 2 Edit, Nano Banana, Flux Kontext, Kling Motion Control) from user intent.

134k1

35Ai Image GenerationAI & Agentsruncomfy-com/skillsText-to-image and image-to-image across FLUX, Nano Banana, GPT Image, Seedream, Qwen,Choose models for typography, photoreal portraits, or fast iteration,Restyle or edit images with multi-reference br

134k1

36Image To VideoAI & Agentsruncomfy-com/skillsstill-to-motion-animation

134k1

37Image OutpaintingAI & Agentsdoany-ai/skillsExtend still image canvas, uncrop, and change aspect ratio while preserving the original scene via Nano Banana 2, GPT Image 2, FLUX Kontext Pro, or brand-locked edit endpoints.

134k1

38Gpt Image 2AI & Agentsruncomfy-com/skillstypography-and-brand-imagery

134k1

39Controlnet PoseAI & Agentsdoany-ai/skillsPose-, depth-, and motion-conditioned image or video generation through RunComfy CLI routing.

134k1

40Nano Banana EditAI & Agentsdoany-ai/skillsEdit images with Google Nano Banana 2 on RunComfy via the local CLI, with routing guidance to sibling edit models when another endpoint fits better.

134k1

41Ai Avatar VideoAI & Agentsdoany-ai/skillsGenerate talking-head, lip-sync, and avatar videos from audio or portraits via RunComfy CLI across OmniHuman, Wan 2-7, HappyHorse, and Seedance v2.

134k1

42Happyhorse 1 0AI & Agentsdoany-ai/skillsGenerate text-to-video with HappyHorse 1.0 on RunComfy—1080p, in-pass synced audio, multi-shot consistency—using documented duration, aspect ratio, and resolution options.

134k1

43Image InpaintingAI & Agentsruncomfy-com/skillsMask-driven inpainting and region edits on RunComfy—object or watermark removal, blemish cleanup, fill and replace masked areas—with Z-Image Turbo Inpainting or prose-driven edit models when no mask e

134k1

44Image InpaintingAI & Agentsdoany-ai/skillsRemove objects, watermarks, or blemishes with a binary mask,Fill or replace a masked region via Z-Image Turbo Inpainting,Describe a region in prose when no mask using Nano Banana, GPT Image, or FLUX K

134k1

45Video EditAI & Agentsruncomfy-com/skillsRoute video edit intents (restyle, background swap, motion control) to the right RunComfy model via CLI.

134k1

46Flux 2 KleinAI & Agentsdoany-ai/skillsGenerate fast Flux 2 Klein images on RunComfy with distilled-model prompting, step-count strategy, 9B vs 4B choice, and fallbacks to Flux 2 Pro, Seedream 5, or GPT Image 2.

134k1

47Nano Banana 2AI & Agentsdoany-ai/skillsGenerate images with Google Nano Banana 2 on RunComfy for rapid iteration, in-image typography, social thumbnails, and drafts; includes resolution pricing, safety tolerance, and routing guidance to Pr

134k1

48RelightAI & Agentsdoany-ai/skillsRelight product or portrait stills via RunComfy (Qwen relight LoRA or fallback edit models) without reshooting.

134k1

49Gpt Image EditAI & Agentsruncomfy-com/skillsEdit existing images with GPT Image 2 on RunComfy CLI using preservation-aware prompts and multi-reference inputs

134k1

50Elevenlabs Music GenerationAI & Agentsruncomfy-com/skillsCreate songs, instrumentals, jingles, and podcast/game audio from text via ElevenLabs Music on RunComfy with section tags and duration control.

134k1

51Face SwapAI & Agentsdoany-ai/skillsSubstitute one identity for another in stills or video by routing RunComfy CLI jobs to Wan animate, GPT Image 2, Nano Banana, Flux Kontext, or Kling motion models.

134k1

52Video ExtendAI & Agentsdoany-ai/skillsExtend or continue existing Veo 3-1 clips via RunComfy CLI, chain narrative shots from a seed video, and produce longer clips with consistent motion and subject identity.

134k1

53Video OutpaintingAI & Agentsdoany-ai/skillsExtend video spatial canvas and convert aspect ratios (e.g. 9:16 to 16:9) while keeping central action, via Wan 2-7 edit-video or ComfyUI outpaint workflows for hero seams.

134k1

54Ai Video GenerationAI & Agentsruncomfy-com/skillsGenerate or extend AI video clips from text or images by auto-selecting HappyHorse, Wan, Seedance, Kling, Veo, Hailuo, and related RunComfy models

134k1

55LipsyncAI & Agentsdoany-ai/skillsMatch mouth movement to an audio track on portrait stills or existing video via RunComfy lip-sync endpoints

134k1

56Video InpaintingAI & Agentsruncomfy-com/skillsRemove objects or watermarks across video frames,Replace masked regions with motion-consistent fills,Pick Wan, Lucy Edit, or Seedream routes by edit style

134k1

57Ace StepAI & Agentsdoany-ai/skillsGenerate or edit stereo music from tags and lyrics through RunComfy’s ACE Step APIs without wiring your own StepFun hosting.

134k1

58Kling 3 0AI & Agentsruncomfy-com/skillsGenerate multi-shot cinematic Kling 3.0 video with native audio from text or a reference image through the RunComfy CLI.

134k1

59Controlnet PoseAI & Agentsruncomfy-com/skillsRoute pose-, skeleton-, depth-, or motion-conditioned image and video jobs to the right RunComfy endpoints (Kling motion control, Wan Animate, Z-Image Turbo ControlNet) from natural-language requests.

134k1

60Elevenlabs Music GenerationAI & Agentsdoany-ai/skillsCompose full vocal songs or instrumental beds from text with ElevenLabs Music on RunComfy for podcasts, games, ads, and product videos.

134k1

Showing the top 514 of 4,864 tools · search to find the rest.

Explore more

By journey phase

By type

By what you're building

FAQ

Generative Media tools - common questions

What counts as a Generative Media tool?

Any agent skill, MCP server or marketplace tagged Generative Media - a focused slice of the broader AI & Agents category. Skillselion collects every Generative Media tool across types on one page.

How are Generative Media tools ranked?

By real community signal - installs, GitHub stars and votes - not paid placement. Sponsored slots, when present, are labelled and kept out of the ranking.

Generative Media tools

What's in Generative Media

Related AI & Agents sub-categories

Generative Media tools - common questions

What counts as a Generative Media tool?

How are Generative Media tools ranked?

This week for builders