AI & Agents · Generative Media

Generative Media tools

Every Generative Media tool worth a solo builder's time - the agent skills, MCP servers and marketplaces tagged Generative Media, ranked by community signal. A focused slice of the broader AI & Agents category.

What's in Generative Media

Generative Media collects 532 curated tools across agent skills, a focused part of the broader AI & Agents category. Every one is screened against a single quality bar and ranked by real community signal.

These tools span Idea, Validate, Build, Ship, Launch and Grow of the build journey.

514 shown of 4,864
Description
1Remotion RenderAI & Agentsqu-skills/skillsTurn Remotion/React TSX components into MP4s through belt so your agent can ship motion graphics and data-driven videos from code.
235k512
2Ai Video GenerationAI & Agentsqu-skills/skillsGenerate short-form and marketing videos from prompts or reference images without leaving your agent, using inference.sh belt and 40+ models.
235k512
3Ai Image GenerationAI & Agentsqu-skills/skillsGenerate product mockups, marketing visuals, and social graphics from text prompts without leaving your agent workflow.
235k512
4Ai Avatar VideoAI & Agentsqu-skills/skillsProduce talking-head and UGC-style avatar videos from text or audio for explainers, ads, and virtual presenters via belt CLI.
235k512
5Video EditAI & Agentsagentspace-so/runcomfy-agent-skillsRoute natural-language video edit requests to the right RunComfy edit model and run jobs through the local RunComfy CLI without guessing which endpoint fits each transform.
234k15
6Image To VideoAI & Agentsagentspace-so/runcomfy-agent-skillsAnimate still images via RunComfy by routing to HappyHorse, Wan 2.7, or Seedance i2v models.
234k15
7Image EditAI & Agentsagentspace-so/runcomfy-agent-skillsRoute natural-language image edits through the RunComfy CLI to the right catalog model for backgrounds, objects, inpainting, or in-image text without trial-and-error model picking.
234k15
8Flux KontextAI & Agentsagentspace-so/runcomfy-agent-skillsRun Flux 1 Kontext Pro local image edits on RunComfy with BFL prompting patterns for high-fidelity changes.
234k15
9Nano Banana 2AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate images with Google Nano Banana 2 text-to-image on RunComfy for fast iteration and in-image typography.
233k15
10Nano Banana EditAI & Agentsagentspace-so/runcomfy-agent-skillsEdit images with Nano Banana 2 on RunComfy preserving identity across batch edits up to 20 inputs.
233k15
11Seedance V2AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate cinematic short-form video with Seedance 2.0 Pro multimodal references on RunComfy.
233k15
12Happyhorse 1 0AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate text-to-video with HappyHorse 1.0 on RunComfy including native 1080p synchronized audio.
233k15
13Gpt Image EditAI & Agentsagentspace-so/runcomfy-agent-skillsEdit images with OpenAI GPT Image 2 on RunComfy for multilingual in-image text and multi-reference layout edits.
233k15
14Wan 2 7AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate text-to-video with Wan 2.7 on RunComfy including multi-reference conditioning and audio lip-sync.
233k15
15Flux 2 KleinAI & Agentsagentspace-so/runcomfy-agent-skillsGenerate images fast with Flux 2 Klein on RunComfy using sub-second latency and multi-reference brand styling.
233k15
16Kling 3 0AI & Agentsagentspace-so/runcomfy-agent-skillsGenerate Kling 3.0 multi-shot video (text-to-video and image-to-video) via RunComfy CLI across Standard, Pro, and 4K tiers.
210k15
17Gpt Image 2AI & Agentsagentspace-so/agent-skillsGenerate images with OpenAI GPT Image 2 text-to-image on RunComfy using ChatGPT Images prompting patterns.
194k9
18Ai Video GenerationAI & Agentsagentspace-so/runcomfy-agent-skillsRoute text-to-video, image-to-video, and extend flows across RunComfy models via runcomfy CLI with intent-based model selection.
178k15
19Ai Image GenerationAI & Agentsagentspace-so/runcomfy-agent-skillsRoute text-to-image and image-to-image jobs to the right RunComfy model (FLUX, Nano Banana, GPT Image, Seedream, Qwen, Wan) with tuned prompts and `runcomfy run` invokes.
178k15
20Face SwapAI & Agentsagentspace-so/runcomfy-agent-skillsSwap faces or characters in stills or video via RunComfy—routing to Wan Animate, GPT Image Edit, Nano Banana, Flux Kontext, or Kling Motion Control by scenario.
177k15
21Ai Avatar VideoAI & Agentsagentspace-so/runcomfy-agent-skillsGenerate talking-head, lip-sync, and avatar videos with RunComfy CLI across OmniHuman, Wan, HappyHorse, and Seedance.
176k15
22Video InpaintingAI & Agentsagentspace-so/runcomfy-agent-skillsLet agents remove objects, clean watermarks, or patch masked regions across video frames using the RunComfy CLI with the right Wan, Lucy Edit, or Seedream route.
175k15
23Controlnet PoseAI & Agentsagentspace-so/runcomfy-agent-skillsTransfer motion from a reference video onto a target character (Kling motion control),Generate images conditioned on OpenPose, DWPose, depth, or canny references,Pick video vs still and photoreal vs s
174k15
24Image InpaintingAI & Agentsagentspace-so/runcomfy-agent-skillsRemove objects or watermarks from still images using a mask,Fill or replace a masked region with Z-Image Turbo or edit models,Run inpainting jobs through the runcomfy CLI from an agent workflow
174k15
25LipsyncAI & Agentsagentspace-so/runcomfy-agent-skillsAnimate a portrait still plus audio into a speaking avatar (OmniHuman),Re-sync mouth movement on existing video to a new audio track (Sync Labs, Kling, Creatify),Generate video with synced speech from
174k15
26Video ExtendAI & Agentsagentspace-so/runcomfy-agent-skillsExtend short Veo clips into longer sequences or chained shots via RunComfy CLI extend-video endpoints.
174k15
27Image OutpaintingAI & Agentsagentspace-so/runcomfy-agent-skillsUncrop stills, change aspect ratio, and expand canvas while keeping the original subject intact via RunComfy edit routes.
174k15
28Video OutpaintingAI & Agentsagentspace-so/runcomfy-agent-skillsSpatially widen or reframe video (e.g. 9:16 to 16:9) while keeping central action consistent.
174k15
29RelightAI & Agentsagentspace-so/runcomfy-agent-skillsRelight product shots or portraits via RunComfy CLI without reshooting—studio, golden hour, rim light, and color-temperature changes.
173k15
30Elevenlabs Music GenerationAI & Agentsagentspace-so/runcomfy-agent-skillsGenerate vocal songs, instrumentals, jingles, podcast intros, or game loops from prompts via ElevenLabs Music on RunComfy.
173k15
31Ace StepAI & Agentsagentspace-so/runcomfy-agent-skillsGenerate, inpaint, or extend stereo music tracks with StepFun ACE Step through the RunComfy CLI when you need cheap, tag-driven audio for demos, apps, or ads.
165k15
32Ai MusicAI & Agentsagentspace-so/runcomfy-agent-skillsLet the agent pick RunComfy’s ElevenLabs or ACE Step music stack from plain-language intent—vocals, cheap beds, multilingual lyrics, or inpaint/outpaint edits—and run the right `runcomfy` command.
165k15
33Seedance V2AI & Agentsruncomfy-com/skillscinematic-short-form-video
134k1
34Face SwapAI & Agentsruncomfy-com/skillsSwap faces or characters in stills or video by choosing the right RunComfy model (Wan Animate, GPT Image 2 Edit, Nano Banana, Flux Kontext, Kling Motion Control) from user intent.
134k1
35Ai Image GenerationAI & Agentsruncomfy-com/skillsText-to-image and image-to-image across FLUX, Nano Banana, GPT Image, Seedream, Qwen,Choose models for typography, photoreal portraits, or fast iteration,Restyle or edit images with multi-reference br
134k1
36Image To VideoAI & Agentsruncomfy-com/skillsstill-to-motion-animation
134k1
37Image OutpaintingAI & Agentsdoany-ai/skillsExtend still image canvas, uncrop, and change aspect ratio while preserving the original scene via Nano Banana 2, GPT Image 2, FLUX Kontext Pro, or brand-locked edit endpoints.
134k1
38Gpt Image 2AI & Agentsruncomfy-com/skillstypography-and-brand-imagery
134k1
39Controlnet PoseAI & Agentsdoany-ai/skillsPose-, depth-, and motion-conditioned image or video generation through RunComfy CLI routing.
134k1
40Nano Banana EditAI & Agentsdoany-ai/skillsEdit images with Google Nano Banana 2 on RunComfy via the local CLI, with routing guidance to sibling edit models when another endpoint fits better.
134k1
41Ai Avatar VideoAI & Agentsdoany-ai/skillsGenerate talking-head, lip-sync, and avatar videos from audio or portraits via RunComfy CLI across OmniHuman, Wan 2-7, HappyHorse, and Seedance v2.
134k1
42Happyhorse 1 0AI & Agentsdoany-ai/skillsGenerate text-to-video with HappyHorse 1.0 on RunComfy—1080p, in-pass synced audio, multi-shot consistency—using documented duration, aspect ratio, and resolution options.
134k1
43Image InpaintingAI & Agentsruncomfy-com/skillsMask-driven inpainting and region edits on RunComfy—object or watermark removal, blemish cleanup, fill and replace masked areas—with Z-Image Turbo Inpainting or prose-driven edit models when no mask e
134k1
44Image InpaintingAI & Agentsdoany-ai/skillsRemove objects, watermarks, or blemishes with a binary mask,Fill or replace a masked region via Z-Image Turbo Inpainting,Describe a region in prose when no mask using Nano Banana, GPT Image, or FLUX K
134k1
45Video EditAI & Agentsruncomfy-com/skillsRoute video edit intents (restyle, background swap, motion control) to the right RunComfy model via CLI.
134k1
46Flux 2 KleinAI & Agentsdoany-ai/skillsGenerate fast Flux 2 Klein images on RunComfy with distilled-model prompting, step-count strategy, 9B vs 4B choice, and fallbacks to Flux 2 Pro, Seedream 5, or GPT Image 2.
134k1
47Nano Banana 2AI & Agentsdoany-ai/skillsGenerate images with Google Nano Banana 2 on RunComfy for rapid iteration, in-image typography, social thumbnails, and drafts; includes resolution pricing, safety tolerance, and routing guidance to Pr
134k1
48RelightAI & Agentsdoany-ai/skillsRelight product or portrait stills via RunComfy (Qwen relight LoRA or fallback edit models) without reshooting.
134k1
49Gpt Image EditAI & Agentsruncomfy-com/skillsEdit existing images with GPT Image 2 on RunComfy CLI using preservation-aware prompts and multi-reference inputs
134k1
50Elevenlabs Music GenerationAI & Agentsruncomfy-com/skillsCreate songs, instrumentals, jingles, and podcast/game audio from text via ElevenLabs Music on RunComfy with section tags and duration control.
134k1
51Face SwapAI & Agentsdoany-ai/skillsSubstitute one identity for another in stills or video by routing RunComfy CLI jobs to Wan animate, GPT Image 2, Nano Banana, Flux Kontext, or Kling motion models.
134k1
52Video ExtendAI & Agentsdoany-ai/skillsExtend or continue existing Veo 3-1 clips via RunComfy CLI, chain narrative shots from a seed video, and produce longer clips with consistent motion and subject identity.
134k1
53Video OutpaintingAI & Agentsdoany-ai/skillsExtend video spatial canvas and convert aspect ratios (e.g. 9:16 to 16:9) while keeping central action, via Wan 2-7 edit-video or ComfyUI outpaint workflows for hero seams.
134k1
54Ai Video GenerationAI & Agentsruncomfy-com/skillsGenerate or extend AI video clips from text or images by auto-selecting HappyHorse, Wan, Seedance, Kling, Veo, Hailuo, and related RunComfy models
134k1
55LipsyncAI & Agentsdoany-ai/skillsMatch mouth movement to an audio track on portrait stills or existing video via RunComfy lip-sync endpoints
134k1
56Video InpaintingAI & Agentsruncomfy-com/skillsRemove objects or watermarks across video frames,Replace masked regions with motion-consistent fills,Pick Wan, Lucy Edit, or Seedream routes by edit style
134k1
57Ace StepAI & Agentsdoany-ai/skillsGenerate or edit stereo music from tags and lyrics through RunComfy’s ACE Step APIs without wiring your own StepFun hosting.
134k1
58Kling 3 0AI & Agentsruncomfy-com/skillsGenerate multi-shot cinematic Kling 3.0 video with native audio from text or a reference image through the RunComfy CLI.
134k1
59Controlnet PoseAI & Agentsruncomfy-com/skillsRoute pose-, skeleton-, depth-, or motion-conditioned image and video jobs to the right RunComfy endpoints (Kling motion control, Wan Animate, Z-Image Turbo ControlNet) from natural-language requests.
134k1
60Elevenlabs Music GenerationAI & Agentsdoany-ai/skillsCompose full vocal songs or instrumental beds from text with ElevenLabs Music on RunComfy for podcasts, games, ads, and product videos.
134k1

Showing the top 514 of 4,864 tools · search to find the rest.

Explore more
FAQ

Generative Media tools - common questions

What counts as a Generative Media tool?

Any agent skill, MCP server or marketplace tagged Generative Media - a focused slice of the broader AI & Agents category. Skillselion collects every Generative Media tool across types on one page.

How are Generative Media tools ranked?

By real community signal - installs, GitHub stars and votes - not paid placement. Sponsored slots, when present, are labelled and kept out of the ranking.

This week for builders

Five minutes, every Monday — the tools, releases and tactics for shipping solo.

unsubscribe anytime.