
Ad Creative
Pick the right generative AI stack for static banners, short video ads, voiceovers, and templated or personalized creative at scale.
Install
npx skills add https://github.com/infrasity-labs/dev-gtm-claude-skills --skill ad-creativeWhat is this skill?
- Decision table maps needs (static images, text overlays, 6–30s video, voiceover, multi-language, templated scale) to too
- Covers image generators (ChatGPT Images, Nano Banana Pro, Flux, Ideogram) plus video (Veo, Kling, Runway, Sora, Seedance
- Voice paths: ElevenLabs, OpenAI TTS, Cartesia, PlayHT, and brand cloning (ElevenLabs, Resemble AI)
- Code-based scale via Remotion for templated and personalized video ads
- Nano Banana Pro (Gemini) notes: high-quality ad images, product visuals, text rendering, ~$0.04/image (Gemini 2.5 Flash
Adoption & trust: 1 installs on skills.sh; 24 GitHub stars; trending (+100% hot-view momentum).
Recommended Skills
Seo Auditcoreyhaines31/marketingskills
Copywritingcoreyhaines31/marketingskills
Twitter Automationqu-skills/skills
Marketing Psychologycoreyhaines31/marketingskills
Content Strategycoreyhaines31/marketingskills
Programmatic Seocoreyhaines31/marketingskills
Journey fit
Primary fit
Ad creative production sits on the launch shelf because solo builders deploy paid and organic campaigns once the product exists. Distribution is where channel-ready visuals and video assets are produced before ads go live on social, search, and app networks.
SKILL.md
READMESKILL.md - Ad Creative
# Generative AI Tools for Ad Creative Reference for using AI image generators, video generators, and code-based video tools to produce ad visuals at scale. --- ## When to Use Generative Tools | Need | Tool Category | Best Fit | |------|---------------|----------| | Static ad images (banners, social) | Image generation | ChatGPT Images 2.0, Nano Banana Pro, Flux, Ideogram | | Ad images with text overlays | Image generation (text-capable) | Ideogram, Nano Banana Pro | | Short video ads (6-30 sec) | Video generation | Veo, Kling, Runway, Sora, Seedance | | Video ads with voiceover | Video gen + voice | Veo/Sora (native), or Runway + ElevenLabs | | Voiceover tracks for ads | Voice generation | ElevenLabs, OpenAI TTS, Cartesia | | Multi-language ad versions | Voice generation | ElevenLabs, PlayHT | | Brand voice cloning | Voice generation | ElevenLabs, Resemble AI | | Product mockups and variations | Image generation + references | Flux (multi-image reference) | | Templated video ads at scale | Code-based video | Remotion | | Personalized video (name, data) | Code-based video | Remotion | | Brand-consistent variations | Image gen + style refs | Flux, Ideogram, Nano Banana Pro | --- ## Image Generation ### Nano Banana Pro (Gemini) Google DeepMind's image generation model, available through the Gemini API. **Best for:** High-quality ad images, product visuals, text rendering **API:** Gemini API (Google AI Studio, Vertex AI) **Pricing:** ~$0.04/image (Gemini 2.5 Flash Image), ~$0.24/4K image (Nano Banana Pro) **Strengths:** - Strong text rendering in images (logos, headlines) - Native image editing (modify existing images with prompts) - Available through the same Gemini API used for text generation - Supports both generation and editing in one model **Ad creative use cases:** - Generate social media ad images from text descriptions - Create product mockup variations - Edit existing ad images (swap backgrounds, change colors) - Generate images with headline text baked in **API example:** ```bash # Using the Gemini API for image generation curl -X POST "https://generativelanguage.googleapis.com/v1beta/models/gemini-2.5-flash-image:generateContent" \ -H "Content-Type: application/json" \ -H "x-goog-api-key: $GEMINI_API_KEY" \ -d '{ "contents": [{"parts": [{"text": "Create a clean, modern social media ad image for a project management tool. Show a laptop with a kanban board interface. Bright, professional, 16:9 ratio."}]}], "generationConfig": {"responseModalities": ["TEXT", "IMAGE"]} }' ``` **Docs:** [Gemini Image Generation](https://ai.google.dev/gemini-api/docs/image-generation) --- ### Flux (Black Forest Labs) Open-weight image generation models with API access through Replicate and BFL's native API. **Best for:** Photorealistic images, brand-consistent variations, multi-reference generation **API:** Replicate, BFL API, fal.ai **Pricing:** ~$0.01-0.06/image depending on model and resolution **Model variants:** | Model | Speed | Quality | Cost | Best For | |-------|-------|---------|------|----------| | Flux 2 Pro | ~6 sec | Highest | $0.015/MP | Final production assets | | Flux 2 Flex | ~22 sec | High + editing | $0.06/MP | Iterative editing | | Flux 2 Dev | ~2.5 sec | Good | $0.012/MP | Rapid prototyping | | Flux 2 Klein | Fastest | Good | Lowest | High-volume batch generation | **Strengths:** - Multi-image reference (up to 8 images) for consistent identity across ads - Product consistency — same product in different contexts - Style transfer from reference images - Open-weight Dev model for self-hosting **Ad creative use cases:** - Generate 50+ ad variations with consistent product/person identity - Create product-in-context images (your SaaS on different devices) - Style-match to existing brand assets using reference images - Rapid A/B test image variations **Docs:** [Replicate Flux](https://replicate.com/black-forest-labs/flux-2-pro), [BFL API](https://docs.bfl.ml/) --- ### Ideogram Special