
Gpt Image 2
Generate or edit marketing and product visuals via GPT Image 2 with structured prompts across local, host-native, or advisor-only runtimes.
Overview
GPT Image 2 is an agent skill most often used in Build (also Launch distribution, Validate prototype) that generates and edits images via GPT Image 2 with 18 categories and 80+ prompt templates across three runtime modes
Install
npx skills add https://github.com/conardli/garden-skills --skill gpt-image-2What is this skill?
- 18 visual categories with 80+ structured prompt templates
- Three runtime modes: Garden local, host-native delegation, and advisor-only prompt coaching
- Supports OpenAI-compatible POST /images/generations and POST /images/edits only
- Single SKILL definition adapts behavior without changing user-facing workflow
- Compat listed for Claude Code, Cursor, Codex CLI, Gemini CLI, OpenCode, and Claude.ai
- 3 runtime modes (Garden local, host-native, advisor-only)
Adoption & trust: 1.7k installs on skills.sh; 7.6k GitHub stars; 3/3 security scanners passed (skills.sh audits).
What problem does it solve?
You need on-brand images fast but get inconsistent results when every agent chat invents one-off prompts and mismatched API setups.
Who is it for?
Solo builders shipping landing pages, app store art, or in-app graphics who already use or can add an OpenAI-compatible image API.
Skip if: Video, 3D, or non-image ML pipelines—or teams that require a single fixed runtime with no advisor-only fallback.
When should I use this skill?
When generating or editing images with GPT Image 2 or OpenAI-compatible image APIs inside a supported Garden host.
What do I get? / Deliverables
You get repeatable generations or edits using categorized templates and a mode-aware workflow that matches your host’s image capabilities.
- Generated or edited image assets
- Structured prompts from template library
Recommended Skills
Journey fit
Spans multiple journey phases - primary shelf plus alternate fits below.
Build/frontend is the canonical shelf because output is visual assets for interfaces, landing pages, and product surfaces. Frontend subphase covers hero images, UI mock visuals, and branded graphics produced during product construction.
Where it fits
Generate hero and feature illustrations for a SaaS landing section using a category template.
Produce quick mock screenshots before committing to a design system.
Create social and announcement visuals with the same structured prompts as the site.
How it compares
Structured generative-media skill pack, not an MCP server or general multimodal chat plugin.
Common Questions / FAQ
Who is gpt-image-2 for?
Indie makers and agent users who want GPT Image 2 workflows with templates across Claude Code, Cursor, Codex, and related hosts.
When should I use gpt-image-2?
During Build frontend for UI and marketing art, Validate prototype for mock visuals, and Launch distribution for social or store imagery.
Is gpt-image-2 safe to install?
Local mode may use network and API keys for image endpoints—review the Security Audits panel on this page and never embed secrets in prompts.
SKILL.md
READMESKILL.md - Gpt Image 2
{ "name": "gpt-image-2", "version": "1.0.3", "category": "Image Generation / Prompt Engineering", "description": "Focused image generation and editing skill for GPT Image 2 and OpenAI-compatible image APIs. Supports three runtime modes — Garden local, host-native delegation, and advisor-only — with 18 visual categories and 80+ structured prompt templates.", "homepage": "https://github.com/ConardLi/garden-skills/tree/main/skills/gpt-image-2", "compat": [ "claude-code", "claude-ai", "cursor", "codex-cli", "gemini-cli", "opencode" ] } # GPT Image 2 Skill **A focused image-generation / editing skill for GPT Image 2, with a single SKILL definition that adapts to three runtime modes — local generation, host-native delegation, and pure prompt advisor.** [中文文档](./README.zh-CN.md) · [Back to collection root](../../README.md)  --- ## What it does This skill is a structured prompt-engineering and image-generation pack built around the GPT Image 2 model (and OpenAI-compatible image endpoints). It only does two image tasks — `POST /images/generations` and `POST /images/edits` — but it does them in three different runtime environments without changing user-facing behavior. It bundles: - A **mode-aware workflow** so the same skill works whether the agent itself owns the image API key, the host has its own image tool, or there is no image tool at all. - A **structured template library** of 18 categories and 79 prompt templates covering posters, UI mockups, product visuals, infographics, academic figures, technical diagrams, comics, avatars, and editing workflows. - **Reproducible prompt + image archival** under `garden-gpt-image-2/prompt/` and `garden-gpt-image-2/image/` with task-slug + timestamp naming. --- ## The three runtime modes The very first thing this skill does on any task is run a tiny detection script: ```bash node skills/gpt-image-2/scripts/check-mode.js # or for structured output: node skills/gpt-image-2/scripts/check-mode.js --json ``` The output picks one of three modes: | Mode | Trigger | Behavior | |---|---|---| | **A — Garden local** | `ENABLE_GARDEN_IMAGEGEN` truthy **AND** `OPENAI_API_KEY` present | End-to-end: pick template → render prompt → call `generate.js` / `edit.js` → image lands on disk | | **B — Host-native** | Garden disabled, but the host agent already has an image tool (`image_generation`, `dalle`, `nano_banana`, image MCP, etc.) | Render the prompt, then **delegate** image generation to the host's own tool | | **C — Advisor** | Garden disabled, host has no image tool | Skill degrades into a high-quality prompt writer — saves the rendered prompt to `garden-gpt-image-2/prompt/` and instructs the user to paste it into ChatGPT / Midjourney / DALL·E / Sora / Nano Banana / their own gateway | In all three modes, prompt files are saved (mode A & C must save, mode B is recommended for reuse). Only mode A produces an image file; mode B leaves that to the host, mode C cannot. --- ## Quick start ### 0. Detect the mode (always step 0) ```bash node skills/gpt-image-2/scripts/check-mode.js ``` The commands below (1–4) only apply in **Mode A**. ### 1. Text-to-image ```bash node skills/gpt-image-2/scripts/generate.js \ --prompt "A cute baby sea otter" \ --size 1024x1024 \ --quality high ``` ### 2. Generate from a saved prompt file ```bash node skills/gpt-image-2/scripts/generate.js \ --promptfile garden-gpt-image-2/prompt/poster-20260424-153045.md ``` ### 3. Edit an existing image ```bash node skills/gpt-image-2/scripts/edit.js \ --image assets/source.png \ --prompt "Replace the background with a clean studio scene" ``` ### 4. Mask-based local edit ```bash node skills/gpt-image-2/scripts/edit.js \ --image assets/source.png \ --mask assets/mask.png \ --prompt "Replace only the masked area with a glass vase" ``` For Mode B / C there is no CLI