
Pixel Surgeon
Generate, edit, and region-repair images and videos for landing pages, apps, and ads via Gemini, OpenAI, and Grok from one MCP server.
Overview
Pixel Surgeon is an MCP server for the Build phase that lets your agent generate and edit images and videos through the Gemini, OpenAI, and Grok APIs.
What is this MCP server?
- Multi-provider image generation: Google Gemini, OpenAI GPT Image, xAI Grok Imagine
- Video generation through providers whose keys are configured
- Image editing and targeted region repair workflows
- Three optional provider API keys: GOOGLE_API_KEY, OPENAI_API_KEY, XAI_API_KEY (each enables one vendor)
- npm package pixel-surgeon-mcp, version 1.1.1, stdio transport
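The per-vendor key behaviour listed above can be pictured with a small check. This is a minimal sketch, not Pixel Surgeon's actual code: the `enabled_providers` helper and the provider labels are hypothetical, and only the three environment variable names come from the listing.

```python
import os

# Environment variable names come from the listing; the provider labels and
# this helper are illustrative assumptions, not part of pixel-surgeon-mcp.
PROVIDER_KEYS = {
    "GOOGLE_API_KEY": "gemini",
    "OPENAI_API_KEY": "openai",
    "XAI_API_KEY": "grok",
}


def enabled_providers(env=None):
    """Return labels of providers whose API key is set to a non-empty value."""
    env = os.environ if env is None else env
    return [
        label
        for var, label in PROVIDER_KEYS.items()
        if env.get(var, "").strip()
    ]


if __name__ == "__main__":
    # Only the OpenAI key is supplied, so only that provider is reported.
    print(enabled_providers({"OPENAI_API_KEY": "sk-example"}))  # ['openai']
```

Running a check like this before starting the server is one way to see which vendors your agent should expect to have available.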
Community signal: 4 GitHub stars.
What problem does it solve?
Solo builders stall on visuals because switching between chat, three different AI art dashboards, and an image editor breaks momentum.
Who is it for?
Builders who need landing page art, social creatives, app mock refreshes, and quick inpainting fixes without opening separate vendor UIs.
Skip if: you work in print CMYK pipelines, rely on strict brand-locked Figma systems without human review, or are on a team that cannot store paid API secrets locally.
What do I get? / Deliverables
Your agent can generate and edit images, repair image regions, and produce video where enabled, all in one MCP workflow using whichever provider keys you supply.
- Generated or edited raster images per prompt
- Region-repaired image outputs
- Video assets when the provider and tool path support it
Journey fit
Visual assets are produced while shaping the product surface; Pixel Surgeon fits Build when UI mocks, hero art, and in-app media are created alongside code. The Frontend subphase covers marketing visuals, app screenshots, and iterative creative edits tied to shipped interfaces.
How it compares
Pixel Surgeon is a multi-vendor generative media MCP server, not a Figma or Canva design skill.
Common Questions / FAQ
Who is Pixel Surgeon for?
Solo builders and indie hackers who ship their own UI and marketing and want Gemini, OpenAI, and Grok media tools inside Claude Code or Cursor.
When should I use Pixel Surgeon?
Use it while building or polishing frontend-facing assets (heroes, icons, ads, short videos) or when you need region-level image repair before launch.
How do I add Pixel Surgeon to my agent?
Add the npm package pixel-surgeon-mcp to your MCP settings as a stdio server, set GOOGLE_API_KEY, OPENAI_API_KEY, and/or XAI_API_KEY as secrets, then invoke the generation and edit tools.
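As a rough illustration of that setup, the sketch below prints a settings entry in the `mcpServers` shape that many MCP clients accept. Several details are assumptions rather than facts from this listing: that the package can be launched with `npx -y pixel-surgeon-mcp`, the server label `pixel-surgeon`, and the hypothetical helper `build_server_entry`. Check the package's own README for the real launch command.

```python
import json


def build_server_entry(keys):
    """Build an mcpServers entry, passing through only non-empty API keys."""
    env = {name: value for name, value in keys.items() if value}
    return {
        "pixel-surgeon": {  # arbitrary server label (assumption)
            "command": "npx",  # assumes the package is runnable via npx
            "args": ["-y", "pixel-surgeon-mcp"],
            "env": env,
        }
    }


if __name__ == "__main__":
    # Placeholder values only; load real secrets from your secret store.
    config = {
        "mcpServers": build_server_entry(
            {
                "GOOGLE_API_KEY": "",
                "OPENAI_API_KEY": "your-openai-key",
                "XAI_API_KEY": "",
            }
        )
    }
    print(json.dumps(config, indent=2))
```

The printed JSON can be merged into your client's MCP settings file; where that file lives depends on the client. Leaving a key empty simply omits that vendor from the entry.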