Videoagent Image Studio

Name: Videoagent Image Studio
Author: pexoai

pexoai/pexo-skills

Generate still images for video-agent workflows through a single script that polls providers and returns a unified success/imageUrl payload.

Overview

Videoagent Image Studio is an agent skill for the Build phase that generates images via fal.ai and Legnext-backed models with one-shot script execution and a unified JSON result object.

Install

npx skills add https://github.com/pexoai/pexo-skills --skill videoagent-image-studio

What is this skill?

v2.0.0 simplified SKILL.md: run script once—internal polling, no manual async/poll steps
Unified result shape: success, model, imageUrl, images, jobId across providers
Model table covers Flux, SDXL, Nano Banana, Ideogram, Recraft (fal.ai) and Midjourney (Legnext)
Optional hosted or self-hosted proxy with IMAGE_STUDIO_PROXY_URL and pro keys
Env-driven FAL_KEY, LEGNEXT_KEY, and VALID_PRO_KEYS for self-hosted access control
v2.0.0 release (2026-03-03)
Unified 5-field output shape: success, model, imageUrl, images, jobId

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Adoption & trust: 5.4k installs on skills.sh; 732 GitHub stars; 3/3 security scanners passed (skills.sh audits).

What problem does it solve?

Your video agent needs images from several providers but async job polling and inconsistent response shapes slow automation.

Who is it for?

Builders composing Pexo-style video agents who want multi-model image generation with env-based keys and optional self-hosted proxy control.

Skip if: Teams that only need a single manual image in a design tool, or workflows with no API budget and no willingness to manage FAL_KEY/LEGNEXT_KEY secrets.

When should I use this skill?

User or video-agent workflow needs generated still images from configured fal.ai or Legnext models via the image studio script or proxy.

What do I get? / Deliverables

You run one command path and receive a consistent { success, model, imageUrl, images, jobId } payload ready for the next video-agent step.

Generated image URL(s)
Structured job result with model and jobId

Recommended Skills

Video Editagentspace-so/runcomfy-agent-skills

Video Edit is a RunComfy-focused agent skill that acts as a smart router between your edit intent and the correct model …211k installs·15 stars

Image To Videoagentspace-so/runcomfy-agent-skills

Image-to-Video on RunComfy picks the right i2v model for each intent—HappyHorse for general animation, Wan 2.7 with audi…210k installs·15 stars

Image Editagentspace-so/runcomfy-agent-skills

Image Edit is a RunComfy Pro Pack agent skill that acts as a smart router between your edit intent and the right model i…210k installs·15 stars

Flux Kontextagentspace-so/runcomfy-agent-skills

Flux Kontext Pro on RunComfy packages Black Forest Labs' precise local edit model with documented prompting patterns and…210k installs·15 stars

Nano Banana 2agentspace-so/runcomfy-agent-skills

Nano Banana 2 on RunComfy wraps Google's Gemini-family flash text-to-image model with prompting patterns for fast iterat…210k installs·15 stars

Nano Banana Editagentspace-so/runcomfy-agent-skills

Nano Banana Edit on RunComfy documents Google's image-to-image edit endpoint for identity-preserving changes, background…210k installs·15 stars

Journey fit

Primary fit

BuildAgent skills & templates

Image generation for agents is Build-time tooling that feeds creative pipelines before ship-ready media packaging. Agent-tooling fits CLI/script skills that wrap external model APIs for automated creative steps.

How it compares

Scripted multi-provider image generator for agents—not a browser-only design skill or a general MCP catalog entry.

Common Questions / FAQ

Who is videoagent-image-studio for?

Solo builders and small teams wiring automated video or creative agents that must fetch stills from Flux, SDXL, Ideogram, Recraft, Nano Banana, or Midjourney-backed APIs.

When should I use videoagent-image-studio?

Use it in Build while assembling agent-tooling for generative pipelines—storyboards, scene stills, or promo frames—when you want one run-and-wait flow instead of a 3-step async poll recipe.

Is videoagent-image-studio safe to install?

It expects API keys and optional proxy auth; review the Security Audits panel on this page and never commit FAL_KEY, LEGNEXT_KEY, or VALID_PRO_KEYS into a public repo.

SKILL.md

READMESKILL.md - Videoagent Image Studio

# ── Client-side (optional) ──────────────────────────────────────────────────
# Custom proxy URL — leave empty to use the default hosted proxy
# IMAGE_STUDIO_PROXY_URL=https://your-proxy.vercel.app/api/image

# Pro access key for custom proxy authentication
# IMAGE_STUDIO_API_KEY=your_pro_key_here

# ── Server-side (only needed if self-hosting the proxy) ─────────────────────
# fal.ai API Key — required for Flux, SDXL, Nano Banana, Ideogram, Recraft
# Get it at: https://fal.ai/dashboard/keys
# FAL_KEY=your_fal_key_here

# Legnext.ai API Key — required for Midjourney
# Get it at: https://legnext.ai/dashboard
# LEGNEXT_KEY=your_legnext_key_here

# Comma-separated list of valid pro keys for access control (leave empty for open access)
# VALID_PRO_KEYS=key1,key2,key3


# Changelog

All notable changes to this project will be documented in this file.

The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## [2.0.0] - 2026-03-03

### Changed
- **Simplified SKILL.md**: Removed the complex 3-step async/poll workflow from the main instructions. The script already handles all polling internally — the SKILL.md now reflects this with a single, clean "run and get result" pattern.
- **Unified output format**: All models now return a consistent `{ success, model, imageUrl, images, jobId }` shape, making it easier to handle results uniformly.
- **Clearer model selection table**: Added "Speed" column so agents can make better trade-off decisions.
- **Added "Use when" trigger**: SKILL.md now starts with a clear activation condition so the agent knows exactly when to invoke this skill.
- **Documented `--reference-images` for Nano Banana**: Pass comma-separated URLs for character/style consistency across sequential image generations.

---

## [1.3.0] - 2026-02-25

### Added
- **Non-blocking async mode for Midjourney** (`--async` flag). Submit a job and return immediately with `job_id`, without waiting for completion. This prevents the bot from being blocked while waiting for image generation.
- **Status poll mode** (`--poll --job-id <id>`). Check job status once and return immediately — no waiting. Returns `status: "completed"`, `"pending"`, `"processing"`, or `"failed"`.
- Updated SKILL.md with mandatory async workflow documentation. All Midjourney requests should now use `--async` + periodic `--poll` to avoid blocking the bot.

### Changed
- `--async` flag is supported for all Midjourney actions: `imagine`, `upscale`, `variation`, `reroll`.

---

## [1.2.0] - 2026-02-25

### Changed
- **Midjourney Turbo mode enabled by default.** The `--turbo` flag is now automatically appended to all Midjourney prompts, reducing generation time from ~30-60s to ~10-20s (requires Midjourney Pro or Mega subscription).
- Added `--mode` parameter: `turbo` (default), `fast`, `relax`.

---

## [1.1.0] - 2026-02-25

### Changed
- **Midjourney provider switched from TTAPI to Legnext.ai** for faster generation speed and higher stability.
- Environment variable renamed from `TTAPI_KEY` to `LEGNEXT_KEY`. Please update your OpenClaw config.
- Upscale now supports `--upscale-type` parameter: `0` = Subtle (default), `1` = Creative.
- Variation now supports `--variation-type` parameter: `0` = Subtle (default), `1` = Strong.
- Added `--action reroll` support for Midjourney.
- Added `--action describe` support for Midjourney.
- Response now includes `imageUrls` array (4 individual image URLs) in addition to the grid `imageUrl`.

### Migration Guide
If you were using `TTAPI_KEY`, please:
1. Register at [legnext.ai](https://legnext.ai) and get your API key.
2. Update `~/.openclaw/openclaw.json`: rename `TTAPI_KEY` to `LEGNEXT_KEY` and set your new key.

---

## [1.0.0] - 2026-02-25

### Added
- Initial release of the unified image generation skill.
- **Midjourney** support via TTAPI (imagine, upscale U1-U4, variation V1-V4, reroll, zoom, pan).
- **Flux 1.1 Pro** su

What is this skill?

v2.0.0 simplified SKILL.md: run script once—internal polling, no manual async/poll steps

Unified result shape: success, model, imageUrl, images, jobId across providers

Model table covers Flux, SDXL, Nano Banana, Ideogram, Recraft (fal.ai) and Midjourney (Legnext)

Optional hosted or self-hosted proxy with IMAGE_STUDIO_PROXY_URL and pro keys

Env-driven FAL_KEY, LEGNEXT_KEY, and VALID_PRO_KEYS for self-hosted access control

v2.0.0 release (2026-03-03)

Unified 5-field output shape: success, model, imageUrl, images, jobId

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Adoption & trust: 5.4k installs on skills.sh; 732 GitHub stars; 3/3 security scanners passed (skills.sh audits).

SKILL.md

READMESKILL.md - Videoagent Image Studio

# ── Client-side (optional) ──────────────────────────────────────────────────
# Custom proxy URL — leave empty to use the default hosted proxy
# IMAGE_STUDIO_PROXY_URL=https://your-proxy.vercel.app/api/image

# Pro access key for custom proxy authentication
# IMAGE_STUDIO_API_KEY=your_pro_key_here

# ── Server-side (only needed if self-hosting the proxy) ─────────────────────
# fal.ai API Key — required for Flux, SDXL, Nano Banana, Ideogram, Recraft
# Get it at: https://fal.ai/dashboard/keys
# FAL_KEY=your_fal_key_here

# Legnext.ai API Key — required for Midjourney
# Get it at: https://legnext.ai/dashboard
# LEGNEXT_KEY=your_legnext_key_here

# Comma-separated list of valid pro keys for access control (leave empty for open access)
# VALID_PRO_KEYS=key1,key2,key3


# Changelog

All notable changes to this project will be documented in this file.

The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## [2.0.0] - 2026-03-03

### Changed
- **Simplified SKILL.md**: Removed the complex 3-step async/poll workflow from the main instructions. The script already handles all polling internally — the SKILL.md now reflects this with a single, clean "run and get result" pattern.
- **Unified output format**: All models now return a consistent `{ success, model, imageUrl, images, jobId }` shape, making it easier to handle results uniformly.
- **Clearer model selection table**: Added "Speed" column so agents can make better trade-off decisions.
- **Added "Use when" trigger**: SKILL.md now starts with a clear activation condition so the agent knows exactly when to invoke this skill.
- **Documented `--reference-images` for Nano Banana**: Pass comma-separated URLs for character/style consistency across sequential image generations.

---

## [1.3.0] - 2026-02-25

### Added
- **Non-blocking async mode for Midjourney** (`--async` flag). Submit a job and return immediately with `job_id`, without waiting for completion. This prevents the bot from being blocked while waiting for image generation.
- **Status poll mode** (`--poll --job-id <id>`). Check job status once and return immediately — no waiting. Returns `status: "completed"`, `"pending"`, `"processing"`, or `"failed"`.
- Updated SKILL.md with mandatory async workflow documentation. All Midjourney requests should now use `--async` + periodic `--poll` to avoid blocking the bot.

### Changed
- `--async` flag is supported for all Midjourney actions: `imagine`, `upscale`, `variation`, `reroll`.

---

## [1.2.0] - 2026-02-25

### Changed
- **Midjourney Turbo mode enabled by default.** The `--turbo` flag is now automatically appended to all Midjourney prompts, reducing generation time from ~30-60s to ~10-20s (requires Midjourney Pro or Mega subscription).
- Added `--mode` parameter: `turbo` (default), `fast`, `relax`.

---

## [1.1.0] - 2026-02-25

### Changed
- **Midjourney provider switched from TTAPI to Legnext.ai** for faster generation speed and higher stability.
- Environment variable renamed from `TTAPI_KEY` to `LEGNEXT_KEY`. Please update your OpenClaw config.
- Upscale now supports `--upscale-type` parameter: `0` = Subtle (default), `1` = Creative.
- Variation now supports `--variation-type` parameter: `0` = Subtle (default), `1` = Strong.
- Added `--action reroll` support for Midjourney.
- Added `--action describe` support for Midjourney.
- Response now includes `imageUrls` array (4 individual image URLs) in addition to the grid `imageUrl`.

### Migration Guide
If you were using `TTAPI_KEY`, please:
1. Register at [legnext.ai](https://legnext.ai) and get your API key.
2. Update `~/.openclaw/openclaw.json`: rename `TTAPI_KEY` to `LEGNEXT_KEY` and set your new key.

---

## [1.0.0] - 2026-02-25

### Added
- Initial release of the unified image generation skill.
- **Midjourney** support via TTAPI (imagine, upscale U1-U4, variation V1-V4, reroll, zoom, pan).
- **Flux 1.1 Pro** su

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Who is videoagent-image-studio for?

When should I use videoagent-image-studio?

Is videoagent-image-studio safe to install?

SKILL.md

This week for builders

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Who is videoagent-image-studio for?

When should I use videoagent-image-studio?

Is videoagent-image-studio safe to install?

SKILL.md