Generate Image

Name: Generate Image
Author: k-dense-ai

k-dense-ai/scientific-agent-skills

916 installs
31.9k repo stars
Updated July 28, 2026

This is a copy of generate-image by davila7 - installs and ranking accrue to the original listing.

generate-image is a Claude Code skill whose Python script generates and edits images from prompts or reference images through the OpenRouter API. It is aimed at developers who need programmatic image creation.

About

generate-image is a scientific-agent skill from k-dense-ai/scientific-agent-skills shipping a Python CLI for OpenRouter-backed image generation and editing. Supported models include google/gemini-3.1-flash-image-preview, black-forest-labs/flux.2-pro, and black-forest-labs/flux.2-flex, with editing via an input image plus prompt. The script checks for OPENROUTER_API_KEY in environment or .env files and outputs generated or edited image files. Developers reach for generate-image when agents or pipelines need reproducible image creation without wiring each model SDK manually.

Supports google/gemini-3.1-flash-image-preview for generation and editing
Supports black-forest-labs/flux.2-pro for generation and editing
Supports black-forest-labs/flux.2-flex for fast, lower-cost generation (no editing)
Automatic .env lookup for OPENROUTER_API_KEY in current or parent directories
Converts local images to base64 for seamless editing workflows

Generate Image by the numbers

916 all-time installs (skills.sh)
+49 installs in the week ending Jul 28, 2026 (Skillselion tracking)
Security screen: LOW risk (skills.sh audit)
Data as of Jul 28, 2026 (Skillselion catalog sync)

npx skills add https://github.com/k-dense-ai/scientific-agent-skills --skill generate-image


Installs	916
Repo stars	31.9k
Security audit	3 / 3 scanners passed
Last updated	July 28, 2026
Repository	k-dense-ai/scientific-agent-skills

How do you generate images with OpenRouter API?

Generate and edit images directly from prompts or reference images using top-tier models via OpenRouter.

Who is it for?

Developers and research agents needing scripted image generation or editing through OpenRouter with multiple model options.

Skip if: you run local-only diffusion pipelines, or your design workflow relies on manual Figma asset handoff rather than API generation.

When should I use this skill?

User needs to generate or edit images programmatically using OpenRouter models from prompts or reference images.

What you get

Generated or edited image files from OpenRouter model API calls

generated image file
edited image file

Files

SKILL.md

Generate Image

Generate and edit high-quality images using OpenRouter's image generation models including FLUX.2 Pro and Gemini 3.1 Flash Image Preview.

When to Use This Skill

Use generate-image for:

Photos and photorealistic images
Artistic illustrations and artwork
Concept art and visual concepts
Visual assets for presentations or documents
Image editing and modifications
Any general-purpose image generation needs

Use scientific-schematics instead for:

Flowcharts and process diagrams
Circuit diagrams and electrical schematics
Biological pathways and signaling cascades
System architecture diagrams
CONSORT diagrams and methodology flowcharts
Any technical/schematic diagrams

Quick Start

Use the scripts/generate_image.py script to generate or edit images:

# Generate a new image
python scripts/generate_image.py "A beautiful sunset over mountains"

# Edit an existing image
python scripts/generate_image.py "Make the sky purple" --input photo.jpg

This generates/edits an image and saves it as generated_image.png in the current directory.

API Key Setup

CRITICAL: The script requires an OpenRouter API key. Before running, check if the user has configured their API key:

1. Look for a .env file in the project directory or parent directories
2. Check for OPENROUTER_API_KEY=<key> in the .env file
3. If not found, inform the user they need to:

Create a .env file with OPENROUTER_API_KEY=your-api-key-here
Or set the environment variable: export OPENROUTER_API_KEY=your-api-key-here
Get an API key from: https://openrouter.ai/keys

The script will automatically detect the .env file and provide clear error messages if the API key is missing.
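The lookup described above can be sketched roughly as follows. This is a minimal illustration rather than the script's exact code: `find_api_key` is a hypothetical helper, and checking the environment variable before any .env file is an assumed precedence.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def find_api_key(start_dir: Path, environ: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return OPENROUTER_API_KEY from the environment or the nearest .env file.

    Hypothetical helper; the precedence (environment first) is an assumption.
    """
    key = environ.get("OPENROUTER_API_KEY")
    if key:
        return key
    # Walk from start_dir up to the filesystem root looking for a .env file.
    for directory in [start_dir, *start_dir.parents]:
        env_file = directory / ".env"
        if not env_file.is_file():
            continue
        for line in env_file.read_text(encoding="utf-8").splitlines():
            if line.startswith("OPENROUTER_API_KEY="):
                # Drop surrounding whitespace and optional quotes around the value.
                value = line.split("=", 1)[1].strip().strip('"').strip("'")
                if value:
                    return value
    return None
```

Calling `find_api_key(Path.cwd())` returns the key or `None`, so the caller decides how to report a missing key.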

Model Selection

Default model: google/gemini-3.1-flash-image-preview (high quality, recommended)

Available models for generation and editing:

google/gemini-3.1-flash-image-preview - High quality, supports generation + editing
black-forest-labs/flux.2-pro - Fast, high quality, supports generation + editing

Generation only:

black-forest-labs/flux.2-flex - Fast and cheap, but not as high quality as pro

Select based on:

Quality: Use gemini-3.1-flash-image-preview or flux.2-pro
Editing: Use gemini-3.1-flash-image-preview or flux.2-pro (both support image editing)
Cost: Use flux.2-flex for generation only
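The selection criteria above could be encoded as a small lookup. This is only a sketch: `pick_model` and `supports_editing` are hypothetical helpers that are not part of the skill, and treating unknown model IDs as generation-only is an assumption.

```python
# Capabilities as listed in the Model Selection section.
MODELS = {
    "google/gemini-3.1-flash-image-preview": {"editing": True},
    "black-forest-labs/flux.2-pro": {"editing": True},
    "black-forest-labs/flux.2-flex": {"editing": False},
}
DEFAULT_MODEL = "google/gemini-3.1-flash-image-preview"
BUDGET_MODEL = "black-forest-labs/flux.2-flex"


def supports_editing(model_id: str) -> bool:
    """Assumption: models not listed above are treated as generation-only."""
    return MODELS.get(model_id, {}).get("editing", False)


def pick_model(editing: bool = False, prefer_low_cost: bool = False) -> str:
    """Hypothetical helper: choose a model ID from the criteria above."""
    # The budget model does not edit, so it is only chosen for pure generation.
    if prefer_low_cost and not editing:
        return BUDGET_MODEL
    return DEFAULT_MODEL
```

The returned ID can be passed to the script through `--model`.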

Common Usage Patterns

Basic generation

python scripts/generate_image.py "Your prompt here"

Specify model

python scripts/generate_image.py "A cat in space" --model "black-forest-labs/flux.2-pro"

Custom output path

python scripts/generate_image.py "Abstract art" --output artwork.png

Edit an existing image

python scripts/generate_image.py "Make the background blue" --input photo.jpg

Edit with a specific model

python scripts/generate_image.py "Add sunglasses to the person" --input portrait.png --model "black-forest-labs/flux.2-pro"

Edit with custom output

python scripts/generate_image.py "Remove the text from the image" --input screenshot.png --output cleaned.png

Multiple images

Run the script multiple times with different prompts or output paths:

python scripts/generate_image.py "Image 1 description" --output image1.png
python scripts/generate_image.py "Image 2 description" --output image2.png
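For more than a handful of images, the repeated invocations can be generated from a mapping of output paths to prompts. The sketch below only builds and prints the commands; `build_commands` is a hypothetical helper, and the script path assumes the command is run from the skill's root directory.

```python
import shlex
import sys
from typing import Dict, List


def build_commands(jobs: Dict[str, str], script: str = "scripts/generate_image.py") -> List[List[str]]:
    """Return one argv list per job, where jobs maps output path -> prompt."""
    return [
        [sys.executable, script, prompt, "--output", output]
        for output, prompt in jobs.items()
    ]


jobs = {
    "image1.png": "Image 1 description",
    "image2.png": "Image 2 description",
}
for argv in build_commands(jobs):
    # Print the command; pass argv to subprocess.run(argv, check=True) to execute it.
    print(shlex.join(argv[1:]))
```

Passing an argv list rather than a shell string avoids quoting problems when prompts contain quotes or other shell metacharacters.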

Script Parameters

prompt (required): Text description of the image to generate, or editing instructions
--input or -i: Input image path for editing (enables edit mode)
--model or -m: OpenRouter model ID (default: google/gemini-3.1-flash-image-preview)
--output or -o: Output file path (default: generated_image.png)
--api-key: OpenRouter API key (overrides .env file)

Example Use Cases

For Scientific Documents

# Generate a conceptual illustration for a paper
python scripts/generate_image.py "Microscopic view of cancer cells being attacked by immunotherapy agents, scientific illustration style" --output figures/immunotherapy_concept.png

# Create a visual for a presentation
python scripts/generate_image.py "DNA double helix structure with highlighted mutation site, modern scientific visualization" --output slides/dna_mutation.png

For Presentations and Posters

# Title slide background
python scripts/generate_image.py "Abstract blue and white background with subtle molecular patterns, professional presentation style" --output slides/background.png

# Poster hero image
python scripts/generate_image.py "Laboratory setting with modern equipment, photorealistic, well-lit" --output poster/hero.png

For General Visual Content

# Website or documentation images
python scripts/generate_image.py "Professional team collaboration around a digital whiteboard, modern office" --output docs/team_collaboration.png

# Marketing materials
python scripts/generate_image.py "Futuristic AI brain concept with glowing neural networks" --output marketing/ai_concept.png

Error Handling

The script provides clear error messages for:

Missing API key (with setup instructions)
API errors (with status codes)
Unexpected response formats
Missing dependencies (requests library)

If the script fails, read the error message and address the issue before retrying.

Notes

Images are returned as base64-encoded data URLs, decoded, and written to the output path (generated_image.png by default)
The script supports both images and content response formats from different OpenRouter models
Generation time varies by model (typically 5-30 seconds)
For image editing, the input image is encoded as base64 and sent to the model
Supported input image formats: PNG, JPEG, GIF, WebP
Check OpenRouter pricing for cost information: https://openrouter.ai/models
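The two response shapes and the data-URL decoding mentioned in these notes can be illustrated in isolation. This is a sketch that mirrors the logic of the bundled script; `extract_image_url` and `decode_data_url` are hypothetical names, and the sample message is synthetic rather than a real API response.

```python
import base64
from typing import Optional


def extract_image_url(message: dict) -> Optional[str]:
    """Return the first image URL from either response shape, or None."""
    images = list(message.get("images") or [])
    content = message.get("content")
    if not images and isinstance(content, list):
        # Some models return content as a list of parts, some of which are images.
        images = [p for p in content if isinstance(p, dict) and p.get("type") == "image"]
    if not images:
        return None
    first = images[0]
    if "image_url" in first:
        return first["image_url"]["url"]
    return first.get("url")


def decode_data_url(data_url: str) -> bytes:
    """Strip an optional 'data:...;base64,' prefix and decode the payload."""
    payload = data_url.split(",", 1)[1] if "," in data_url else data_url
    return base64.b64decode(payload)


# Synthetic example: a tiny payload wrapped the way the notes describe.
raw = b"not a real image"
url = "data:image/png;base64," + base64.b64encode(raw).decode("ascii")
message = {"images": [{"image_url": {"url": url}}]}
assert decode_data_url(extract_image_url(message)) == raw
```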

Image Editing Tips

Be specific about what changes you want (e.g., "change the sky to sunset colors" vs "edit the sky")
Reference specific elements in the image when possible
For best results, use clear and detailed editing instructions
Both Gemini 3.1 Flash Image Preview and FLUX.2 Pro support image editing through OpenRouter
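For reference, an edit request pairs the instruction text with the input image in a single user message. The sketch below builds a request body in the same shape the bundled script sends; `build_edit_payload` is a hypothetical helper, and nothing is sent over the network.

```python
import base64
import json


def build_edit_payload(
    prompt: str,
    image_bytes: bytes,
    mime_type: str = "image/png",
    model: str = "google/gemini-3.1-flash-image-preview",
) -> dict:
    """Build a chat-completions body with a text part and an image part."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    data_url = f"data:{mime_type};base64,{encoded}"
    return {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": data_url}},
                ],
            }
        ],
        "modalities": ["image", "text"],
    }


payload = build_edit_payload("Change the sky to sunset colors", b"abc")
# Inspect the body that would be posted as JSON.
print(json.dumps(payload, indent=2))
```

Because the whole image travels inline as base64, large inputs make for large request bodies.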

Integration with Other Skills

scientific-schematics: Use for technical diagrams, flowcharts, circuits, pathways
generate-image: Use for photos, illustrations, artwork, visual concepts
scientific-slides: Combine with generate-image for visually rich presentations
latex-posters: Use generate-image for poster visuals and hero images

#!/usr/bin/env python3
"""
Generate and edit images using OpenRouter API with various image generation models.

Supports models like:
- google/gemini-3.1-flash-image-preview (generation and editing)
- black-forest-labs/flux.2-pro (generation and editing)
- black-forest-labs/flux.2-flex (generation)
- And more image generation models available on OpenRouter

For image editing, provide an input image along with an editing prompt.
"""

import sys
import json
import base64
import argparse
from pathlib import Path
from typing import Optional


def check_env_file() -> Optional[str]:
    """Check if .env file exists and contains OPENROUTER_API_KEY."""
    # Look for .env in current directory and parent directories
    current_dir = Path.cwd()
    for parent in [current_dir] + list(current_dir.parents):
        env_file = parent / ".env"
        if env_file.exists():
            with open(env_file, 'r') as f:
                for line in f:
                    if line.startswith('OPENROUTER_API_KEY='):
                        api_key = line.split('=', 1)[1].strip().strip('"').strip("'")
                        if api_key:
                            return api_key
    return None


def load_image_as_base64(image_path: str) -> str:
    """Load an image file and return it as a base64 data URL."""
    path = Path(image_path)
    if not path.exists():
        print(f"❌ Error: Image file not found: {image_path}")
        sys.exit(1)
    
    # Determine MIME type from extension
    ext = path.suffix.lower()
    mime_types = {
        '.png': 'image/png',
        '.jpg': 'image/jpeg',
        '.jpeg': 'image/jpeg',
        '.gif': 'image/gif',
        '.webp': 'image/webp',
    }
    mime_type = mime_types.get(ext, 'image/png')
    
    with open(path, 'rb') as f:
        image_data = f.read()
    
    base64_data = base64.b64encode(image_data).decode('utf-8')
    return f"data:{mime_type};base64,{base64_data}"


def save_base64_image(base64_data: str, output_path: str) -> None:
    """Save base64 encoded image to file."""
    # Remove data URL prefix if present
    if ',' in base64_data:
        base64_data = base64_data.split(',', 1)[1]

    # Decode and save
    image_data = base64.b64decode(base64_data)
    with open(output_path, 'wb') as f:
        f.write(image_data)


def generate_image(
    prompt: str,
    model: str = "google/gemini-3.1-flash-image-preview",
    output_path: str = "generated_image.png",
    api_key: Optional[str] = None,
    input_image: Optional[str] = None
) -> dict:
    """
    Generate or edit an image using OpenRouter API.

    Args:
        prompt: Text description of the image to generate, or editing instructions
        model: OpenRouter model ID (default: google/gemini-3.1-flash-image-preview)
        output_path: Path to save the generated image
        api_key: OpenRouter API key (will check .env if not provided)
        input_image: Path to an input image for editing (optional)

    Returns:
        dict: Response from OpenRouter API
    """
    try:
        import requests
    except ImportError:
        print("Error: 'requests' library not found. Install with: pip install requests")
        sys.exit(1)

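    # Check for API key: explicit argument, then environment variable, then .env file
    if not api_key:
        import os  # local import, like 'requests' above
        api_key = os.environ.get("OPENROUTER_API_KEY") or check_env_file()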

    if not api_key:
        print("❌ Error: OPENROUTER_API_KEY not found!")
        print("\nPlease create a .env file in your project directory with:")
        print("OPENROUTER_API_KEY=your-api-key-here")
        print("\nOr set the environment variable:")
        print("export OPENROUTER_API_KEY=your-api-key-here")
        print("\nGet your API key from: https://openrouter.ai/keys")
        sys.exit(1)

    # Determine if this is generation or editing
    is_editing = input_image is not None
    
    if is_editing:
        print(f"✏️ Editing image with model: {model}")
        print(f"📷 Input image: {input_image}")
        print(f"📝 Edit prompt: {prompt}")
        
        # Load input image as base64
        image_data_url = load_image_as_base64(input_image)
        
        # Build multimodal message content for image editing
        message_content = [
            {
                "type": "text",
                "text": prompt
            },
            {
                "type": "image_url",
                "image_url": {
                    "url": image_data_url
                }
            }
        ]
    else:
        print(f"🎨 Generating image with model: {model}")
        print(f"📝 Prompt: {prompt}")
        message_content = prompt

    # Make API request
    response = requests.post(
        url="https://openrouter.ai/api/v1/chat/completions",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        json={
            "model": model,
            "messages": [
                {
                    "role": "user",
                    "content": message_content
                }
            ],
            "modalities": ["image", "text"]
        }
    )

    # Check for errors
    if response.status_code != 200:
        print(f"❌ API Error ({response.status_code}): {response.text}")
        sys.exit(1)

    result = response.json()

    # Extract and save image
    if result.get("choices"):
        message = result["choices"][0]["message"]

        # Handle both 'images' and 'content' response formats
        images = []

        if message.get("images"):
            images = message["images"]
        elif message.get("content"):
            # Some models return content as array with image parts
            content = message["content"]
            if isinstance(content, list):
                for part in content:
                    if isinstance(part, dict) and part.get("type") == "image":
                        images.append(part)

        if images:
            # Save the first image
            image = images[0]
            if "image_url" in image:
                image_url = image["image_url"]["url"]
                save_base64_image(image_url, output_path)
                print(f"✅ Image saved to: {output_path}")
            elif "url" in image:
                save_base64_image(image["url"], output_path)
                print(f"✅ Image saved to: {output_path}")
            else:
                print(f"⚠️ Unexpected image format: {image}")
        else:
            print("⚠️ No image found in response")
            if message.get("content"):
                print(f"Response content: {message['content']}")
    else:
        print("❌ No choices in response")
        print(f"Response: {json.dumps(result, indent=2)}")

    return result


def main():
    parser = argparse.ArgumentParser(
        description="Generate or edit images using OpenRouter API",
        formatter_class=argparse.RawDescriptionHelpFormatter,
        epilog="""
Examples:
  # Generate with default model (Gemini 3.1 Flash Image Preview)
  python generate_image.py "A beautiful sunset over mountains"

  # Use a specific model
  python generate_image.py "A cat in space" --model "black-forest-labs/flux.2-pro"

  # Specify output path
  python generate_image.py "Abstract art" --output my_image.png

  # Edit an existing image
  python generate_image.py "Make the sky purple" --input photo.jpg --output edited.png

  # Edit with a specific model
  python generate_image.py "Add a hat to the person" --input portrait.png -m "black-forest-labs/flux.2-pro"

Popular image models:
  - google/gemini-3.1-flash-image-preview (default, high quality, generation + editing)
  - black-forest-labs/flux.2-pro (fast, high quality, generation + editing)
  - black-forest-labs/flux.2-flex (fast and cheap, generation only)
        """
    )

    parser.add_argument(
        "prompt",
        type=str,
        help="Text description of the image to generate, or editing instructions"
    )

    parser.add_argument(
        "--model", "-m",
        type=str,
        default="google/gemini-3.1-flash-image-preview",
        help="OpenRouter model ID (default: google/gemini-3.1-flash-image-preview)"
    )

    parser.add_argument(
        "--output", "-o",
        type=str,
        default="generated_image.png",
        help="Output file path (default: generated_image.png)"
    )

    parser.add_argument(
        "--input", "-i",
        type=str,
        help="Input image path for editing (enables edit mode)"
    )

    parser.add_argument(
        "--api-key",
        type=str,
        help="OpenRouter API key (will check .env if not provided)"
    )

    args = parser.parse_args()

    generate_image(
        prompt=args.prompt,
        model=args.model,
        output_path=args.output,
        api_key=args.api_key,
        input_image=args.input
    )


if __name__ == "__main__":
    main()

FAQ

Which image models does generate-image support?

generate-image supports OpenRouter models including google/gemini-3.1-flash-image-preview and black-forest-labs/flux.2-pro for both generation and editing, and black-forest-labs/flux.2-flex for generation only.

How does generate-image authenticate to OpenRouter?

generate-image checks for OPENROUTER_API_KEY in the environment or a .env file in the current or parent directory before making OpenRouter image API requests.

Is Generate Image safe to install?

skills.sh reports 3 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
