Vision Squeezer

Name: Vision Squeezer
Author: eralpozcan

eralpozcan/vision-squeezer

Preprocess screenshots and UI captures so vision-model tile billing stays predictable before you ship agent features that analyze images.

Overview

Vision Squeezer is a Ship-phase MCP server that cuts vision API token costs by snapping images to provider tile boundaries for Claude, GPT-4o, GPT-5, and Gemini workflows.

What is this MCP server?

Snaps image dimensions to vision provider tile boundaries to reduce billed tokens
Targets multimodal stacks using Claude, GPT-4o, GPT-5, and Gemini vision pricing models
npm stdio package vision-squeezer at version 0.1.1 for local MCP wiring
Runs as a focused preprocessing MCP rather than a full image editor or CDN
GitHub-hosted eralpozcan/vision-squeezer for indie agent cost control
npm package identifier vision-squeezer at version 0.1.1
stdio transport only in published registry packages block
Documented compatibility with Claude, GPT-4o, GPT-5, and Gemini vision stacks

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Community signal: 2 GitHub stars.

What problem does it solve?

Multimodal agents burn budget sending oversized screenshots that providers bill as extra vision tiles.

Who is it for?

Indie builders shipping agent features that analyze UI screenshots or photos and need predictable vision API spend.

Skip if: Products with no vision models, teams needing lossless print workflows, or builders who only use text-only LLMs.

What do I get? / Deliverables

Preprocessed images align to tile grids so each vision call uses fewer tokens without you hand-resizing every capture.

Tile-aligned image payloads sized for major vision providers
Lower vision token usage on repeated screenshot or photo analysis runs
Local preprocessing step plug-in before primary multimodal MCP or API calls

Recommended MCP Servers

0Latency Memory

0Latency Memory is a hosted MCP server that gives AI agents a persistent memory layer with fast recall, semantic search,…

0nMCP — Universal AI API Orchestrator0nork/0nMCP

0nMCP is a Universal AI API Orchestrator MCP server aimed at solo builders who would otherwise register a long list of p…

0xHumans Protocol MCPDavidOrpeli/0xhumans-mcp-proxy

io.github.DavidOrpeli/0xhumans-mcp is a Model Context Protocol offering for the 0xHumans Protocol, aimed at AI agents th…

1k Patient Mcp

The 1k Patient MCP server is a hosted Model Context Protocol endpoint described as serving on the order of one thousand …

1trippulsegkcogz/OneTrip-Beta

1trip PULSE is a travel-focused MCP server that packages twenty-one planning tools—flights, hotels, visa guidance, safet…

4bots Content

io.github.davidsiegel59/4bots-content is a remote MCP server that supplies daily, channelized content for AI agents buil…

Journey fit

Primary fit

Vision token cost is a ship-time performance and unit-economics concern once multimodal features exist, not an idea-phase research toy. Perf fits because the server optimizes image dimensions against provider tile boundaries to cut vision API token spend.

How it compares

Vision cost preprocessor MCP, not a full image CDN or creative editing skill.

Common Questions / FAQ

Who is Vision Squeezer for?

Solo developers running MCP agents that call Claude, GPT-4o, GPT-5, or Gemini vision APIs and want to trim token usage on image inputs.

When should I use Vision Squeezer?

Use it in ship and operate loops once multimodal features are live and screenshot or photo payloads are showing up on usage dashboards.

How do I add Vision Squeezer to my agent?

Install the npm package vision-squeezer and register the stdio MCP server in your Claude Code, Cursor, or compatible client config.