Token Compressor

Name: Token Compressor
Author: base76-research-lab

base76-research-lab/token-compressor

Shrink long system prompts and tool instructions before they hit a paid API, without stripping conditionals your agent relies on.

Overview

Token Compressor is a MCP server for the Build phase that shrinks agent prompts about 40–60% with local LLM compression and embedding validation while preserving conditionals.

What is this MCP server?

Targets 40–60% token reduction on verbose prompts using a local LLM pipeline
Embedding validation checks that compressed text still matches original intent
Preserves conditionals and branching logic instead of naive summarization
Stdio MCP package on PyPI (token-compressor-mcp) for Claude Code–style clients
Keeps sensitive prompt text on your machine when compression runs locally
Advertised 40–60% prompt compression range
Server version 0.1.1 on PyPI identifier token-compressor-mcp
Stdio transport; local LLM plus embedding validation per description

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Community signal: 8 GitHub stars.

What problem does it solve?

Long agent prompts burn context windows and API spend on every turn, and naive summarization silently drops the if/when rules that keep your agent safe.

Who is it for?

Solo builders running Claude Code or similar agents with repetitive, rule-heavy system prompts who want local, validation-backed compression before scale.

Skip if: Teams that need cloud-only MCP with no local LLM setup, or anyone who can solve the problem by deleting unused tools and skills instead of compressing text.

What do I get? / Deliverables

After you register the server, you can compress bulky prompts locally, validate semantic fidelity, and send leaner context to your agent without rewriting every conditional by hand.

Shorter prompt variants validated against the original meaning
Preserved conditional and branching language in compressed output
Repeatable compression step in your agent prep pipeline

Recommended MCP Servers

0Latency Memory

0Latency Memory is a hosted MCP server that gives AI agents a persistent memory layer with fast recall, semantic search,…

0nMCP — Universal AI API Orchestrator0nork/0nMCP

0nMCP is a Universal AI API Orchestrator MCP server aimed at solo builders who would otherwise register a long list of p…

0xHumans Protocol MCPDavidOrpeli/0xhumans-mcp-proxy

io.github.DavidOrpeli/0xhumans-mcp is a Model Context Protocol offering for the 0xHumans Protocol, aimed at AI agents th…

1k Patient Mcp

The 1k Patient MCP server is a hosted Model Context Protocol endpoint described as serving on the order of one thousand …

1trippulsegkcogz/OneTrip-Beta

1trip PULSE is a travel-focused MCP server that packages twenty-one planning tools—flights, hotels, visa guidance, safet…

4bots Content

io.github.davidsiegel59/4bots-content is a remote MCP server that supplies daily, channelized content for AI agents buil…

Journey fit

Primary fit

BuildAgent skills & templates

Prompt compression sits where solo builders wire agents and skills—right before context is sent on every turn. Agent-tooling is the shelf for MCP servers that change how much context your coding agent carries, not for app feature code.

How it compares

MCP compression utility for prompts—not an agent skill and not a hosted chat product.

Common Questions / FAQ

Who is Token Compressor for?

It is for indie developers and agent builders who maintain large prompts or skill instructions and want measurable token savings without trusting blind summarization.

When should I use Token Compressor?

Use it when you are stabilizing agent-tooling—before you ship a long system prompt to production or when context limits start blocking multi-skill workflows.

How do I add Token Compressor to my agent?

Install the PyPI package token-compressor-mcp, configure stdio transport in your MCP client (for example Claude Code), and invoke its compress tools on prompt text you already use in sessions.

What is this MCP server?

Targets 40–60% token reduction on verbose prompts using a local LLM pipeline

Embedding validation checks that compressed text still matches original intent

Preserves conditionals and branching logic instead of naive summarization

Stdio MCP package on PyPI (token-compressor-mcp) for Claude Code–style clients

Keeps sensitive prompt text on your machine when compression runs locally

Advertised 40–60% prompt compression range

Server version 0.1.1 on PyPI identifier token-compressor-mcp

Stdio transport; local LLM plus embedding validation per description

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Community signal: 8 GitHub stars.

Who is it for?

Solo builders running Claude Code or similar agents with repetitive, rule-heavy system prompts who want local, validation-backed compression before scale.

Skip if: Teams that need cloud-only MCP with no local LLM setup, or anyone who can solve the problem by deleting unused tools and skills instead of compressing text.

What do I get? / Deliverables

After you register the server, you can compress bulky prompts locally, validate semantic fidelity, and send leaner context to your agent without rewriting every conditional by hand.

Shorter prompt variants validated against the original meaning

Preserved conditional and branching language in compressed output

Repeatable compression step in your agent prep pipeline

Journey fit

Primary fit

BuildAgent skills & templates

Overview

What is this MCP server?

What problem does it solve?

Who is it for?

What do I get? / Deliverables

Recommended MCP Servers

Journey fit

Who is Token Compressor for?

When should I use Token Compressor?

How do I add Token Compressor to my agent?

This week for builders

Overview

What is this MCP server?

What problem does it solve?

Who is it for?

What do I get? / Deliverables

Recommended MCP Servers

Journey fit

Who is Token Compressor for?

When should I use Token Compressor?

How do I add Token Compressor to my agent?