Mcp Server

Name: Mcp Server
Author: iris-eval

iris-eval/mcp-server

Score agent and MCP tool outputs for quality, safety, and cost before you ship prompts or workflows to production.

Overview

io.github.iris-eval/mcp-server is an MCP server for the Ship phase that scores agent outputs for quality, safety, and cost using a consistent eval standard.

What is this MCP server?

Positions itself as an agent eval standard wired for MCP workflows
Scores every agent output on quality, safety, and cost
Ships as the stdio npm package @iris-eval/mcp-server, version 0.4.2
Accepts an optional secret through the IRIS_API_KEY environment variable
Persists runs to a SQLite database at the configurable IRIS_DB_PATH
Sets log verbosity through IRIS_LOG_LEVEL
Fits CI or pre-ship checks on agent replies and tool results
Repository: github.com/iris-eval/mcp-server

Compatible agents: Claude Code, Cursor, Codex, and any other MCP-compatible agent

Community signal: 7 GitHub stars.

What problem does it solve?

Without a repeatable scoring layer on every output, you cannot tell whether an agent change improved replies, kept them safe, or blew the budget.

Who is it for?

Indie builders shipping agent features or MCP-heavy workflows who want SQLite-backed eval history and optional API-key auth before promoting prompts to users.

Skip if: your team only needs manual code review with no LLM output scoring, or you are still in pure ideation with no agent pipeline to measure.

What do I get? / Deliverables

After you register the server, your agent can run structured eval passes and store scores locally so shipping decisions rest on measured quality, safety, and cost—not vibes.

Per-output quality, safety, and cost scores via MCP tools
Local eval history in SQLite when IRIS_DB_PATH is configured
Run logging with verbosity set by IRIS_LOG_LEVEL
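
To make "shipping decisions rest on measured quality, safety, and cost" concrete, here is a minimal sketch of a pre-ship gate. The listing does not document the server's score format, so everything here is an assumption for illustration: the EvalScore fields, the 0 to 1 scales, the dollar cost unit, and the threshold values are all hypothetical, not the server's actual output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalScore:
    # Hypothetical shape: the listing does not specify the real score format.
    quality: float   # assumed scale: 0.0 (worst) to 1.0 (best)
    safety: float    # assumed scale: 0.0 (worst) to 1.0 (best)
    cost_usd: float  # assumed unit: US dollars per output


@dataclass(frozen=True)
class Gate:
    # Illustrative thresholds; pick values that suit your own pipeline.
    min_quality: float = 0.8
    min_safety: float = 0.9
    max_cost_usd: float = 0.05


def gate_failures(score: EvalScore, gate: Gate = Gate()) -> list[str]:
    """Return a human-readable reason for each threshold the score misses."""
    failures = []
    if score.quality < gate.min_quality:
        failures.append(f"quality {score.quality} is below {gate.min_quality}")
    if score.safety < gate.min_safety:
        failures.append(f"safety {score.safety} is below {gate.min_safety}")
    if score.cost_usd > gate.max_cost_usd:
        failures.append(f"cost {score.cost_usd} exceeds {gate.max_cost_usd}")
    return failures


if __name__ == "__main__":
    reasons = gate_failures(EvalScore(quality=0.5, safety=0.95, cost_usd=0.10))
    print("ship" if not reasons else "hold: " + "; ".join(reasons))
```

Returning the list of reasons, rather than a bare pass or fail, lets a CI step print why a run was held back.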

Recommended MCP Servers

0Latency Memory

0Latency Memory is a hosted MCP server that gives AI agents a persistent memory layer with fast recall, semantic search,…

0nMCP — Universal AI API Orchestrator (0nork/0nMCP)

0nMCP is a Universal AI API Orchestrator MCP server aimed at solo builders who would otherwise register a long list of p…

0xHumans Protocol MCP (DavidOrpeli/0xhumans-mcp-proxy)

io.github.DavidOrpeli/0xhumans-mcp is a Model Context Protocol offering for the 0xHumans Protocol, aimed at AI agents th…

1k Patient Mcp

The 1k Patient MCP server is a hosted Model Context Protocol endpoint described as serving on the order of one thousand …

1trippulse (gkcogz/OneTrip-Beta)

1trip PULSE is a travel-focused MCP server that packages twenty-one planning tools—flights, hotels, visa guidance, safet…

4bots Content

io.github.davidsiegel59/4bots-content is a remote MCP server that supplies daily, channelized content for AI agents buil…

Journey fit

Primary fit

Output scoring and eval gates belong on the ship shelf because they validate behavior right before or after release, not while you are still drafting product ideas. Testing is the canonical subphase for systematic pass/fail scoring of model outputs rather than one-off debugging in operate.

How it compares

MCP eval scoring server, not a prompt-writing or brainstorming skill.

Common Questions / FAQ

Who is io.github.iris-eval/mcp-server for?

Solo and small-team builders who ship AI agents and want MCP-native tools to grade output quality, safety, and cost before release.

When should I use io.github.iris-eval/mcp-server?

Use it during ship and testing when you compare prompt versions, regression fixes, or new tools and need comparable scores on every run.
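
As a rough sketch of what "comparable scores on every run" can look like when comparing two prompt versions, the snippet below averages each dimension per version and reports the change. The record keys (quality, safety, cost_usd) and the sample numbers are assumptions made up for illustration; the listing does not describe how the server returns or stores scores.

```python
from statistics import mean

# Hypothetical dimension names; the real server output may differ.
DIMENSIONS = ("quality", "safety", "cost_usd")


def mean_scores(runs: list[dict]) -> dict:
    """Average each dimension across a list of per-output score records."""
    return {dim: mean(run[dim] for run in runs) for dim in DIMENSIONS}


def compare_versions(baseline: list[dict], candidate: list[dict]) -> dict:
    """Return candidate mean minus baseline mean for each dimension."""
    base, cand = mean_scores(baseline), mean_scores(candidate)
    return {dim: cand[dim] - base[dim] for dim in DIMENSIONS}


if __name__ == "__main__":
    # Made-up sample data standing in for two prompt versions.
    prompt_v1 = [
        {"quality": 0.5, "safety": 1.0, "cost_usd": 0.25},
        {"quality": 1.0, "safety": 1.0, "cost_usd": 0.25},
    ]
    prompt_v2 = [
        {"quality": 1.0, "safety": 1.0, "cost_usd": 0.125},
        {"quality": 1.0, "safety": 1.0, "cost_usd": 0.125},
    ]
    for dim, delta in compare_versions(prompt_v1, prompt_v2).items():
        print(f"{dim}: {delta:+.3f}")
```

A positive delta is an improvement for quality and safety, while a negative delta is the improvement for cost.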

How do I add io.github.iris-eval/mcp-server to my agent?

Add the stdio MCP entry for npm package @iris-eval/mcp-server v0.4.2 in Claude Code or Cursor, set IRIS_API_KEY if required, and configure IRIS_DB_PATH and IRIS_LOG_LEVEL as needed.
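
The registration step above can be sketched as a small script that prints an mcpServers-style JSON entry of the kind MCP clients such as Cursor and Claude Code read from their project config files. Several details are assumptions rather than documented facts about this package: launching it through npx, the server key "iris-eval", and the placeholder values for the database path and log level. Check the repository README for the exact command and accepted values.

```python
import json

PACKAGE = "@iris-eval/mcp-server@0.4.2"  # package name and version from the listing


def build_mcp_entry(api_key=None, db_path=None, log_level=None):
    """Build an mcpServers-style config dict for the stdio package.

    Only environment variables that are actually provided are emitted,
    since the listing describes all three as optional or configurable.
    """
    env = {}
    if api_key:
        env["IRIS_API_KEY"] = api_key
    if db_path:
        env["IRIS_DB_PATH"] = db_path
    if log_level:
        env["IRIS_LOG_LEVEL"] = log_level

    # Assumption: the package is started with npx, a common pattern for
    # npm-distributed stdio servers; this listing does not confirm it.
    entry = {"command": "npx", "args": ["-y", PACKAGE]}
    if env:
        entry["env"] = env

    # "iris-eval" is a hypothetical label; clients accept any unique key.
    return {"mcpServers": {"iris-eval": entry}}


if __name__ == "__main__":
    # Placeholder values, not defaults documented by the project.
    config = build_mcp_entry(db_path="./iris.db", log_level="info")
    print(json.dumps(config, indent=2))
```

Keep IRIS_API_KEY out of version control: pass it from your shell or secret store instead of hard-coding it in a committed config file.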

