Ai Eval

Name: Ai Eval
Author: lazymac2x

lazymac2x/ai-eval-api

Run AI evaluation workflows from your coding agent via a hosted Cloudflare Workers MCP endpoint without wiring custom eval scripts locally.

Overview

io.github.lazymac2x/ai-eval is an MCP server for the Ship phase that exposes Cloudflare Workers-hosted AI evaluation tools to your agent over streamable HTTP.

What is this MCP server?

Remote streamable-http MCP at https://api.lazy-mac.com/ai-eval/mcp (no local worker deploy required to connect)
Server schema version 1.0.0 with GitHub source at lazymac2x/ai-eval-api
Cloudflare Workers-hosted MCP bridge so Claude Code, Cursor, and Codex can call eval tooling as tools
Fits solo builders who need repeatable AI output checks without building a separate eval dashboard first
Pairs with gateway, guardrails, and model-router in the same lazy-mac MCP suite for end-to-end agent stacks
MCP server version 1.0.0
1 remote endpoint (streamable-http)
Hosted on Cloudflare Workers via api.lazy-mac.com

Compatible agents: Claude Code, Cursor, Codex, and any other MCP-compatible agent

What problem does it solve?

You cannot trust an agent workflow you only eyeballed once, and you do not want to build and host a custom eval API just to score prompts and outputs from Claude Code or Cursor.

Who is it for?

Indie builders with MCP-capable agents who want a remote eval endpoint tied into ship-time testing without operating their own Workers project first.

Skip if: You need private on-prem eval pipelines, large labeled datasets, or detailed offline reporting with no external HTTP dependency.

What do I get? / Deliverables

After you register the remote MCP URL, your agent can invoke ai-eval tools on demand so testing and iteration cycles include structured evaluation instead of ad-hoc guesses.

Registered remote ai-eval MCP tools visible in your agent
Agent-invokable evaluation calls against the lazy-mac Workers API
Repeatable eval step you can run before ship and after prompt changes
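
To make "tools visible in your agent" concrete, the sketch below builds the kind of JSON-RPC 2.0 `tools/list` request an MCP client sends to discover a server's tools. It only constructs the request and does not send it. The endpoint URL comes from this listing; the helper name `build_tools_list_request` is hypothetical, and a real client performs the MCP `initialize` handshake first, which your agent handles for you. The listing does not name the individual ai-eval tools, so none are assumed here.

```python
import json
import urllib.request

# Endpoint URL as given in this listing.
AI_EVAL_URL = "https://api.lazy-mac.com/ai-eval/mcp"


def build_tools_list_request(request_id: int = 1) -> urllib.request.Request:
    """Build (but do not send) a JSON-RPC 2.0 `tools/list` request.

    Hypothetical helper for illustration; an MCP-capable agent issues
    this call itself after completing the `initialize` handshake.
    """
    payload = {"jsonrpc": "2.0", "id": request_id, "method": "tools/list"}
    return urllib.request.Request(
        AI_EVAL_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # A streamable HTTP server may reply with JSON or an SSE stream.
            "Accept": "application/json, text/event-stream",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_tools_list_request()
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

In normal use you never write this yourself; it is shown only to illustrate what the agent does once the remote URL is registered.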

Journey fit

Primary fit

The canonical shelf is Ship because measuring model and prompt quality belongs with testing and release confidence, even if you also run evals during build and post-launch iteration. Testing is the primary fit: ai-eval is for benchmarking outputs, regression checks, and pass-rate validation before you treat an agent workflow as production-ready.
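
As an illustration of what pass-rate validation and a regression check can look like, here is a minimal local sketch. It is not the ai-eval API: the `EvalResult` type, the function names, and the 0.02 tolerance are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class EvalResult:
    """One evaluated case (hypothetical shape, not the ai-eval schema)."""
    case_id: str
    passed: bool


def pass_rate(results: list[EvalResult]) -> float:
    """Fraction of cases that passed, between 0.0 and 1.0."""
    if not results:
        raise ValueError("no eval results to score")
    return sum(r.passed for r in results) / len(results)


def is_regression(current: float, baseline: float, tolerance: float = 0.02) -> bool:
    """True when the current pass rate falls below baseline minus tolerance."""
    return current < baseline - tolerance


if __name__ == "__main__":
    results = [
        EvalResult("greeting", True),
        EvalResult("refund-policy", True),
        EvalResult("edge-case", False),
        EvalResult("long-input", True),
    ]
    rate = pass_rate(results)
    print(f"pass rate: {rate:.2f}")
    print("regression:", is_regression(rate, baseline=0.80))
```

The tolerance keeps small run-to-run noise from blocking a release; choose a value that matches how stable your outputs are.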

How it compares

Remote MCP integration for AI evaluation, not an in-repo agent skill or a full local benchmark framework.

Common Questions / FAQ

Who is io.github.lazymac2x/ai-eval for?

Solo and indie builders using Claude Code, Cursor, Codex, or similar agents who want hosted AI eval tools reachable through standard MCP configuration.

When should I use io.github.lazymac2x/ai-eval?

Use it during Ship testing and whenever you change prompts, models, or tool chains and need your agent to run evaluation calls before you ship or iterate in production.

How do I add io.github.lazymac2x/ai-eval to my agent?

Add a remote MCP server entry pointing at https://api.lazy-mac.com/ai-eval/mcp with type streamable-http in your client’s MCP config, then restart or reload the agent so tools from ai-eval appear.
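
The registration step can be sketched as a small script that prints a config entry. The URL and the streamable-http type come from this listing; the `mcpServers`, `type`, and `url` key names and the `ai-eval` entry name are assumptions, since each client defines its own config schema. Check your client's documentation before pasting the output.

```python
import json

# Endpoint URL as given in this listing.
AI_EVAL_URL = "https://api.lazy-mac.com/ai-eval/mcp"


def build_mcp_config(server_name: str = "ai-eval") -> dict:
    """Return a config dict with one remote MCP server entry.

    The "mcpServers", "type", and "url" key names are assumptions;
    your client may expect a different schema.
    """
    return {
        "mcpServers": {
            server_name: {
                "type": "streamable-http",
                "url": AI_EVAL_URL,
            }
        }
    }


if __name__ == "__main__":
    # Print JSON to adapt for the client's MCP config file.
    print(json.dumps(build_mcp_config(), indent=2))
```

After adding the entry, restart or reload the agent and confirm that tools from ai-eval appear in its tool list.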
