
WhichModel
Route autonomous agent workloads to cost-efficient LLMs using WhichModel recommendations over remote MCP.
Overview
WhichModel is a remote MCP server that delivers cost-optimized LLM routing recommendations to autonomous AI agents over HTTP, intended for use during the build phase.
What is this MCP server?
- Remote MCP URL: https://whichmodel.dev/mcp
- Transport type: streamable-http (hosted server)
- Server version: 1.0.0
- Purpose: cost-optimized LLM routing recommendations for autonomous agents
- Public site: https://whichmodel.dev (context alongside the MCP endpoint)
What problem does it solve?
Autonomous agents default to expensive models because builders lack real-time, task-aware routing advice inside the coding session.
Who is it for?
Indie builders running high-volume or multi-agent automations who need MCP-accessible cost routing without building their own router.
Skip if: you only run single-shot chat against one fixed model, or your team ignores cost and optimizes solely for maximum capability per call.
What do I get? / Deliverables
Your agent can query WhichModel for cheaper models that still fit the task while you ship and iterate on multi-step autonomous workflows.
- Agent-callable cost-optimized model routing recommendations
- Hosted MCP integration without local npm stdio install
- Integration details that match the WhichModel v1.0.0 remote server
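This listing does not name the tools WhichModel exposes, so an agent would normally discover them first and call one afterwards. The sketch below builds the two JSON-RPC messages MCP defines for that, `tools/list` and `tools/call`. The tool name `recommend_model` and its `task` argument are hypothetical placeholders, not documented WhichModel tools; substitute whatever `tools/list` actually returns.

```python
import json


def build_tools_list_request(request_id: int) -> dict:
    """Build the MCP JSON-RPC message that asks a server which tools it offers."""
    return {"jsonrpc": "2.0", "id": request_id, "method": "tools/list"}


def build_tool_call_request(request_id: int, tool_name: str, arguments: dict) -> dict:
    """Build the MCP JSON-RPC message that invokes one tool by name."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Step 1: discover the real tool names and input schemas.
list_request = build_tools_list_request(2)

# Step 2: call a tool. "recommend_model" and {"task": ...} are HYPOTHETICAL;
# use a name and arguments taken from the tools/list response instead.
call_request = build_tool_call_request(
    3, "recommend_model", {"task": "summarize a changelog"}
)

print(json.dumps(list_request))
print(json.dumps(call_request, indent=2))
```

Most MCP clients send these messages for you once the server is configured; building them by hand is mainly useful for debugging what the agent is asking for.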
How it compares
WhichModel is an LLM cost-routing MCP service, not a prompt-library skill or a local model runner.
Common Questions / FAQ
Who is WhichModel for?
Solo builders and agent authors who want autonomous workflows to pick cost-efficient LLMs via MCP instead of hard-coding one provider.
When should I use WhichModel?
Use it while designing or operating agent loops where token spend varies by step and you want routing recommendations on demand.
How do I add WhichModel to my agent?
Configure your MCP client with the remote streamable-http URL https://whichmodel.dev/mcp and verify connectivity to whichmodel.dev.
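As a rough way to verify connectivity outside an MCP client, the sketch below builds the `initialize` request that opens an MCP session and posts it to the remote URL using only the Python standard library. It assumes the server accepts protocol version `2025-03-26` (the MCP revision that introduced the streamable HTTP transport) and requires no authentication; neither is confirmed by this listing. The client name and version are placeholders.

```python
import json
import urllib.request

MCP_URL = "https://whichmodel.dev/mcp"  # remote MCP URL from the listing


def build_initialize_request(request_id: int = 1) -> dict:
    """Build the JSON-RPC `initialize` message that opens an MCP session."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "initialize",
        "params": {
            # Assumption: the server speaks this protocol revision.
            "protocolVersion": "2025-03-26",
            "capabilities": {},
            # Placeholder client identity; use your own.
            "clientInfo": {"name": "example-client", "version": "0.1.0"},
        },
    }


def build_http_request(url: str, message: dict) -> urllib.request.Request:
    """Wrap a JSON-RPC message in a POST request for streamable HTTP."""
    return urllib.request.Request(
        url,
        data=json.dumps(message).encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Streamable HTTP servers may answer with JSON or an SSE stream.
            "Accept": "application/json, text/event-stream",
        },
    )


def check_connectivity(url: str = MCP_URL, timeout: float = 10.0) -> int:
    """Send `initialize` and return the HTTP status code (performs a network call)."""
    request = build_http_request(url, build_initialize_request())
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status


# Building the request is offline; only check_connectivity() touches the network.
request = build_http_request(MCP_URL, build_initialize_request())
```

Calling `check_connectivity()` and getting a 2xx status suggests the endpoint is reachable; an error status raises `urllib.error.HTTPError`, which is worth catching in real use. A full client would go on to read the response body and send the `notifications/initialized` message.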