Computer Use

Name: Computer Use
Author: domdomegg

domdomegg/computer-use-mcp

Give your agent controlled desktop automation—screenshots plus mouse and keyboard actions—when GUI-only tools have no API.

Overview

Computer Use MCP is a Build-phase MCP server that lets an agent control the desktop by capturing screenshots and automating mouse and keyboard input for GUI tasks.

What is this MCP server?

Capture screenshots for visual grounding before actions
Drive mouse movement and clicks through MCP tools
Send keyboard input for form fill and shortcut workflows
Install via npx stdio or published mcpb bundle for supported clients
Published on npm as computer-use-mcp, server version 1.8.0
mcpb bundle published for the v1.8.0 release

Compatible agents: Claude Code, Cursor, Codex, Windsurf

Community signal: 297 GitHub stars.

What problem does it solve?

Builders hit dead ends when critical software only offers a GUI, forcing manual clicking while the agent cannot see or act on the screen.

Who is it for?

Advanced solo builders automating legacy desktop tools, reproducing visual bugs, or scripting personal workflows on a trusted local machine.

Skip if: you run on shared servers, operate unattended production bots, or need least-privilege automation without the risk of full desktop control.

What do I get? / Deliverables

After registration, the agent can observe the screen and perform bounded mouse and keyboard actions to complete tasks that lack APIs.

Screenshot-backed context for agent reasoning about on-screen UI
Executed click and type sequences driven through MCP
Repeatable stdio or mcpb MCP registration for trusted dev hosts

Recommended MCP Servers

1stDibs

The 1stDibs MCP server exposes browse-and-search capabilities against the 1stDibs luxury goods marketplace through a hos…

2Captcha MCP
aruxojuyu665/2Captcha-MCP

2Captcha MCP exposes the commercial 2Captcha API to MCP hosts with 43 tools—31 focused on captcha solving plus managemen…

4fetch

4fetch is a hosted MCP server that fetches a URL and returns clean Markdown with metadata so coding agents can quote pag…

Acrawl
Mingye-Lu/AgenticCrawler

acrawl (Agentic Crawler) is a Model Context Protocol server that packages autonomous web browsing into a single local bi…
5 stars

Agentfetch
bch1212/agentfetch-mcp

Agentfetch MCP is a token-budgeted web retrieval server for AI coding agents. Solo builders doing idea-phase competitor …

AgenticTotem Web Extractor

AgenticTotem Web Extractor is a hosted MCP server for AI web extraction: you supply URLs and a JSON Schema, and the serv…

Journey fit

Primary fit

Build: Agent skills & templates

Desktop control is agent tooling you add during Build when automating legacy apps or verifying UI flows that APIs do not expose. It is filed under agent tooling because the server extends what the model can do on the host OS rather than supplying a specific frontend component library.

How it compares

This is an OS-level computer-use MCP server, not a read-only fetch tool or a single-site browser skill.

Common Questions / FAQ

Who is Computer Use MCP for?

Experienced solo builders and agent users who need GUI automation on their own machine when APIs are missing and they accept desktop control risks.

When should I use Computer Use MCP?

Use it during Build for agent-tooling experiments, legacy app workflows, or visual verification—not as a default for every coding task.

How do I add Computer Use MCP to my agent?

Add computer-use-mcp via npx stdio or install the published v1.8.0 mcpb bundle per your client’s MCP instructions, on a host where screen and input access is intentional.
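As a rough illustration of the npx stdio route, the sketch below builds the kind of JSON entry many MCP clients accept. The `mcpServers` key, the `computer-use` label, and the `build_stdio_entry` helper are assumptions for illustration, not taken from this listing. Only the package name `computer-use-mcp` and the use of npx over stdio come from the text above. Check your client's own MCP documentation for the exact file location and schema.

```python
import json


def build_stdio_entry(package: str = "computer-use-mcp") -> dict:
    """Return a client config fragment that launches the server over stdio via npx.

    The "mcpServers" layout is an assumed, commonly used shape; your client
    may expect a different key or file format.
    """
    return {
        "mcpServers": {
            # "computer-use" is a hypothetical label; any unique name should do.
            "computer-use": {
                "command": "npx",
                # -y skips npx's install confirmation prompt.
                "args": ["-y", package],
            }
        }
    }


if __name__ == "__main__":
    # Print the fragment so it can be pasted into the client's MCP config.
    print(json.dumps(build_stdio_entry(), indent=2))
```

To pin the release mentioned above, passing `"computer-use-mcp@1.8.0"` as the `package` argument should work, since npx accepts a `name@version` specifier. Register it only on a host where granting screen and input access is intentional.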
