Browsing With Playwright

Name: Browsing With Playwright
Author: bilalmk

bilalmk/todo_correct

Spin up Playwright MCP to navigate sites, submit forms, screenshot pages, and scrape dynamic UIs when curl cannot see client-rendered content.

Overview

Browsing with Playwright is an agent skill most often used in Build (also Ship) that automates browser interactions via the Playwright MCP server for navigation, forms, screenshots, and dynamic scraping.

Install

npx skills add https://github.com/bilalmk/todo_correct --skill browsing-with-playwright

What is this skill?

Playwright MCP server start/stop via helper scripts or npx on port 8808 with --shared-browser-context
Navigation, back, click, fill forms, screenshots, and data extraction through mcp-client.py tool calls
Explicit boundary: use for interactive browsing—not static fetch jobs better served by curl or wget
Server lifecycle guidance for end-of-task shutdown versus long multi-step sessions
Restart path when the browser context becomes unresponsive
Default Playwright MCP listener documented on port 8808

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Adoption & trust: 1.7k installs on skills.sh; 1 GitHub stars; 1/3 security scanners passed (skills.sh audits).

What problem does it solve?

You need to interact with a modern web app—log in, click through flows, or screenshot a page—but fetching HTML alone misses JavaScript-rendered content.

Who is it for?

Agent workflows that must exercise real browser state across multiple steps on localhost:8808 with shared context preserved.

Skip if: Bulk static page downloads, server-only API testing with no DOM, or environments that forbid local Node processes and browser binaries.

When should I use this skill?

Tasks require web browsing, form submission, web scraping, UI testing, or browser interaction; not when only fetching static content.

What do I get? / Deliverables

A running Playwright MCP session with documented tool calls completes the browsing task and can be closed cleanly without leaking headless processes.

Executed browser navigation and interaction steps
Screenshots or extracted page data from live sessions
Clean server shutdown after the task

Recommended Skills

Agent Browservercel-labs/agent-browser

agent-browser is a Node-installed browser automation CLI built for AI agents that need dependable programmatic web inter…428k installs·35.5k stars

Lark Imlarksuite/cli

Lark IM is a Larksuite agent skill that exposes Feishu/Lark instant messaging to Claude Code, Cursor, and similar agents…210k installs·13.7k stars

Lark Calendarlarksuite/cli

lark-calendar is an agent skill for Feishu/Lark Calendar v4 exposed via lark-cli. Solo builders and small teams who alre…209k installs·13.7k stars

Lark Sheetslarksuite/cli

Skill for programmatic Feishu spreadsheet and worksheet management—create tables, bulk data IO, lookup, and export—using…209k installs·13.7k stars

Lark Vclarksuite/cli

lark-vc is an agent skill for Feishu/Lark video conferencing history and artifacts through lark-cli. After calls end, so…208k installs·13.7k stars

Lark Contactlarksuite/cli

CLI skill for Lark directory lookup: search employees and fetch metadata by open_id, with clear boundaries vs IM, calend…208k installs·13.7k stars

Journey fit

Spans multiple journey phases - primary shelf plus alternate fits below.

Primary fit

BuildAgent skills & templates

Build is primary because the skill wires an MCP browser automation server into the agent toolchain for day-to-day product and integration work. Agent-tooling is the shelf—documented server lifecycle, mcp-client.py calls, and shared-browser-context are MCP infrastructure, not a one-off test script.

Also useful

ShipTesting & QA

Also useful

ValidatePrototype & spike

Where it fits

Example use

BuildIntegrations & version control

Walk through a third-party OAuth consent screen to capture callback parameters for your integration spec.

Example use

ShipTesting & QA

Screenshot each step of checkout after deploy to attach evidence to a pre-release checklist.

Example use

ValidatePrototype & spike

Click through a competitor signup funnel to document friction before you scope your MVP.

Example use

GrowContent & marketing

Capture updated marketing pages after a launch tweak for changelog assets.

How it compares

Choose this MCP browser skill instead of curl/wget when the task requires clicks, forms, or rendered UI—not when a single HTTP GET suffices.

Common Questions / FAQ

Who is browsing-with-playwright for?

Solo builders and agent users who already run Claude Code, Cursor, or similar clients with MCP and need dependable Playwright-driven browser automation.

When should I use browsing-with-playwright?

Use it for web browsing, form submission, scraping dynamic sites, UI testing, or any browser interaction during Build integrations—and during Ship testing when you need to verify flows in a real browser rather than static fetches.

Is browsing-with-playwright safe to install?

It starts local servers and can drive arbitrary URLs; check the Security Audits panel on this page and restrict credentials, cookies, and target sites before automating logged-in sessions.

SKILL.md

READMESKILL.md - Browsing With Playwright

# Browser Automation



Automate browser interactions via Playwright MCP server.



## Server Lifecycle



### Start Server

```bash

# Using helper script (recommended)

bash scripts/start-server.sh



# Or manually

npx @playwright/mcp@latest --port 8808 --shared-browser-context &

```



### Stop Server

```bash

# Using helper script (closes browser first)

bash scripts/stop-server.sh



# Or manually

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_close -p '{}'

pkill -f "@playwright/mcp"

```



### When to Stop

- **End of task**: Stop when browser work is complete

- **Long sessions**: Keep running if doing multiple browser tasks

- **Errors**: Stop and restart if browser becomes unresponsive



**Important:** The `--shared-browser-context` flag is required to maintain browser state across multiple mcp-client.py calls. Without it, each call gets a fresh browser context.



## Quick Reference



### Navigation



```bash

# Go to URL

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_navigate \

  -p '{"url": "https://example.com"}'



# Go back

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_navigate_back -p '{}'

```



### Get Page State



```bash

# Accessibility snapshot (returns element refs for clicking/typing)

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_snapshot -p '{}'



# Screenshot

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_take_screenshot \

  -p '{"type": "png", "fullPage": true}'

```



### Interact with Elements



Use `ref` from snapshot output to target elements:



```bash

# Click element

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_click \

  -p '{"element": "Submit button", "ref": "e42"}'



# Type text

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_type \

  -p '{"element": "Search input", "ref": "e15", "text": "hello world", "submit": true}'



# Fill form (multiple fields)

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_fill_form \

  -p '{"fields": [{"ref": "e10", "value": "john@example.com"}, {"ref": "e12", "value": "password123"}]}'



# Select dropdown

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_select_option \

  -p '{"element": "Country dropdown", "ref": "e20", "values": ["US"]}'

```



### Wait for Conditions



```bash

# Wait for text to appear

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_wait_for \

  -p '{"text": "Success"}'



# Wait for time (ms)

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_wait_for \

  -p '{"time": 2000}'

```



### Execute JavaScript



```bash

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_evaluate \

  -p '{"function": "return document.title"}'

```



### Multi-Step Playwright Code



For complex workflows, use `browser_run_code` to run multiple actions in one call:



```bash

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_run_code \

  -p '{"code": "async (page) => { await page.goto(\"https://example.com\"); await page.click(\"text=Learn more\"); return await page.title(); }"}'

```



**Tip:** Use `browser_run_code` for complex multi-step operations that should be atomic (all-or-nothing).



## Workflow: Form Submission



1. Navigate to page

2. Get snapshot to find element refs

3. Fill form fields using refs

4. Click submit

5. Wait for confirmation

6. Screenshot result



## Workflow: Data Extraction



1. Navigate to page

2. Get snapshot (contains text conten

What is this skill?

Playwright MCP server start/stop via helper scripts or npx on port 8808 with --shared-browser-context

Navigation, back, click, fill forms, screenshots, and data extraction through mcp-client.py tool calls

Explicit boundary: use for interactive browsing—not static fetch jobs better served by curl or wget

Server lifecycle guidance for end-of-task shutdown versus long multi-step sessions

Restart path when the browser context becomes unresponsive

Default Playwright MCP listener documented on port 8808

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Adoption & trust: 1.7k installs on skills.sh; 1 GitHub stars; 1/3 security scanners passed (skills.sh audits).

Journey fit

Spans multiple journey phases - primary shelf plus alternate fits below.

Primary fit

BuildAgent skills & templates

Also useful

ShipTesting & QA

Also useful

ValidatePrototype & spike

Where it fits

Example use

BuildIntegrations & version control

Walk through a third-party OAuth consent screen to capture callback parameters for your integration spec.

Example use

ShipTesting & QA

Screenshot each step of checkout after deploy to attach evidence to a pre-release checklist.

Example use

ValidatePrototype & spike

Click through a competitor signup funnel to document friction before you scope your MVP.

Example use

GrowContent & marketing

Capture updated marketing pages after a launch tweak for changelog assets.

SKILL.md

READMESKILL.md - Browsing With Playwright

# Browser Automation



Automate browser interactions via Playwright MCP server.



## Server Lifecycle



### Start Server

```bash

# Using helper script (recommended)

bash scripts/start-server.sh



# Or manually

npx @playwright/mcp@latest --port 8808 --shared-browser-context &

```



### Stop Server

```bash

# Using helper script (closes browser first)

bash scripts/stop-server.sh



# Or manually

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_close -p '{}'

pkill -f "@playwright/mcp"

```



### When to Stop

- **End of task**: Stop when browser work is complete

- **Long sessions**: Keep running if doing multiple browser tasks

- **Errors**: Stop and restart if browser becomes unresponsive



**Important:** The `--shared-browser-context` flag is required to maintain browser state across multiple mcp-client.py calls. Without it, each call gets a fresh browser context.



## Quick Reference



### Navigation



```bash

# Go to URL

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_navigate \

  -p '{"url": "https://example.com"}'



# Go back

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_navigate_back -p '{}'

```



### Get Page State



```bash

# Accessibility snapshot (returns element refs for clicking/typing)

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_snapshot -p '{}'



# Screenshot

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_take_screenshot \

  -p '{"type": "png", "fullPage": true}'

```



### Interact with Elements



Use `ref` from snapshot output to target elements:



```bash

# Click element

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_click \

  -p '{"element": "Submit button", "ref": "e42"}'



# Type text

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_type \

  -p '{"element": "Search input", "ref": "e15", "text": "hello world", "submit": true}'



# Fill form (multiple fields)

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_fill_form \

  -p '{"fields": [{"ref": "e10", "value": "john@example.com"}, {"ref": "e12", "value": "password123"}]}'



# Select dropdown

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_select_option \

  -p '{"element": "Country dropdown", "ref": "e20", "values": ["US"]}'

```



### Wait for Conditions



```bash

# Wait for text to appear

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_wait_for \

  -p '{"text": "Success"}'



# Wait for time (ms)

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_wait_for \

  -p '{"time": 2000}'

```



### Execute JavaScript



```bash

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_evaluate \

  -p '{"function": "return document.title"}'

```



### Multi-Step Playwright Code



For complex workflows, use `browser_run_code` to run multiple actions in one call:



```bash

python3 scripts/mcp-client.py call -u http://localhost:8808 -t browser_run_code \

  -p '{"code": "async (page) => { await page.goto(\"https://example.com\"); await page.click(\"text=Learn more\"); return await page.title(); }"}'

```



**Tip:** Use `browser_run_code` for complex multi-step operations that should be atomic (all-or-nothing).



## Workflow: Form Submission



1. Navigate to page

2. Get snapshot to find element refs

3. Fill form fields using refs

4. Click submit

5. Wait for confirmation

6. Screenshot result



## Workflow: Data Extraction



1. Navigate to page

2. Get snapshot (contains text conten

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Where it fits

Who is browsing-with-playwright for?

When should I use browsing-with-playwright?

Is browsing-with-playwright safe to install?

SKILL.md

This week for builders

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Where it fits

Who is browsing-with-playwright for?

When should I use browsing-with-playwright?

Is browsing-with-playwright safe to install?

SKILL.md