Benchmark Testing

Name: Benchmark Testing
Author: vercel-labs

vercel-labs/vercel-plugin

1.1k installs
229 repo stars
Updated July 27, 2026
vercel-labs/vercel-plugin

benchmark-testing provides documented workflows for Create and launch benchmark test projects to exercise vercel-plugin skill injection across realistic scenarios. Sets up isolated directories, installs the plugi

About

The benchmark-testing skill create and launch benchmark test projects to exercise vercel-plugin skill injection across realistic scenarios Sets up isolated directories installs the plugin and spawns WezTerm panes running Claude Code with crafted prompts Benchmark Testing Create isolated test projects that exercise vercel-plugin skill injection with realistic technology-agnostic prompts Create test directories bash BASE dev vercel-plugin-testing mkdir p BASE 01-slug 02-slug 2 Install the plugin in each directory bash for dir in BASE do echo basename dir cd dir npx add-plugin https github com vercel vercel-plugin s project y 2 1 tail 1 done This creates claude settings json with enabledPlugins in each directory Launch Claude Code in WezTerm panes Critical details that must all be followed Use cwd absolute-path to set the working directory Use unset CLAUDECODE before x to avoid nested-session errors Use settings claude settings json not settings project to load the plugin Use double quotes on the outer ic string single quotes around the prompt Wait 10 seconds between each launch to avoid overwhelming the system

Use `--cwd <absolute-path>` to set the working directory
Use `unset CLAUDECODE` before `x` to avoid nested-session errors
Use `--settings .claude/settings.json` (not `--settings project`) to load the plugin
Use double quotes on the outer `-ic` string, single quotes around the prompt
Wait **10 seconds** between each launch to avoid overwhelming the system

Benchmark Testing by the numbers

1,117 all-time installs (skills.sh)
Ranked #514 of 2,184 Testing & QA skills by installs in the Skillselion catalog
Security screen: MEDIUM risk (skills.sh audit)
Data as of Jul 28, 2026 (Skillselion catalog sync)

At a glance

benchmark-testing capabilities & compatibility

Capabilities: use ` cwd <absolute path>` to set the working d · use `unset claudecode` before `x` to avoid neste · use ` settings .claude/settings.json` (not ` s · use double quotes on the outer ` ic` string, sin · wait **10 seconds** between each launch to avoid
Use cases: documentation

From the docs

What benchmark-testing says it does

# Benchmark Testing Create isolated test projects that exercise vercel-plugin skill injection with realistic, technology-agnostic prompts.

SKILL.md

Create test directories ```bash BASE=~/dev/vercel-plugin-testing mkdir -p "$BASE"/{01-slug,02-slug,...} ``` ### 2.

SKILL.md

npx skills add https://github.com/vercel-labs/vercel-plugin --skill benchmark-testing

Add your badge

Show developers this skill is listed on Skillselion. Paste this into your README.

[![Listed on Skillselion](https://skillselion.com/badge/skills/vercel-labs/vercel-plugin/benchmark-testing.svg)](https://skillselion.com/skills/vercel-labs/vercel-plugin/benchmark-testing)

Installs	1.1k
repo stars	★ 229
Security audit	1 / 3 scanners passed
Last updated	July 27, 2026
Repository	vercel-labs/vercel-plugin ↗

How do I use benchmark-testing for the task described in its SKILL.md triggers?

Create and launch benchmark test projects to exercise vercel-plugin skill injection across realistic scenarios. Sets up isolated directories, installs the plugin, and spawns WezTerm panes running Cla.

Who is it for?

Teams invoking benchmark-testing when the user request matches documented triggers and prerequisites.

Skip if: Skip when cached docs are missing, the request is a negative trigger, or another sibling skill owns the workflow.

When should I use this skill?

What you get

Step-by-step guidance grounded in benchmark-testing documentation and reference files.

benchmark directory tree
plugin-installed sandboxes
parallel test session layout

By the numbers

Workflow creates multiple numbered benchmark directories under a shared BASE path

Files

SKILL.mdMarkdownGitHub ↗

Benchmark Testing

Create isolated test projects that exercise vercel-plugin skill injection with realistic, technology-agnostic prompts.

Workflow

1. Create test directories

BASE=~/dev/vercel-plugin-testing
mkdir -p "$BASE"/{01-slug,02-slug,...}

2. Install the plugin in each directory

for dir in "$BASE"/*/; do
  echo "=== $(basename "$dir") ==="
  cd "$dir" && npx add-plugin https://github.com/vercel/vercel-plugin -s project -y 2>&1 | tail -1
done

This creates .claude/settings.json with enabledPlugins in each directory.

3. Launch Claude Code in WezTerm panes

Critical details that must all be followed:

Use --cwd <absolute-path> to set the working directory
Use unset CLAUDECODE before x to avoid nested-session errors
Use --settings .claude/settings.json (not --settings project) to load the plugin
Use double quotes on the outer -ic string, single quotes around the prompt
Wait 10 seconds between each launch to avoid overwhelming the system
Always use spawn (new tabs) — split-pane runs out of space after ~4 panes

Working command template:

wezterm cli spawn --cwd /absolute/path/to/test-dir -- /bin/zsh -ic "unset CLAUDECODE; x 'YOUR PROMPT HERE. Link the project to my vercel-labs team so we can deploy it later.' --settings .claude/settings.json; exec zsh"

Prompt Guidelines

Never name specific technologies (no "Next.js", "Stripe", "Vercel KV", etc.)
Describe the product and features — let the plugin infer which skills to inject
Make prompts ambitious and multi-featured to exercise multiple skill triggers
Always append: "Link the project to my vercel-labs team so we can deploy it later."

Example prompts

Slug	Prompt	Expected skills
recipe-platform	"Build a recipe sharing platform where users sign up, upload photos of their dishes, write ingredients and steps, and browse a feed with infinite scroll..."	auth, vercel-storage, nextjs
trivia-game	"Create a multiplayer trivia game where players join a room with a 6-letter code, answer questions in real-time with a 15-second countdown..."	vercel-storage, nextjs
code-review-bot	"Build an AI-powered code review dashboard with webhook API routes, LLM streaming analysis, and stats over time..."	ai-sdk, nextjs
conference-tickets	"Create a conference ticketing system with tiered checkout, QR code emails, admin panel, and payment webhook handling..."	payments, email, auth
content-aggregator	"Build a content aggregator with hourly scheduled RSS fetching, LLM summaries, category filters, and bookmarks..."	cron-jobs, ai-sdk
finance-tracker	"Build a personal finance tracker with bank connection, spending charts, and weekly email digest via scheduled job..."	cron-jobs, email
multi-tenant-blog	"Create a multi-tenant blog where each user gets a subdomain, with request-level routing, headless content API, and role-based auth..."	routing-middleware, cms, auth
status-page	"Build a SaaS status page with scheduled endpoint pinging, uptime charts, incident logging, and KV-stored history..."	cron-jobs, vercel-storage, observability
dog-walking-saas	"Build a dog walking SaaS with user accounts, pet photos, booking, monthly invoicing, admin dashboard, and separate dev/prod env configs..."	payments, auth, vercel-storage, env-vars

Cleanup

rm -rf ~/dev/vercel-plugin-testing

Related skills

TddFollow test-driven development with a strict red-green-refactor loop when creating reliable features or fixing bugs.510k185k

Test Driven DevelopmentEnforce writing failing tests before any production implementation code.176k260k

QaRun conversational QA sessions that turn user-reported bugs into well-written, domain-aware GitHub issues without manual ticket writing.164k185k

Migrate To ShoehornAutomatically update TypeScript test files that rely on unsafe `as` type assertions by replacing them with type-safe partial objects from @total-typescript/shoehorn.151k185k

Webapp TestingVerify frontend behavior, debug UI issues, capture screenshots, and inspect logs of a running local web application using Playwright.121k164k

Playwright CliRun browser automation, generate element snapshots, inspect DOM attributes, and execute Playwright tests from the terminal.96.3k12.2k

How it compares

Use benchmark-testing for plugin skill injection QA; use application test suites for product feature correctness.

FAQ

What does benchmark-testing do?

When should I use benchmark-testing?

What are common prerequisites?

--- name: benchmark-testing description: Create and launch benchmark test projects to exercise vercel-plugin skill injection across realistic scenarios.

Is Benchmark Testing safe to install?

skills.sh reports 1 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.

Testing & QAtesting

About

Benchmark Testing by the numbers

benchmark-testing capabilities & compatibility

What benchmark-testing says it does

Add your badge

How do I use benchmark-testing for the task described in its SKILL.md triggers?

Who is it for?

When should I use this skill?

What you get

By the numbers

Files

Benchmark Testing

Workflow

1. Create test directories

2. Install the plugin in each directory

3. Launch Claude Code in WezTerm panes

Prompt Guidelines

Example prompts

Cleanup

Related skills

How it compares

FAQ

What does benchmark-testing do?

When should I use benchmark-testing?

What are common prerequisites?

Is Benchmark Testing safe to install?

This week in AI coding