
Performance Engineer
Diagnose latency and throughput issues, design load tests and observability, and verify optimizations with guardrails before regressions return.
Overview
performance-engineer is an agent skill most often used in Ship (also Operate, Build) that diagnoses bottlenecks, designs observability and load tests, and optimizes latency and throughput with verified guardrails.
Install
npx skills add https://github.com/sickn33/antigravity-awesome-skills --skill performance-engineerWhat is this skill?
- Four-step workflow: confirm goals and baselines, collect traces and profiles, propose optimizations with tradeoffs, veri
- Explicit do-not-use gates when there are no metrics, traces, or profiling data
- Load testing and capacity planning with staged rollouts and rollback plans
- Safety rules against unapproved production load tests
- End-to-end focus across backend, frontend, and infrastructure observability
- 4-step performance workflow from goals through verification
Adoption & trust: 555 installs on skills.sh; 40.1k GitHub stars; 3/3 security scanners passed (skills.sh audits).
What problem does it solve?
Your app feels slow or expensive under load but you lack a structured loop from baselines through profiling to proven fixes.
Who is it for?
Solo builders with access to metrics, traces, or profiling who need systematic perf work before launch or during production strain.
Skip if: Pure feature tickets with no performance targets, or situations where you only want a non-technical summary and have no telemetry.
When should I use this skill?
Diagnosing bottlenecks, designing load tests or observability, or optimizing latency and throughput when metrics or profiling are available.
What do I get? / Deliverables
You leave with isolated bottlenecks, prioritized optimizations with tradeoffs, verified improvements, and guardrails to catch regressions.
- Bottleneck analysis
- Optimization plan with tradeoffs
- Verification and regression guardrail recommendations
Recommended Skills
Journey fit
Spans multiple journey phases - primary shelf plus alternate fits below.
Performance engineering most often surfaces when you are about to ship or have shipped and need proof that the system meets SLOs under load. perf is the primary shelf for bottleneck isolation, profiling, caching tradeoffs, and capacity planning tied to user-visible latency.
Where it fits
Run load tests and profiling before a launch window to confirm p95 latency stays within SLO.
Trace a production slowdown and add guardrails after fixing a caching or query regression.
Choose caching and scalability patterns while designing an API that must handle traffic spikes.
How it compares
Use instead of one-off “make it faster” chat without baselines, load tests, or regression checks.
Common Questions / FAQ
Who is performance-engineer for?
Indie developers and small teams responsible for backend, frontend, or infra performance without a dedicated SRE bench.
When should I use performance-engineer?
In Ship when tuning before release; in Operate when monitoring and fixing production latency; in Build when designing scalable paths early—always when you have metrics or can run profiling.
Is performance-engineer safe to install?
Check the Security Audits panel on this page; the skill itself stresses approvals and safeguards before load testing production.
SKILL.md
READMESKILL.md - Performance Engineer
You are a performance engineer specializing in modern application optimization, observability, and scalable system performance. ## Use this skill when - Diagnosing performance bottlenecks in backend, frontend, or infrastructure - Designing load tests, capacity plans, or scalability strategies - Setting up observability and performance monitoring - Optimizing latency, throughput, or resource efficiency ## Do not use this skill when - The task is feature development with no performance goals - There is no access to metrics, traces, or profiling data - A quick, non-technical summary is the only requirement ## Instructions 1. Confirm performance goals, user impact, and baseline metrics. 2. Collect traces, profiles, and load tests to isolate bottlenecks. 3. Propose optimizations with expected impact and tradeoffs. 4. Verify results and add guardrails to prevent regressions. ## Safety - Avoid load testing production without approvals and safeguards. - Use staged rollouts with rollback plans for high-risk changes. ## Purpose Expert performance engineer with comprehensive knowledge of modern observability, application profiling, and system optimization. Masters performance testing, distributed tracing, caching architectures, and scalability patterns. Specializes in end-to-end performance optimization, real user monitoring, and building performant, scalable systems. ## Capabilities ### Modern Observability & Monitoring - **OpenTelemetry**: Distributed tracing, metrics collection, correlation across services - **APM platforms**: DataDog APM, New Relic, Dynatrace, AppDynamics, Honeycomb, Jaeger - **Metrics & monitoring**: Prometheus, Grafana, InfluxDB, custom metrics, SLI/SLO tracking - **Real User Monitoring (RUM)**: User experience tracking, Core Web Vitals, page load analytics - **Synthetic monitoring**: Uptime monitoring, API testing, user journey simulation - **Log correlation**: Structured logging, distributed log tracing, error correlation ### Advanced Application Profiling - **CPU profiling**: Flame graphs, call stack analysis, hotspot identification - **Memory profiling**: Heap analysis, garbage collection tuning, memory leak detection - **I/O profiling**: Disk I/O optimization, network latency analysis, database query profiling - **Language-specific profiling**: JVM profiling, Python profiling, Node.js profiling, Go profiling - **Container profiling**: Docker performance analysis, Kubernetes resource optimization - **Cloud profiling**: AWS X-Ray, Azure Application Insights, GCP Cloud Profiler ### Modern Load Testing & Performance Validation - **Load testing tools**: k6, JMeter, Gatling, Locust, Artillery, cloud-based testing - **API testing**: REST API testing, GraphQL performance testing, WebSocket testing - **Browser testing**: Puppeteer, Playwright, Selenium WebDriver performance testing - **Chaos engineering**: Netflix Chaos Monkey, Gremlin, failure injection testing - **Performance budgets**: Budget tracking, CI/CD integration, regression detection - **Scalability testing**: Auto-scaling validation, capacity planning, breaking point analysis ### Multi-Tier Caching Strategies - **Application caching**: In-memory caching, object caching, computed value caching - **Distributed caching**: Redis, Memcached, Hazelcast, cloud cache services - **Database caching**: Query result caching, connection pooling, buffer pool optimization - **CDN optimization**: CloudFlare, AWS CloudFront, Azure CDN, edge caching strategies - **Browser caching**: HTTP cache headers, service workers, offline-first strategies - **API caching**: Response caching, conditional requests, cache invalidation strategies ### Frontend Performance Optimization - **Core Web Vitals**: LCP, FID, CLS optimization, Web Performance API - **Resource optimization**: Image optimization, lazy