Operate · Monitoring & observability
Monitoring & observability
The monitoring & observability tools a solo builder reaches for in the Operate phase - every AI-coding skill, MCP server and workflow Skillselion tracks for monitoring & observability, ranked by community signal so you can pick the right one fast.
TypeDescription
1Azure DiagnosticsDev ToolsskillTriage production Azure failures across App Service, Functions, AKS, and messaging with AppLens, logs, and KQL.374k1.2k
2Appinsights InstrumentationMonitoring & CloudskillWire Azure Application Insights onto an ASP.NET Core or Node.js app in App Service—via Portal auto-instrument or OpenTelemetry in code—so telemetry flows without guessing portal URLs or package names.374k1.2k
3Azure KustoMonitoring & CloudskillRun KQL against Azure Data Explorer for logs, telemetry, time series, schema discovery, and operational analytics373k1.2k
4Azure ObservabilityMonitoring & CloudskillRoute your agent through Azure Monitor, Application Insights, Log Analytics, alerts, and workbooks for metrics, APM, KQL, and dashboards on live Azure workloads.98.2k1.2k
5Caveman StatsAI & AgentsskillRun /caveman-stats to see real session token input, output, and estimated caveman savings from the Claude Code JSONL log without asking the model to guess.90.6k70k
6Gws Admin ReportsMonitoring & CloudskillQuery Google Workspace admin audit activities and customer or entity usage reports from the terminal when you need compliance trails or seat and app usage stats.15.7k26.9k
7Persona It AdminMonitoring & CloudskillOperate as a Google Workspace IT admin persona—security monitoring, audit review, Drive sharing policy, and daily standup via gws with gmail, drive, and calendar skills loaded.15.4k26.9k
8Google Agents Cli ObservabilityMonitoring & CloudskillTurn on BigQuery agent analytics, Cloud Trace, and prompt-response logging for scaffolded Google ADK agents.12.4k2.7k
9Agent PulseAI & AgentsskillSnapshot recent agent sessions, token usage, model costs, health signals, and forecasts via the Agent Pulse CLI.11.6k1
10Appinsights InstrumentationMonitoring & CloudskillWire an ASP.NET Core or Node.js app on Azure to Application Insights so solo builders can see health and telemetry in production.9.3k34.6k
11Emblem Portfolio TrackerBackend & DataskillRun a cross-chain crypto portfolio report with USD balances, trade P&L, and DeFi positions through the EmblemAI CLI.8.7k10
12Grafana DashboardsMonitoring & CloudskillSpin up production Grafana dashboards with RED/USE panels so you can see API health, infra saturation, and business KPIs in one place.8.4k36.5k
13Azure Resource Health DiagnoseMonitoring & CloudskillRun a structured Azure health workflow on a named resource—best practices, discovery, logs/telemetry—and output a remediation plan via Azure MCP-first tooling.8.4k34.6k
14StatusAI & AgentsskillPoll an in-flight Parallel research run by run ID so you know when results are ready without re-submitting the job.7.9k56
15Python ObservabilityAutomationskillInstrument Python services with Prometheus-style metrics, decorators, and the four golden signals so solo builders can see latency, traffic, errors, and saturation in production.7.1k36.5k
16Distributed TracingAutomationskillStand up Jaeger-style distributed tracing and instrument multi-service requests so you can debug latency in production.6.9k36.5k
17Okx Cex BotBackend & DataskillOperate OKX Grid and DCA Martingale bots from an agent via the okx CLI—create, tune, and watch P&L without hand-writing exchange API calls.6.8k134
18Service Mesh ObservabilityMonitoring & CloudskillStand up metrics, distributed tracing, and mesh dashboards so you can debug service-to-service latency and hit SLOs on Istio or Linkerd.6.7k36.5k
19On Call Handoff PatternsAutomationskillStandardize on-call shift handoffs so the incoming engineer inherits active incidents, open investigations, and next steps without tribal knowledge.6.7k36.5k
20Slo ImplementationAutomationskillDefine SLIs, SLOs, burn-rate alerts, and review cadences so a solo builder can run production services with measurable reliability instead of guessing uptime.6.7k36.5k
21LangfuseAI & AgentsskillQuery traces, observations, metrics, and scores from Langfuse via CLI so you can debug LLM apps and agent workflows in production.5.5k158
22Okx Audit LogDev ToolsskillLook up where Onchain OS stores CLI and MCP audit history so you can debug failed commands offline without dumping secrets into chat.4.6k284
23Technical Seo CheckerDocs & PlanningskillSolo builders use this tool to audit technical SEO health, identify crawlability and indexing issues, and optimize Core Web Vitals without manual testing.4.5k2.1k
24Openclaw Control CenterAI & AgentsskillRun a local-first dashboard to watch OpenClaw agent health, token spend, tasks, collaboration handoffs, and memory without enabling risky writes by default.4.1k31
25Use RailwayMonitoring & CloudskillAnalyze MongoDB health and performance on Railway-hosted services using SSH metrics and Railway API infrastructure signals.4k274
26Alert ManagerDocs & PlanningskillSet up automated SEO and traffic monitoring alerts to catch ranking drops, SERP changes, conversion declines, and technical issues before they impact business metrics.3.9k2.1k
27Golang ObservabilityAutomationskillInstrument Go services with slog, Prometheus, OpenTelemetry, profiling, and dashboards so production behavior is measurable before users complain.3.7k2k
28Logging Best PracticesDev ToolsskillInstall this when agents write or review backend code so logs become one wide canonical event per request instead of scattered printf debugging.3.4k94
29Canary WatchAutomationskillSmoke-test a live URL after deploy with HTTP, assets, SSE, console, and performance checks in quick or sustained watch modes.3.4k210k
30Dashboard BuilderAutomationskillTurn a pile of metrics into a Grafana, SigNoz, or similar dashboard that operators can actually act on.3.2k210k
31Monitoring ExpertAutomationskillDraft Prometheus alert rules for error rate, latency, service health, CPU, and memory so production issues page you with sane thresholds and annotations.3k9.7k
32Unified Notifications OpsAutomationskillUnify GitHub, Linear, hooks, and desktop alerts into one severity-aware notification policy so CI and review noise becomes actionable follow-up.3k210k
33Ecc Tools Cost AuditAutomationskillAudit ECC Tools GitHub App burn, PR recursion, quota bypass, and premium-model routing with an evidence-first operator workflow in the ECC-Tools repo.3k210k
34Adk Observability GuideAI & AgentsskillConfigure tracing, logging, and analytics for Google ADK agents in production so you can debug real traffic and improve performance.2.6k1.4k
35Terraform StacksAutomationskillPoll HCP Terraform Stack deployment and configuration state via the API when CLI watch commands block in CI or agent runs.2.5k654
36Toss SecuritiesBackend & DataskillQuery your Toss Securities accounts, holdings, quotes, and order history through tossctl without building custom scrapers.2.5k5.4k
37Google Cloud Recipe Networking ObservabilityMonitoring & CloudskillDiagnose GCP VPC, firewall, NAT, and threat telemetry when production traffic or connectivity looks wrong.2.4k12.1k
38Security MonitoringSecurityskillWire agent-driven SIEM queries, alerting, and incident workflows when you run production apps and need continuous threat and compliance visibility.2.4k196
39Weather AutomationAutomationskillPull forecasts and alerts and trigger location-aware automations (calendar, home, Slack) when weather thresholds hit.2.4k196
40Langsmith TraceAI & AgentsskillWire LangSmith tracing into LangChain or LangGraph apps and query or export traces when debugging solo-builder AI features in production.2.3k130
41Han River Water LevelBackend & DataskillLet your coding agent answer live Han River bridge water level and flow questions in Korean without you provisioning an HRFCO ServiceKey.2.3k5.4k
42Google Cloud Networking ObservabilityMonitoring & CloudskillAudit outbound NAT traffic and troubleshoot Cloud NAT port exhaustion using Logging MCP, BigQuery trends, or gcloud/bq fallbacks.2.2k12.1k
43Model UsageAI & AgentsskillPull per-model Codex or Claude spend from local CodexBar logs so you can see what your agent coding sessions actually cost.2k378k
44Sentry WorkflowDev ToolsskillRoute to the right Sentry workflow for fixing production issues, reviewing Sentry bot comments, or upgrading SDKs.2k197
45CamsnapDev ToolsskillPull RTSP/ONVIF camera snapshots, short clips, and motion-triggered captures from the terminal for home lab, security, or IoT workflows.1.8k378k
46Sentry Nextjs SdkAI & AgentsskillWire Sentry AI Agents Monitoring into a Next.js app so solo builders can see agent runs, LLM token/cost data, and tool-call failures in one place.1.8k197
47Sentry Create AlertAutomationskillConfigure Sentry workflow-engine alerts so issues trigger email, Slack, PagerDuty, or Discord when conditions match.1.7k197
48Sentry Sdk SetupMonitoring & CloudskillRoute solo builders to the correct Sentry SDK skill and install error monitoring, tracing, and session replay for their detected stack.1.7k197
49Sentry Feature SetupAutomationskillRoute an agent to the right Sentry feature skill for AI LLM tracing, OpenTelemetry export, or alert workflows instead of guessing SDK snippets.1.6k197
50Sentry Sdk UpgradeDev ToolsskillUpgrade @sentry JavaScript SDK packages across major versions without missing framework-specific config files or breaking imports.1.5k197
51StatusMonitoring & CloudskillCheck what is linked, deployed, and healthy on Railway before you change env vars, redeploy, or debug production.1.5k274
52Aws ObservabilityMonitoring & CloudskillDrop in CDK-ready CloudWatch alarm and dashboard patterns for Lambda with M-of-N evaluation and error-rate math instead of brittle defaults.1.5k819
53Sentry Setup Ai MonitoringAI & AgentsskillConfigure Sentry traces sampling so gen_ai agent span trees are not dropped when the root HTTP transaction is undersampled.1.5k197
54Sentry Browser SdkDev ToolsskillWire Sentry’s browser SDK so uncaught errors and rejections surface in production without ad-hoc console hunting.1.5k197
55BriefSecurityskillGenerate legal-team daily, topic, or incident briefings by scanning connected email, calendar, and internal sources—always for human attorney review.1.4k19.6k
56Mongodb ConnectionBackend & DataskillMonitor MongoDB driver connection-pool events and metrics so you can spot exhaustion and misconfiguration before production outages.1.4k131
57Building DashboardsAutomationskillCreate and update Axiom observability dashboards with the right chart types, APL queries, and grid layouts via API scripts.1.4k9
58Observe WhatsappAutomationskillInterpret Kapso WhatsApp health-check payloads and triage degraded messaging, token, and webhook failures in production.1.4k128
59Observing AgentforceAI & AgentsskillQuery Salesforce Data Cloud STDM traces to debug Agentforce sessions, LLM steps, and aggregated agent metrics in production orgs.1.3k513
60Aiconfig Online EvalsAI & AgentsskillResolve old bookmarks or prompts that still reference the retired LaunchDarkly skill name and route agents to the current online-evals instructions.1.3k16
Explore more
By category