Agents Best Practices

Name: Agents Best Practices
Author: denissergeevitch

denissergeevitch/agents-best-practices

Install when you need a provider-neutral checklist to design, audit, or harden an agent harness (loops, tools, permissions, memory, MCP, evals) beyond a single coding-agent stack.

Overview

Agents Best Practices is a journey-wide agent skill that teaches provider-neutral agent harness architecture—usable whenever a solo builder needs to design or harden loops, tools, permissions, and observability before co

Install

npx skills add https://github.com/denissergeevitch/agents-best-practices --skill agents-best-practices

What is this skill?

Provider-neutral harness model: instruction builder → model → tool proposal → schema validation → permission gate → exec
Covers tool design, system prompts, planning/goals, context compaction, memory, skills, MCP, observability, evals, promp
Treats coding agents as one subdomain; same patterns for research, ops, support, finance, and workflow automation agents
Emphasizes harness as control plane: model proposes, runtime validates, authorizes, records, and summarizes
Markdown-only file policy aligned with versioned agent-legible environments
Default 8-step agent loop from user/task through context update and repeat
Skill scope version 1.2.0 provider-neutral-agent-harness

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Adoption & trust: 383 installs on skills.sh; 1.9k GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

What problem does it solve?

You can call LLM APIs and wire tools, but without a rigorous control plane you get fragile loops, unsafe actions, opaque failures, and no repeatable way to evaluate harness changes.

Who is it for?

Solo builders shipping or refactoring custom agents (coding, research, ops, or vertical workflows) who want one neutral blueprint instead of vendor-specific blog scraps.

Skip if: Teams that only need a one-off prompt tweak with no tools, or builders who want a drop-in MCP server or hosted runtime rather than harness design principles.

When should I use this skill?

Designing, generating an MVP blueprint for, auditing, refactoring, or explaining an agentic harness for any domain.

What do I get? / Deliverables

You leave with a concrete default architecture and best-practice dimensions to implement or audit your harness, so model proposals stay validated, permissioned, logged, and improvable via evals and observability.

Harness architecture aligned to the default control-plane loop
Audit checklist across tools, memory, MCP, evals, and safety
Refactor or MVP blueprint narrative for stakeholders

Recommended Skills

Microsoft Foundrymicrosoft/azure-skills

Microsoft Foundry skill guides agents through the full Azure AI Foundry lifecycle—containerizing agents, pushing to ACR,…377k installs·1.2k stars

Azure Aimicrosoft/azure-skills

azure-ai is a Prism-oriented quick reference for Microsoft Azure AI work, with the published body centered on the Azure …375k installs·1.2k stars

Azure Hosted Copilot Sdkmicrosoft/azure-skills

Azure Hosted Copilot SDK is Microsoft's entry skill for repos using @github/copilot-sdk—it detects CopilotClient usage, …346k installs·1.2k stars

Lark Eventlarksuite/cli

Lark real-time subscription skill via lark-cli event consume for building bots and streaming webhook-style agent workers…208k installs·13.7k stars

Running Claude Code Via Litellm Copilotxixu-me/skills

Running Claude Code via LiteLLM Copilot walks through pointing Claude Code at a local LiteLLM proxy that forwards Anthro…200k installs·61 stars

Setup Matt Pocock Skillsmattpocock/skills

One-time per-repo setup so Matt Pocock engineering skills share correct issue tracker, triage strings, and domain docume…180k installs·121k stars

Journey fit

Useful at every journey phase - explore requirements and options before committing to a direction.

Where it fits

Example use

ValidateScope & plan

Draft an MVP agent blueprint with explicit tool boundaries before building integrations.

Example use

BuildAgent skills & templates

Implement the default propose-validate-execute loop with structured observations for your domain agents.

Example use

ShipSecurity

Align permission decisions and approval pauses with safety requirements before enabling autonomous actions.

Example use

ShipTesting & QA

Define evals and feedback loops to regression-test harness changes across model providers.

Example use

OperateMonitoring & observability

Wire observability and compaction so long-running agents stay legible in production logs.

How it compares

Architecture and procedural doctrine for the agent runtime—not a single integration skill or a packaged data ETL template.

Common Questions / FAQ

Who is agents-best-practices for?

Indie and solo builders designing or debugging agent harnesses across domains (coding, research, support, ops) who need control-plane patterns independent of one API vendor.

When should I use agents-best-practices?

Use during Build when scaffolding agent-tooling; during Ship when adding evals and safety gates; during Operate when tightening observability and compaction; and during Validate when blueprinting an MVP agent before full implementation.

Is agents-best-practices safe to install?

It is markdown guidance with no built-in tool execution; review the Security Audits panel on this Prism page and treat permission and safety sections as design requirements in your own runtime.

SKILL.md

READMESKILL.md - Agents Best Practices

# Agents Best Practices

Use this skill when the user asks how to build, improve, debug, or evaluate an agentic harness. This is a general-purpose agent architecture skill. Coding agents are one subdomain only; apply the same principles to research, finance, legal, support, operations, sales, healthcare, education, data analysis, procurement, and workflow automation agents.

## Core stance

An agent harness is the control plane around a model. The model proposes actions; the harness validates, authorizes, executes, records, summarizes, and returns observations. Keep the loop simple and make the runtime rigorous.

Default architecture:

```text
user/task
  -> instruction and context builder
  -> model call
  -> tool/action proposal
  -> schema validation
  -> permission decision
  -> execution or approval pause
  -> structured observation
  -> context update
  -> repeat within budget or finish
```

## When to activate this skill

Use this skill for prompts involving any of these intents:

- build an agent, agentic workflow, AI worker, autonomous assistant, or harness;
- create a domain-specific MVP agent design, starter harness, implementation blueprint, or first production-safe version;
- choose between OpenAI, Anthropic, OpenAI-compatible APIs, direct tool loops, hosted tools, or SDKs;
- design tools, permissions, guardrails, approval flows, or sandboxing;
- create planning mode, workflow orchestration, goal mode, todo tracking, or long-running task behavior;
- add context compaction, memory, retrieval, scoped instructions, or prompt hierarchies;
- attach Agent Skills, reusable workflows, MCP servers, external connectors, or tool search;
- audit an existing agent for reliability, cost, prompt-cache hit rate, safety, latency, or observability;
- create system prompts or developer instructions for a domain-specific agent;
- make source-of-truth knowledge, validation signals, logs, metrics, or workflow state legible to an agent.

Do not use this skill for ordinary single-turn writing, translation, or Q&A unless the user is asking about the design of an agent that will perform those tasks.

## How to use this skill

First, identify the user's design problem:

1. **Domain**: what work the agent performs.
2. **Autonomy level**: answer-only, draft-only, approval-gated action, or autonomous action within policy.
3. **Risk level**: read-only, internal write, external communication, financial, legal, healthcare, security, destructive, or privileged.
4. **State duration**: single turn, multi-turn session, resumable workflow, or long-running goal.
5. **Tool surface**: internal APIs, hosted tools, MCP/external connectors, browser, sandbox, filesystem, database, communication, or computation.
6. **Validation**: what proves the task is complete.

Then load the most relevant reference files, not all files by default. If the user asks to make or build an agent for a domain, default to MVP Builder Mode.

## MVP Builder Mode

When the user asks to make, build, design, scaffold, or specify an agent for a domain, produce a concrete domain-specific MVP harness blueprint, not only advice. Use [mvp-agent-blueprint.md](references/mvp-agent-blueprint.md) as the primary reference and load other references as needed.

Default behavior:

1. Infer a reasonable first version from the user's domain and stated constraints.
2. State assumptions briefly instead

What is this skill?

Provider-neutral harness model: instruction builder → model → tool proposal → schema validation → permission gate → exec

Covers tool design, system prompts, planning/goals, context compaction, memory, skills, MCP, observability, evals, promp

Treats coding agents as one subdomain; same patterns for research, ops, support, finance, and workflow automation agents

Emphasizes harness as control plane: model proposes, runtime validates, authorizes, records, and summarizes

Markdown-only file policy aligned with versioned agent-legible environments

Default 8-step agent loop from user/task through context update and repeat

Skill scope version 1.2.0 provider-neutral-agent-harness

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Adoption & trust: 383 installs on skills.sh; 1.9k GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

Who is it for?

Solo builders shipping or refactoring custom agents (coding, research, ops, or vertical workflows) who want one neutral blueprint instead of vendor-specific blog scraps.

Skip if: Teams that only need a one-off prompt tweak with no tools, or builders who want a drop-in MCP server or hosted runtime rather than harness design principles.

What do I get? / Deliverables

Harness architecture aligned to the default control-plane loop

Audit checklist across tools, memory, MCP, evals, and safety

Refactor or MVP blueprint narrative for stakeholders

Journey fit

Useful at every journey phase - explore requirements and options before committing to a direction.

Where it fits

Example use

ValidateScope & plan

Draft an MVP agent blueprint with explicit tool boundaries before building integrations.

Example use

BuildAgent skills & templates

Implement the default propose-validate-execute loop with structured observations for your domain agents.

Example use

ShipSecurity

Align permission decisions and approval pauses with safety requirements before enabling autonomous actions.

Example use

ShipTesting & QA

Define evals and feedback loops to regression-test harness changes across model providers.

Example use

OperateMonitoring & observability

Wire observability and compaction so long-running agents stay legible in production logs.

SKILL.md

READMESKILL.md - Agents Best Practices

# Agents Best Practices

Use this skill when the user asks how to build, improve, debug, or evaluate an agentic harness. This is a general-purpose agent architecture skill. Coding agents are one subdomain only; apply the same principles to research, finance, legal, support, operations, sales, healthcare, education, data analysis, procurement, and workflow automation agents.

## Core stance

An agent harness is the control plane around a model. The model proposes actions; the harness validates, authorizes, executes, records, summarizes, and returns observations. Keep the loop simple and make the runtime rigorous.

Default architecture:

```text
user/task
  -> instruction and context builder
  -> model call
  -> tool/action proposal
  -> schema validation
  -> permission decision
  -> execution or approval pause
  -> structured observation
  -> context update
  -> repeat within budget or finish
```

## When to activate this skill

Use this skill for prompts involving any of these intents:

- build an agent, agentic workflow, AI worker, autonomous assistant, or harness;
- create a domain-specific MVP agent design, starter harness, implementation blueprint, or first production-safe version;
- choose between OpenAI, Anthropic, OpenAI-compatible APIs, direct tool loops, hosted tools, or SDKs;
- design tools, permissions, guardrails, approval flows, or sandboxing;
- create planning mode, workflow orchestration, goal mode, todo tracking, or long-running task behavior;
- add context compaction, memory, retrieval, scoped instructions, or prompt hierarchies;
- attach Agent Skills, reusable workflows, MCP servers, external connectors, or tool search;
- audit an existing agent for reliability, cost, prompt-cache hit rate, safety, latency, or observability;
- create system prompts or developer instructions for a domain-specific agent;
- make source-of-truth knowledge, validation signals, logs, metrics, or workflow state legible to an agent.

Do not use this skill for ordinary single-turn writing, translation, or Q&A unless the user is asking about the design of an agent that will perform those tasks.

## How to use this skill

First, identify the user's design problem:

1. **Domain**: what work the agent performs.
2. **Autonomy level**: answer-only, draft-only, approval-gated action, or autonomous action within policy.
3. **Risk level**: read-only, internal write, external communication, financial, legal, healthcare, security, destructive, or privileged.
4. **State duration**: single turn, multi-turn session, resumable workflow, or long-running goal.
5. **Tool surface**: internal APIs, hosted tools, MCP/external connectors, browser, sandbox, filesystem, database, communication, or computation.
6. **Validation**: what proves the task is complete.

Then load the most relevant reference files, not all files by default. If the user asks to make or build an agent for a domain, default to MVP Builder Mode.

## MVP Builder Mode

When the user asks to make, build, design, scaffold, or specify an agent for a domain, produce a concrete domain-specific MVP harness blueprint, not only advice. Use [mvp-agent-blueprint.md](references/mvp-agent-blueprint.md) as the primary reference and load other references as needed.

Default behavior:

1. Infer a reasonable first version from the user's domain and stated constraints.
2. State assumptions briefly instead

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Where it fits

Who is agents-best-practices for?

When should I use agents-best-practices?

Is agents-best-practices safe to install?

SKILL.md

This week for builders

Overview

Install

What is this skill?

What problem does it solve?

Who is it for?

When should I use this skill?

What do I get? / Deliverables

Recommended Skills

Journey fit

Where it fits

Who is agents-best-practices for?

When should I use agents-best-practices?

Is agents-best-practices safe to install?

SKILL.md