Risk Classification

Name: Risk Classification
Author: athola

athola/claude-night-market

Pair task risk level with a graduated automation tier so coding agents pause, propose diffs, or hand load-bearing work back instead of over-committing.

Install

npx skills add https://github.com/athola/claude-night-market --skill risk-classification

What is this skill?

Maps GREEN / YELLOW / RED risk to four automation tiers: A3 Autonomous through A0 Manual
Aviation-inspired downgrade rule: drop automation tier when behavior drifts instead of repeating the same command
Tier table spells who writes, who approves, and default tier per risk color
Explicit handoff from risk answer (how carefully to verify) to automation answer (how much the agent may act alone)
Designed for agents that write, commit, or scaffold code without human load-bearing oversight

Adoption & trust: 1 installs on skills.sh; 304 GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

Recommended Skills

Microsoft Foundrymicrosoft/azure-skills

Microsoft Foundry skill guides agents through the full Azure AI Foundry lifecycle—containerizing agents, pushing to ACR,…377k installs·1.2k stars

Azure Aimicrosoft/azure-skills

azure-ai is a Prism-oriented quick reference for Microsoft Azure AI work, with the published body centered on the Azure …375k installs·1.2k stars

Azure Hosted Copilot Sdkmicrosoft/azure-skills

Azure Hosted Copilot SDK is Microsoft's entry skill for repos using @github/copilot-sdk—it detects CopilotClient usage, …346k installs·1.2k stars

Lark Eventlarksuite/cli

Lark real-time subscription skill via lark-cli event consume for building bots and streaming webhook-style agent workers…208k installs·13.7k stars

Running Claude Code Via Litellm Copilotxixu-me/skills

Running Claude Code via LiteLLM Copilot walks through pointing Claude Code at a local LiteLLM proxy that forwards Anthro…200k installs·61 stars

Setup Matt Pocock Skillsmattpocock/skills

One-time per-repo setup so Matt Pocock engineering skills share correct issue tracker, triage strings, and domain docume…180k installs·121k stars

Journey fit

Primary fit

Canonical shelf on Ship because the skill governs how much autonomy is safe before changes land in the repo—verification posture is a shipping gate, not a one-off build task. Security subphase captures risk classification and downgrade paths when automation drifts, aligned with safe change control rather than feature implementation.

Common Questions / FAQ

Is Risk Classification safe to install?

skills.sh reports 3 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.

SKILL.md

READMESKILL.md - Risk Classification

# Automation Tiers

Risk classification answers "how carefully must this be verified."
Automation tiers answer a second, distinct question: "how much
should the agent do on its own, and when must it hand control
back." A task can be correctly classified RED and still fail
because the agent acted at full autonomy when it should have
proposed and paused.

## Why Graduated Autonomy

Aviation learned this the hard way. When automation does not do
what is needed, the safe move is to *downgrade the level of
automation* (autopilot to flight director to hand-flying) rather
than re-issue the same command at the same level. Crews who could
not or would not downgrade are the "children of the magenta line";
automation dependency was implicated in a large share of accidents.
Classifying the flight as risky was never the gap. Failing to drop
a tier when the automation drifted was.

The transfer to coding agents: pair every risk tier with a default
automation tier, and make downgrading an explicit, pre-licensed
move rather than something the agent has to be talked into.

## The Tiers

| Automation tier | Agent does | Human does | Default for |
|-----------------|-----------|-----------|-------------|
| **A3 Autonomous** | Writes and commits | Spot-checks after | GREEN |
| **A2 Proposed** | Writes a diff, waits | Approves per hunk | YELLOW |
| **A1 Assisted** | Drafts scaffolding, narrates reasoning | Writes load-bearing code, approves | RED |
| **A0 Manual** | Reviews, explains | Writes the change | CRITICAL |

The mapping is a default, not a floor. A human may raise autonomy
when they have strong context, and must be able to lower it freely.

## The Downgrade Trigger

Drop one automation tier, do not re-prompt at the same tier, when
any of these fire:

- **Repeated failure**: two consecutive failed attempts of the same
  shape (same file, same error class, same tool). This is the
  two-challenge rule: no third blind retry.
- **Confidence drop**: the agent cannot state its assumptions, the
  alternatives it considered, or why the change is correct.
- **Stakes spike**: the change reaches into auth, migrations, money,
  concurrency, or a destructive operation mid-task.
- **Mode mismatch**: the agent's stated model of the repo state does
  not match reality (the coding analog of cockpit mode confusion).

On a downgrade, switch to the lower tier's division of labor, run a
read-only diagnostic, and have the human (or an independent checker)
re-establish what is actually true before proceeding.

## Relationship to Verification Gates

Verification gates (see `verification-gates.md`) and automation
tiers are complementary. The gate says what must pass before the
change is accepted; the automation tier says who holds the pen
while it is produced. A RED change runs both: A1 assisted authorship
*and* the RED verification gate, including independent verification
(see `imbue:proof-of-work` module `independent-verification.md`).

## Integration

For orchestrators, carry the automation tier alongside the risk
tier in task metadata:

```json
{
  "metadata": {
    "risk_tier": "RED",
    "automation_tier": "A1",
    "downgrade_reason": null
  }
}
```

When a downgrade trigger fires, record the new tier and the reason,
so the audit trail shows where autonomy was reduced and why. Mode
selection in `imbue:assisted-mastery` reads the automation tier:
A0/A1 imply explain mode, A2/A3 imply produce mode.


---
name: heuristic-classifier
description: Pattern-based classification rules mapping file paths and change types to risk tiers
parent_skill: leyline:risk-classification
category: infrastructure
estimated_tokens: 250
---

# Heuristic Classifier

## Pattern-Based Classification Rules

The classifier evaluates task risk by matching affected file paths against pattern rules. Higher-tier matches take precedence.

### CRITICAL Patterns

```
migrations/*delete*        # Destructive migration
migrations/*drop*          # Table/column drops
*.sql DROP                 #

What is this skill?

Maps GREEN / YELLOW / RED risk to four automation tiers: A3 Autonomous through A0 Manual

Aviation-inspired downgrade rule: drop automation tier when behavior drifts instead of repeating the same command

Tier table spells who writes, who approves, and default tier per risk color

Explicit handoff from risk answer (how carefully to verify) to automation answer (how much the agent may act alone)

Designed for agents that write, commit, or scaffold code without human load-bearing oversight

Adoption & trust: 1 installs on skills.sh; 304 GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

Journey fit

Primary fit

SKILL.md

READMESKILL.md - Risk Classification

# Automation Tiers

Risk classification answers "how carefully must this be verified."
Automation tiers answer a second, distinct question: "how much
should the agent do on its own, and when must it hand control
back." A task can be correctly classified RED and still fail
because the agent acted at full autonomy when it should have
proposed and paused.

## Why Graduated Autonomy

Aviation learned this the hard way. When automation does not do
what is needed, the safe move is to *downgrade the level of
automation* (autopilot to flight director to hand-flying) rather
than re-issue the same command at the same level. Crews who could
not or would not downgrade are the "children of the magenta line";
automation dependency was implicated in a large share of accidents.
Classifying the flight as risky was never the gap. Failing to drop
a tier when the automation drifted was.

The transfer to coding agents: pair every risk tier with a default
automation tier, and make downgrading an explicit, pre-licensed
move rather than something the agent has to be talked into.

## The Tiers

| Automation tier | Agent does | Human does | Default for |
|-----------------|-----------|-----------|-------------|
| **A3 Autonomous** | Writes and commits | Spot-checks after | GREEN |
| **A2 Proposed** | Writes a diff, waits | Approves per hunk | YELLOW |
| **A1 Assisted** | Drafts scaffolding, narrates reasoning | Writes load-bearing code, approves | RED |
| **A0 Manual** | Reviews, explains | Writes the change | CRITICAL |

The mapping is a default, not a floor. A human may raise autonomy
when they have strong context, and must be able to lower it freely.

## The Downgrade Trigger

Drop one automation tier, do not re-prompt at the same tier, when
any of these fire:

- **Repeated failure**: two consecutive failed attempts of the same
  shape (same file, same error class, same tool). This is the
  two-challenge rule: no third blind retry.
- **Confidence drop**: the agent cannot state its assumptions, the
  alternatives it considered, or why the change is correct.
- **Stakes spike**: the change reaches into auth, migrations, money,
  concurrency, or a destructive operation mid-task.
- **Mode mismatch**: the agent's stated model of the repo state does
  not match reality (the coding analog of cockpit mode confusion).

On a downgrade, switch to the lower tier's division of labor, run a
read-only diagnostic, and have the human (or an independent checker)
re-establish what is actually true before proceeding.

## Relationship to Verification Gates

Verification gates (see `verification-gates.md`) and automation
tiers are complementary. The gate says what must pass before the
change is accepted; the automation tier says who holds the pen
while it is produced. A RED change runs both: A1 assisted authorship
*and* the RED verification gate, including independent verification
(see `imbue:proof-of-work` module `independent-verification.md`).

## Integration

For orchestrators, carry the automation tier alongside the risk
tier in task metadata:

```json
{
  "metadata": {
    "risk_tier": "RED",
    "automation_tier": "A1",
    "downgrade_reason": null
  }
}
```

When a downgrade trigger fires, record the new tier and the reason,
so the audit trail shows where autonomy was reduced and why. Mode
selection in `imbue:assisted-mastery` reads the automation tier:
A0/A1 imply explain mode, A2/A3 imply produce mode.


---
name: heuristic-classifier
description: Pattern-based classification rules mapping file paths and change types to risk tiers
parent_skill: leyline:risk-classification
category: infrastructure
estimated_tokens: 250
---

# Heuristic Classifier

## Pattern-Based Classification Rules

The classifier evaluates task risk by matching affected file paths against pattern rules. Higher-tier matches take precedence.

### CRITICAL Patterns

```
migrations/*delete*        # Destructive migration
migrations/*drop*          # Table/column drops
*.sql DROP                 #

Install

What is this skill?

Recommended Skills

Journey fit

Is Risk Classification safe to install?

SKILL.md

This week for builders

Install

What is this skill?

Recommended Skills

Journey fit

Is Risk Classification safe to install?

SKILL.md