Workflow Monitor

Name: Workflow Monitor
Author: athola

Repository: athola/claude-night-market

Detect agent workflow failures—command errors, timeouts, retry loops, and context exhaustion—so long autonomous sessions stay efficient.

Overview

workflow-monitor is a journey-wide agent skill that catalogs detection patterns for command failures, timeouts, retry loops, and context exhaustion. It is usable whenever a solo builder runs multi-step agent workflows and needs…

Install

npx skills add https://github.com/athola/claude-night-market --skill workflow-monitor

What is this skill?

Detects non-zero exit codes and stderr error/failed/exception patterns
Flags timeout events including exit code 124 and timed-out messaging
Retry-loop heuristic when the same command runs more than three times
Context exhaustion signals when usage exceeds roughly 90% or truncation appears
Efficiency pattern for verbose output exceeding about 500 lines

Compatible agents: Claude Code, Codex, Cursor, any compatible agent

Adoption & trust: 1 install on skills.sh; 304 GitHub stars; 2/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

What problem does it solve?

Your agent session silently retries failed commands, hits context limits, or drowns in verbose output without a structured way to spot the breakdown.

Who is it for?

Builders orchestrating repeated shell-heavy agent sessions who want copy-paste detection logic for monitors or custom workflow runners.

Skip if: you need a turnkey hosted observability product with dashboards and alerts out of the box.

When should I use this skill?

Designing or reviewing agent workflow monitors for command failure, timeout, retry loops, context limits, or excessive output.

What do I get? / Deliverables

You apply documented detection signals and snippets so workflows surface errors, timeouts, retry loops, and efficiency issues before they waste another hour of agent time.

Detection pattern catalog
Embeddable bash/Python check snippets

Journey fit

Useful at every journey phase - explore requirements and options before committing to a direction.

Where it fits

Build (Agent skills & templates): Wrap code-generation scripts with exit-code and stderr checks before merging agent-produced patches.

Ship (Testing & QA): Fail fast when integration-test commands time out at 120s during agent-driven CI.

Grow (Lifecycle & retention): Monitor scheduled content or data jobs for repeat-command thrashing.

Operate (Monitoring & observability): Alert when nightly agent maintenance hits context truncation mid-run.

How it compares

Pattern library for DIY workflow guards—not a replacement for full APM or CI log aggregation platforms.

Common Questions / FAQ

Who is workflow-monitor for?

Solo developers running Claude Night Market–style or custom multi-command agent workflows who need explicit error and efficiency detection rules.

When should I use workflow-monitor?

Use it when scripting build automation, running agents in CI, maintaining ops bots, or operating daily coding agents: any time you want to catch failures, timeouts, more than 3 retries of the same command, or context usage above 90%.

Is workflow-monitor safe to install?

Check this page’s Security Audits panel for publication source and any audit metadata; the skill describes monitoring patterns and does not by itself execute commands until you integrate the snippets.

SKILL.md - Workflow Monitor

# Detection Patterns

Patterns for detecting workflow errors and inefficiencies.

## Error Patterns

### Command Failure

```bash
# Detection: non-zero exit code
# Capture stdout and stderr together so the output can be scanned for error text.
command_output=$(some_command 2>&1)
exit_code=$?
if [ "$exit_code" -ne 0 ]; then
  echo "ERROR: Command failed with exit code $exit_code"
fi
```

**Signals:**
- Non-zero exit code
- stderr output containing "error", "failed", "exception"
- Traceback patterns in output

### Timeout Events

```bash
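# Detection: Command exceeds timeout
# `timeout` (GNU coreutils) exits with status 124 when it stops the command.
timeout 120 some_long_command
exit_code=$?
if [ "$exit_code" -eq 124 ]; then
  echo "TIMEOUT: Command exceeded 120s limit"
fi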
```

**Signals:**
- Exit code 124 (timeout)
- "timed out" in output
- Session timeout warnings
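
For monitors written in Python rather than bash, the same two signals (exit code 124 and "timed out" text) might be checked as sketched below. The function names are hypothetical, and mapping `subprocess.TimeoutExpired` to exit code 124 is a convention chosen here to mirror the `timeout` utility, not something the skill specifies.

```python
import subprocess

TIMEOUT_EXIT_CODE = 124  # exit status used by the `timeout` utility


def is_timeout(exit_code: int, output: str) -> bool:
    """True if the exit code or the output text matches a timeout signal."""
    return exit_code == TIMEOUT_EXIT_CODE or "timed out" in output.lower()


def run_with_timeout(argv: list[str], seconds: float) -> tuple[int, str]:
    """Run argv and return (exit_code, combined output).

    If the time limit is exceeded, subprocess.run stops the child and raises
    TimeoutExpired; this sketch reports that case as exit code 124.
    """
    try:
        proc = subprocess.run(argv, capture_output=True, text=True, timeout=seconds)
        return proc.returncode, proc.stdout + proc.stderr
    except subprocess.TimeoutExpired:
        return TIMEOUT_EXIT_CODE, f"timed out after {seconds}s"
```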

### Retry Loops

**Detection:** Same command executed more than 3 times in a session.

```python
from collections import Counter


def detect_retry_loop(commands: list[str]) -> bool:
    """Detect if the same command is retried excessively (more than 3 times)."""
    counts = Counter(commands)
    return any(count > 3 for count in counts.values())
```

**Signals:**
- Repeated identical commands
- Similar commands with minor variations
- "retrying" patterns in output
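
The exact-match counter above misses the second signal, similar commands with minor variations. One possible way to catch those is to group commands by their leading non-flag tokens, as sketched below. The grouping rule, the `depth` parameter, and both function names are assumptions made for illustration; the skill does not define how similarity should be measured.

```python
import shlex
from collections import Counter


def command_key(command: str, depth: int = 2) -> tuple[str, ...]:
    """Reduce a command to its first `depth` non-flag tokens (assumed heuristic).

    Note: shlex.split raises ValueError on unbalanced quotes.
    """
    tokens = [t for t in shlex.split(command) if not t.startswith("-")]
    return tuple(tokens[:depth])


def detect_similar_retry_loop(
    commands: list[str], threshold: int = 3
) -> list[tuple[str, ...]]:
    """Return the keys of command families run more than `threshold` times."""
    counts = Counter(command_key(c) for c in commands)
    return [key for key, count in counts.items() if count > threshold]
```

With this grouping, `npm test`, `npm test --verbose`, and `npm test --ci` all fall into the same `("npm", "test")` family, so four such runs in a session would be flagged.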

### Context Exhaustion

**Detection:** Context usage exceeds threshold.

**Signals:**
- "/context" shows >90% usage
- Truncation warnings
- "context limit" messages
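
The section gives no code for this check, so here is a minimal sketch under stated assumptions: the caller can supply token counts (the source only mentions the `/context` display), and the phrases searched for are guesses at what truncation and context-limit messages look like. The function name and parameters are hypothetical.

```python
# Assumed phrases; real agents may word these warnings differently.
CONTEXT_PHRASES = ("context limit", "truncated", "truncation")


def detect_context_exhaustion(
    used_tokens: int, max_tokens: int, output: str = "", threshold: float = 0.90
) -> bool:
    """True if usage exceeds the threshold or the output mentions truncation."""
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    usage = used_tokens / max_tokens
    text = output.lower()
    return usage > threshold or any(phrase in text for phrase in CONTEXT_PHRASES)
```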

## Efficiency Patterns

### Verbose Output

**Detection:** Command produces excessive output.

```bash
# Check output line count (stdout only; add 2>&1 before the pipe to include stderr)
output_lines=$(some_command | wc -l)
if [ "$output_lines" -gt 500 ]; then
  echo "WARNING: Verbose output ($output_lines lines)"
  echo "Suggestion: Use --quiet or redirect to file"
fi
```

**Common offenders:**
- `npm install` without `--silent`
- `pip install` without `--quiet`
- `git log` without `-n` limit
- `find` without `| head`

### Redundant File Reads

**Detection:** Same file read multiple times.

```python
from collections import Counter


def detect_redundant_reads(read_events: list[dict]) -> list[str]:
    """Find files read more than twice."""
    file_counts = Counter(e["file_path"] for e in read_events)
    return [f for f, count in file_counts.items() if count > 2]
```

**Suggestions:**
- Cache file contents in variables
- Use Read tool with offset/limit for large files
- Batch related reads together

### Sequential vs Parallel

**Detection:** Independent operations run sequentially.

```python
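def detect_parallelizable(operations: list[dict]) -> bool:
    """Check if operations could be parallelized."""
    # Assumed (hypothetical) schema: optional "targets", "depends_on", "ordered" keys.
    # Operations are independent if:
    # - No data dependencies between them
    # - Different target files/resources
    # - No ordering requirements
    seen: set[str] = set()
    for op in operations:
        if op.get("depends_on") or op.get("ordered"):
            return False
        targets = set(op.get("targets", []))
        if targets & seen:
            return False
        seen |= targets
    return len(operations) > 1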
```

**Examples:**
- Multiple independent `gh` API calls
- Reading unrelated files
- Running independent tests
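
Once operations are known to be independent, they can be dispatched concurrently. The sketch below shows this for the "reading unrelated files" example using a thread pool; the function name `read_files_parallel` and the default of four workers are illustrative choices, not part of the skill.

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def read_files_parallel(paths: list[str], max_workers: int = 4) -> dict[str, str]:
    """Read unrelated files concurrently and return {path: contents}."""

    def read_one(path: str) -> str:
        return Path(path).read_text(encoding="utf-8")

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in input order, so zip pairs each path
        # with its own contents.
        return dict(zip(paths, pool.map(read_one, paths)))
```

Threads suit this case because file reads are I/O-bound; CPU-heavy independent work would more likely call for separate processes.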

### Over-Fetching

**Detection:** Large file read when only a portion is needed.

**Signals:**
- Full file read followed by small extraction
- Large files read without offset/limit
- Regex search on entire file content
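
The first two signals can be approximated from read events, as in the sketch below. It reuses the `file_path` key from the redundant-reads example, but the other event fields (`file_size`, `bytes_used`, `offset`, `limit`) and both cutoff values are assumptions introduced for illustration; the skill does not define them.

```python
LARGE_FILE_BYTES = 100_000  # assumed cutoff; not specified in the source
SMALL_USE_RATIO = 0.10      # assumed: under 10% of the file was actually used


def detect_over_fetching(read_events: list[dict]) -> list[str]:
    """Return paths of reads that look like over-fetching (hypothetical schema)."""
    flagged = []
    for event in read_events:
        # Only whole-file reads (no offset and no limit) are candidates.
        whole_file = event.get("offset") is None and event.get("limit") is None
        if not whole_file:
            continue
        size = event.get("file_size", 0)
        used = event.get("bytes_used", size)
        large_unbounded = size > LARGE_FILE_BYTES
        small_extraction = size > 0 and used / size < SMALL_USE_RATIO
        if large_unbounded or small_extraction:
            flagged.append(event["file_path"])
    return flagged
```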

## Severity Classification

| Pattern | Default Severity | Context-Dependent |
|---------|-----------------|-------------------|
| Command failure | High | Lower if in test context |
| Timeout | High | Medium if a long runtime is expected |
| Retry loop | Medium | High if >5 retries |
| Context exhaustion | Medium | High if mandatory phases pending |
| Verbose output | Low | Medium if >1000 lines |
| Redundant reads | Low | Medium if >5 reads |
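
The table can be encoded as a lookup plus a few overrides. The sketch below is one possible reading of it: the pattern identifiers, the context keyword names, and the decision to treat "lower" for command failures as "medium" are all assumptions, since the table does not spell them out.

```python
DEFAULT_SEVERITY = {
    "command_failure": "high",
    "timeout": "high",
    "retry_loop": "medium",
    "context_exhaustion": "medium",
    "verbose_output": "low",
    "redundant_reads": "low",
}


def classify_severity(pattern: str, **ctx) -> str:
    """Apply the context-dependent column of the table (ctx keys are assumed)."""
    if pattern == "command_failure" and ctx.get("in_test_context"):
        return "medium"  # assumption: the table only says "lower"
    if pattern == "timeout" and ctx.get("expected_long"):
        return "medium"
    if pattern == "retry_loop" and ctx.get("retries", 0) > 5:
        return "high"
    if pattern == "context_exhaustion" and ctx.get("mandatory_phases_pending"):
        return "high"
    if pattern == "verbose_output" and ctx.get("lines", 0) > 1000:
        return "medium"
    if pattern == "redundant_reads" and ctx.get("reads", 0) > 5:
        return "medium"
    # No override applies, so fall back to the default column.
    return DEFAULT_SEVERITY[pattern]
```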

## Evidence Collection

For each detected pattern, collect:

1. **Command/action** - What was executed
2. **Output** - Full or relevant excerpt
3. **Timing** - When it occurred, duration
4. **Context** - What was happening before/after
5. **Severity** - Based on classification above

Format as evidence log entry:

```json
{
  "id": "E1",
  "type": "command_failure",
  "severity": "high",
  "command": "npm test",
  "exit_code": 1,
  "output_excerpt": "FAIL src/test.js\n  Test failed: expected...",
  "timestamp": "2025-01-14T10:30:00Z",
  "context": "Running validation phase"
}
```
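
An entry in this shape could be produced programmatically. The following sketch mirrors the field names of the JSON example; the `Evidence` dataclass, the `make_evidence` helper, and the 200-character excerpt limit are hypothetical and chosen only for illustration.

```python
import json
from dataclasses import asdict, dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass
class Evidence:
    id: str
    type: str
    severity: str
    command: str
    exit_code: int
    output_excerpt: str
    timestamp: str
    context: str


def make_evidence(
    index: int, type_: str, severity: str, command: str, exit_code: int,
    output: str, context: str, max_excerpt: int = 200,
    now: Optional[datetime] = None,
) -> Evidence:
    """Build one evidence entry; the excerpt limit is an assumed default."""
    moment = (now or datetime.now(timezone.utc)).astimezone(timezone.utc)
    return Evidence(
        id=f"E{index}",
        type=type_,
        severity=severity,
        command=command,
        exit_code=exit_code,
        output_excerpt=output[:max_excerpt],
        timestamp=moment.strftime("%Y-%m-%dT%H:%M:%SZ"),
        context=context,
    )


def to_json(entry: Evidence) -> str:
    """Serialize an entry with the same key order as the example above."""
    return json.dumps(asdict(entry), indent=2)
```

Passing `now` explicitly keeps the helper deterministic in tests; in normal use it defaults to the current UTC time.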


# Efficiency
