Memory Clarity Probe

Name: Memory Clarity Probe
Author: athola

athola/claude-night-market

Run dual anchor questions on a session summary or memory blob to see if task progress and information gaps are clear enough before handoff or compression.

Install

npx skills add https://github.com/athola/claude-night-market --skill memory-clarity-probe

What is this skill?

Dual-probe pattern from MMPO research: progress anchor plus information-gap anchor
Explicit pre-handoff, session checkpoint, and summary-quality-gate usage patterns
Best-of-N mode for comparing multiple summary candidates
Structured output format and exit criteria for pass-or-rewrite decisions
Integrates with session-palace-builder dependency for palace-oriented memory workflows

Adoption & trust: 1 installs on skills.sh; 304 GitHub stars; trending (+100% hot-view momentum).

Recommended Skills

Microsoft Foundrymicrosoft/azure-skills

Microsoft Foundry skill guides agents through the full Azure AI Foundry lifecycle—containerizing agents, pushing to ACR,…377k installs·1.2k stars

Azure Aimicrosoft/azure-skills

azure-ai is a Prism-oriented quick reference for Microsoft Azure AI work, with the published body centered on the Azure …375k installs·1.2k stars

Azure Hosted Copilot Sdkmicrosoft/azure-skills

Azure Hosted Copilot SDK is Microsoft's entry skill for repos using @github/copilot-sdk—it detects CopilotClient usage, …346k installs·1.2k stars

Lark Eventlarksuite/cli

Lark real-time subscription skill via lark-cli event consume for building bots and streaming webhook-style agent workers…208k installs·13.7k stars

Running Claude Code Via Litellm Copilotxixu-me/skills

Running Claude Code via LiteLLM Copilot walks through pointing Claude Code at a local LiteLLM proxy that forwards Anthro…200k installs·61 stars

Setup Matt Pocock Skillsmattpocock/skills

One-time per-repo setup so Matt Pocock engineering skills share correct issue tracker, triage strings, and domain docume…180k installs·121k stars

Journey fit

Primary fit

BuildAgent skills & templates

Build agent-tooling is the canonical shelf because the probe is a reusable quality gate in long agent sessions while you assemble tooling and workflows. Agent-tooling covers session memory, handoffs, and compression—exactly where clarity probes prevent corrupted context.

SKILL.md

READMESKILL.md - Memory Clarity Probe

## Table of Contents

- [What It Is](#what-it-is)
- [The Dual-Probe Pattern](#the-dual-probe-pattern)
- [What This Is NOT](#what-this-is-not)
- [When to Use](#when-to-use)
- [Core Workflow](#core-workflow)
- [Best-of-N Mode](#best-of-n-mode)
- [Output Format](#output-format)
- [Integration Points](#integration-points)
- [Exit Criteria](#exit-criteria)


# Memory Clarity Probe

Assess whether a memory, summary, or session state retains enough
task information to guide future reasoning.

## What It Is

A quality gate for any memory or summary, based on the dual-probe
pattern from MMPO (arXiv:2605.30159, Liu et al. 2026). The probe
asks two anchor questions against the current memory and evaluates
whether the answers are confident and complete:

1. **Progress probe**: "Based on current memory, what is the
   current task progress?"
2. **Gap probe**: "Based on current memory, what information is
   still needed?"

A clear memory answers the progress probe with specific, verifiable
state (not vague placeholders) and enumerates bounded, concrete
unknowns on the gap probe. An ambiguous memory produces hedging on
the progress probe and open-ended uncertainty on the gap probe.

## The Dual-Probe Pattern

The two probes target different failure modes:

- **Confident-wrong**: the model has a wrong but confident belief
  about task state. The gap probe alone misses this. The model
  claims it has enough. The progress probe catches it: if the
  stated progress contradicts known facts, the memory has drifted.
- **Uncertain-incomplete**: the model is uncertain about where the
  task stands. Both probes surface this: the progress answer hedges
  and the gap answer lists open-ended unknowns.

The MMPO paper's ablation (Table 4) shows `progress+gap` outperforms
`gap-only` across all context lengths. Use both probes.

## What This Is NOT

This skill implements a **qualitative** clarity assessment. It does
not compute the token-level predictive entropy (Belief Entropy,
Eq. 5 in MMPO) that the paper uses for RL training. Night-market
has no access to the model's internal log-probabilities.

The paper's Table 6 shows that qualitative probing (labeled
"direct-answer entropy", r=0.54) is weaker than true entropy
(r=0.68), and can encourage premature confidence. Use this probe
as a necessary quality check, not a sufficient one.

## When to Use

- Before `conserve:clear-context` hands off to a continuation agent
- At session checkpoints in `memory-palace:session-palace-builder`
- Before committing a summary to a knowledge palace via
  `memory-palace:knowledge-intake`
- Before `imbue:proof-of-work` declares work complete
- When evaluating multiple candidate summaries (Best-of-N mode)

## When NOT to Use

- As a substitute for actually reading the task requirements
- To validate factual correctness (the probe tests clarity, not truth)
- When the memory is trivially short (under 100 tokens: read it)

## Core Workflow

### Step 1: Receive the memory

Accept the memory or summary as input. Sources:
- The current session-state.md (from clear-context)
- A palace room's content (from session-palace-builder)
- A knowledge digest (from knowledge-intake)
- Inline text provided by the caller

### Step 2: Ask the progress probe

Evaluate the memory against:

```
Based on the memory below, what is the current task progress?
Describe specifically what has been completed and what st

What is this skill?

Dual-probe pattern from MMPO research: progress anchor plus information-gap anchor

Explicit pre-handoff, session checkpoint, and summary-quality-gate usage patterns

Best-of-N mode for comparing multiple summary candidates

Structured output format and exit criteria for pass-or-rewrite decisions

Integrates with session-palace-builder dependency for palace-oriented memory workflows

Adoption & trust: 1 installs on skills.sh; 304 GitHub stars; trending (+100% hot-view momentum).

Journey fit

Primary fit

BuildAgent skills & templates

SKILL.md

READMESKILL.md - Memory Clarity Probe

## Table of Contents

- [What It Is](#what-it-is)
- [The Dual-Probe Pattern](#the-dual-probe-pattern)
- [What This Is NOT](#what-this-is-not)
- [When to Use](#when-to-use)
- [Core Workflow](#core-workflow)
- [Best-of-N Mode](#best-of-n-mode)
- [Output Format](#output-format)
- [Integration Points](#integration-points)
- [Exit Criteria](#exit-criteria)


# Memory Clarity Probe

Assess whether a memory, summary, or session state retains enough
task information to guide future reasoning.

## What It Is

A quality gate for any memory or summary, based on the dual-probe
pattern from MMPO (arXiv:2605.30159, Liu et al. 2026). The probe
asks two anchor questions against the current memory and evaluates
whether the answers are confident and complete:

1. **Progress probe**: "Based on current memory, what is the
   current task progress?"
2. **Gap probe**: "Based on current memory, what information is
   still needed?"

A clear memory answers the progress probe with specific, verifiable
state (not vague placeholders) and enumerates bounded, concrete
unknowns on the gap probe. An ambiguous memory produces hedging on
the progress probe and open-ended uncertainty on the gap probe.

## The Dual-Probe Pattern

The two probes target different failure modes:

- **Confident-wrong**: the model has a wrong but confident belief
  about task state. The gap probe alone misses this. The model
  claims it has enough. The progress probe catches it: if the
  stated progress contradicts known facts, the memory has drifted.
- **Uncertain-incomplete**: the model is uncertain about where the
  task stands. Both probes surface this: the progress answer hedges
  and the gap answer lists open-ended unknowns.

The MMPO paper's ablation (Table 4) shows `progress+gap` outperforms
`gap-only` across all context lengths. Use both probes.

## What This Is NOT

This skill implements a **qualitative** clarity assessment. It does
not compute the token-level predictive entropy (Belief Entropy,
Eq. 5 in MMPO) that the paper uses for RL training. Night-market
has no access to the model's internal log-probabilities.

The paper's Table 6 shows that qualitative probing (labeled
"direct-answer entropy", r=0.54) is weaker than true entropy
(r=0.68), and can encourage premature confidence. Use this probe
as a necessary quality check, not a sufficient one.

## When to Use

- Before `conserve:clear-context` hands off to a continuation agent
- At session checkpoints in `memory-palace:session-palace-builder`
- Before committing a summary to a knowledge palace via
  `memory-palace:knowledge-intake`
- Before `imbue:proof-of-work` declares work complete
- When evaluating multiple candidate summaries (Best-of-N mode)

## When NOT to Use

- As a substitute for actually reading the task requirements
- To validate factual correctness (the probe tests clarity, not truth)
- When the memory is trivially short (under 100 tokens: read it)

## Core Workflow

### Step 1: Receive the memory

Accept the memory or summary as input. Sources:
- The current session-state.md (from clear-context)
- A palace room's content (from session-palace-builder)
- A knowledge digest (from knowledge-intake)
- Inline text provided by the caller

### Step 2: Ask the progress probe

Evaluate the memory against:

```
Based on the memory below, what is the current task progress?
Describe specifically what has been completed and what st

Install

What is this skill?

Recommended Skills

Journey fit

SKILL.md

This week for builders

Install

What is this skill?

Recommended Skills

Journey fit

SKILL.md