Karpathy Principles

Name: Karpathy Principles
Author: athola

athola/claude-night-market

Stop LLM coding drift by naming eight anti-pattern rails and applying think-before-coding checks before agents ship silent assumptions.

Install

npx skills add https://github.com/athola/claude-night-market --skill karpathy-principles

What is this skill?

Eight named drift rails (e.g. hidden assumptions, silent interpretation picks) mapped to Karpathy-style principles
Each rail includes trigger pattern, bad vs good example shape, and a one-sentence test for your diff
AP-1 rule: three or more silent assumptions in a diff is a rail violation
Forces explicit confirmation on scope, format, privacy, and volume before coding
Source chain documented via references/source-attribution.md in upstream repo

Adoption & trust: 1 installs on skills.sh; 304 GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

Recommended Skills

Microsoft Foundrymicrosoft/azure-skills

Microsoft Foundry skill guides agents through the full Azure AI Foundry lifecycle—containerizing agents, pushing to ACR,…377k installs·1.2k stars

Azure Aimicrosoft/azure-skills

azure-ai is a Prism-oriented quick reference for Microsoft Azure AI work, with the published body centered on the Azure …375k installs·1.2k stars

Azure Hosted Copilot Sdkmicrosoft/azure-skills

Azure Hosted Copilot SDK is Microsoft's entry skill for repos using @github/copilot-sdk—it detects CopilotClient usage, …346k installs·1.2k stars

Lark Eventlarksuite/cli

Lark real-time subscription skill via lark-cli event consume for building bots and streaming webhook-style agent workers…208k installs·13.7k stars

Running Claude Code Via Litellm Copilotxixu-me/skills

Running Claude Code via LiteLLM Copilot walks through pointing Claude Code at a local LiteLLM proxy that forwards Anthro…200k installs·61 stars

Setup Matt Pocock Skillsmattpocock/skills

One-time per-repo setup so Matt Pocock engineering skills share correct issue tracker, triage strings, and domain docume…180k installs·121k stars

Journey fit

Primary fit

BuildAgent skills & templates

Build/agent-tooling is the first shelf where agent-assisted implementation happens and drift is most expensive if assumptions hide in the diff. Agent-tooling captures procedural guardrails for Claude/Cursor-style coding, not a single library integration.

Common Questions / FAQ

Is Karpathy Principles safe to install?

skills.sh reports 3 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.

SKILL.md

READMESKILL.md - Karpathy Principles

# Anti-Patterns: Eight Named Drift Rails

Each drift rail names a specific way LLM-driven coding
goes wrong, with a trigger pattern, a short bad
example, a short good example, and a one-sentence
test you can apply to your own diff. Naming the rail
helps recall.

Adapted from observations distilled by Forrest Chang
from Karpathy's notes; see
`references/source-attribution.md` for the citation
chain. Examples are rewritten in our own prose to
illustrate the failure shape, not lifted verbatim.

## AP-1: Hidden Assumptions

**Maps to**: Principle 1 (Think Before Coding)

**Trigger**: User asks for a feature with multiple
unstated dimensions (scope, format, volume, privacy).
The agent picks one silently and ships.

**Bad shape**

```python
def export_users(format='json'):
    users = User.query.all()
    if format == 'json':
        with open('users.json', 'w') as f:
            json.dump([u.to_dict() for u in users], f)
```

Assumed: all users (privacy?), file path, fields,
volume.

**Good shape**

> Before implementing, I want to confirm:
>
> - Scope: all users, or filtered?
> - Format: download, background job, or API endpoint?
> - Fields: which? Some are sensitive.
> - Volume: thousands or millions?
>
> Default proposal: a paginated JSON endpoint. Adjust?

**Test**: count silent assumptions in the diff. Three
or more is a rail violation.

## AP-2: Multiple Interpretations Picked Silently

**Maps to**: Principle 1 (Think Before Coding)

**Trigger**: A request like "make the search faster"
has at least three valid readings (response time,
throughput, perceived speed). The agent picks one and
implements 200 lines without flagging the choice.

**Bad shape**

```python
@lru_cache(maxsize=1000)
async def search(query: str) -> List[Result]:
    # 200 lines of caching, async, indexes, all picked
    # without confirming what "faster" means
    ...
```

**Good shape**

> "Faster" could mean response time, throughput, or
> perceived speed. Each is a different fix:
>
> 1. Response time: indexes plus query caching (~2h)
> 2. Throughput: async path plus pooling (~4h)
> 3. Perceived: progressive load and partial results
>    (~3h)
>
> Current: ~500ms typical. Which axis matters?

**Test**: when the verb in the request is ambiguous
(faster, better, cleaner, simpler), did the agent name
the alternatives or pick one?

## AP-3: Strategy Pattern for One Function

**Maps to**: Principle 2 (Simplicity First)

**Trigger**: User asks for a single function. The
agent ships an abstract base class, two implementing
classes, a config dataclass, and a coordinator class
for ten lines of arithmetic.

**Bad shape**

```python
class DiscountStrategy(ABC):
    @abstractmethod
    def calculate(self, amount: float) -> float: ...

class PercentageDiscount(DiscountStrategy):
    def __init__(self, p): self.p = p
    def calculate(self, a): return a * (self.p / 100)

# Plus FixedDiscount, DiscountConfig, DiscountCalculator
# for what should be one function
```

**Good shape**

```python
def calculate_discount(amount: float, percent: float) -> float:
    return amount * (percent / 100)
```

**Test**: count types and classes added per actual use
case. If types-added exceeds use-cases-served, the
pattern is premature.

## AP-4: Speculative Features

**Maps to**: Principle 2 (Simplicity First)

**Trigger**: "Save user preferences to database"
becomes a class with optional caching, validation,
notification hooks, and merge semantics. None were
asked for.

**Bad shape**

```python
class PreferenceManager:
    def __init__(self, db, cache=None, validator=None):
        ...
    def save(self, user_id, prefs,
             merge=True, validate=True, notify=False):
        # 60 lines of optional behavior
```

**Good shape**

```python
def save_preferences(db, user_id: int, preferences: dict):
    db.execute(
        "UPDATE users SET preferences = ? WHERE id = ?",
        (json.dumps(preferences), user_id),
    )
```

**Test**: list every parameter that was not in the
reques

What is this skill?

Eight named drift rails (e.g. hidden assumptions, silent interpretation picks) mapped to Karpathy-style principles

Each rail includes trigger pattern, bad vs good example shape, and a one-sentence test for your diff

AP-1 rule: three or more silent assumptions in a diff is a rail violation

Forces explicit confirmation on scope, format, privacy, and volume before coding

Source chain documented via references/source-attribution.md in upstream repo

Adoption & trust: 1 installs on skills.sh; 304 GitHub stars; 3/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).

Journey fit

Primary fit

BuildAgent skills & templates

SKILL.md

READMESKILL.md - Karpathy Principles

# Anti-Patterns: Eight Named Drift Rails

Each drift rail names a specific way LLM-driven coding
goes wrong, with a trigger pattern, a short bad
example, a short good example, and a one-sentence
test you can apply to your own diff. Naming the rail
helps recall.

Adapted from observations distilled by Forrest Chang
from Karpathy's notes; see
`references/source-attribution.md` for the citation
chain. Examples are rewritten in our own prose to
illustrate the failure shape, not lifted verbatim.

## AP-1: Hidden Assumptions

**Maps to**: Principle 1 (Think Before Coding)

**Trigger**: User asks for a feature with multiple
unstated dimensions (scope, format, volume, privacy).
The agent picks one silently and ships.

**Bad shape**

```python
def export_users(format='json'):
    users = User.query.all()
    if format == 'json':
        with open('users.json', 'w') as f:
            json.dump([u.to_dict() for u in users], f)
```

Assumed: all users (privacy?), file path, fields,
volume.

**Good shape**

> Before implementing, I want to confirm:
>
> - Scope: all users, or filtered?
> - Format: download, background job, or API endpoint?
> - Fields: which? Some are sensitive.
> - Volume: thousands or millions?
>
> Default proposal: a paginated JSON endpoint. Adjust?

**Test**: count silent assumptions in the diff. Three
or more is a rail violation.

## AP-2: Multiple Interpretations Picked Silently

**Maps to**: Principle 1 (Think Before Coding)

**Trigger**: A request like "make the search faster"
has at least three valid readings (response time,
throughput, perceived speed). The agent picks one and
implements 200 lines without flagging the choice.

**Bad shape**

```python
@lru_cache(maxsize=1000)
async def search(query: str) -> List[Result]:
    # 200 lines of caching, async, indexes, all picked
    # without confirming what "faster" means
    ...
```

**Good shape**

> "Faster" could mean response time, throughput, or
> perceived speed. Each is a different fix:
>
> 1. Response time: indexes plus query caching (~2h)
> 2. Throughput: async path plus pooling (~4h)
> 3. Perceived: progressive load and partial results
>    (~3h)
>
> Current: ~500ms typical. Which axis matters?

**Test**: when the verb in the request is ambiguous
(faster, better, cleaner, simpler), did the agent name
the alternatives or pick one?

## AP-3: Strategy Pattern for One Function

**Maps to**: Principle 2 (Simplicity First)

**Trigger**: User asks for a single function. The
agent ships an abstract base class, two implementing
classes, a config dataclass, and a coordinator class
for ten lines of arithmetic.

**Bad shape**

```python
class DiscountStrategy(ABC):
    @abstractmethod
    def calculate(self, amount: float) -> float: ...

class PercentageDiscount(DiscountStrategy):
    def __init__(self, p): self.p = p
    def calculate(self, a): return a * (self.p / 100)

# Plus FixedDiscount, DiscountConfig, DiscountCalculator
# for what should be one function
```

**Good shape**

```python
def calculate_discount(amount: float, percent: float) -> float:
    return amount * (percent / 100)
```

**Test**: count types and classes added per actual use
case. If types-added exceeds use-cases-served, the
pattern is premature.

## AP-4: Speculative Features

**Maps to**: Principle 2 (Simplicity First)

**Trigger**: "Save user preferences to database"
becomes a class with optional caching, validation,
notification hooks, and merge semantics. None were
asked for.

**Bad shape**

```python
class PreferenceManager:
    def __init__(self, db, cache=None, validator=None):
        ...
    def save(self, user_id, prefs,
             merge=True, validate=True, notify=False):
        # 60 lines of optional behavior
```

**Good shape**

```python
def save_preferences(db, user_id: int, preferences: dict):
    db.execute(
        "UPDATE users SET preferences = ? WHERE id = ?",
        (json.dumps(preferences), user_id),
    )
```

**Test**: list every parameter that was not in the
reques

Install

What is this skill?

Recommended Skills

Journey fit

Is Karpathy Principles safe to install?

SKILL.md

This week for builders

Install

What is this skill?

Recommended Skills

Journey fit

Is Karpathy Principles safe to install?

SKILL.md