
Paper2code Arxiv Implementation
Turn an arXiv paper URL into citation-anchored Python that implements equations and sections without silently guessing missing details.
Overview
paper2code — Arxiv Paper to Working Implementation is an agent skill for the Build phase that converts an arXiv URL into citation-anchored Python with an explicit ambiguity audit.
Install
npx skills add https://github.com/aradotso/trending-skills --skill paper2code-arxiv-implementationWhat is this skill?
- Slash command /paper2code with optional --framework jax, pytorch (default), or other overrides
- Citation-anchored code: decisions tie to paper sections and equations with explicit ambiguity audit
- Trigger phrases include implement this arxiv paper, reproduce this ML paper, and paper2code
- Install via npx skills add with global symlink scope recommended for Claude Code
- Flags gaps instead of inventing missing hyperparameters or unspecified architecture details
Adoption & trust: 531 installs on skills.sh; 31 GitHub stars; 1/3 security scanners passed (skills.sh audits).
What problem does it solve?
You have a paper URL but no trustworthy, traceable Python reproduction and fear the agent will invent unspecified details.
Who is it for?
Solo ML builders reproducing arXiv methods for demos, benchmarks, or feature prototypes who need audit trails to the source PDF.
Skip if: Teams that only need a high-level paper summary, lack Python execution context, or require legally sensitive reimplementation without reviewing flagged gaps.
When should I use this skill?
User says implement this arxiv paper, turn this paper into code, paper2code, reproduce this ML paper, or provides an arXiv URL for implementation.
What do I get? / Deliverables
You get a working Python implementation with section- and equation-linked decisions plus a list of flagged ambiguities instead of silent guesses.
- Citation-anchored Python implementation
- Ambiguity and gap audit tied to paper sections
Recommended Skills
Journey fit
How it compares
Use instead of generic "implement this paper" chat when you need citation anchors and ambiguity reporting per section.
Common Questions / FAQ
Who is paper2code-arxiv-implementation for?
Indie developers and researchers using Claude Code or similar agents who want arXiv papers translated into traceable Python implementations they can run and extend.
When should I use paper2code-arxiv-implementation?
Use it in Build → backend when triggers match—implement this arxiv paper, turn this paper into code, paper2code, or reproduce this ML paper—and you have a stable arXiv URL ready.
Is paper2code-arxiv-implementation safe to install?
It drives code generation and dependency choices from external papers; review the Security Audits panel on this page and inspect generated code and third-party packages before running untrusted workloads.
SKILL.md
READMESKILL.md - Paper2code Arxiv Implementation
# paper2code — Arxiv Paper to Working Implementation > Skill by [ara.so](https://ara.so) — Daily 2026 Skills collection. paper2code is a Claude Code agent skill that converts any arxiv paper URL into a citation-anchored Python implementation. Every code decision references the exact paper section and equation it implements, and all gaps/ambiguities are explicitly flagged rather than silently filled in. --- ## Install ```bash npx skills add PrathamLearnsToCode/paper2code/skills/paper2code ``` During install you'll choose: - **Agents**: which coding agents get the skill (e.g., Claude Code) - **Scope**: Global (recommended) or project-level - **Method**: Symlink (recommended) or copy Then launch your agent: ```bash claude ``` --- ## Core Commands ### Basic usage ``` /paper2code https://arxiv.org/abs/1706.03762 ``` ### With framework override ``` /paper2code https://arxiv.org/abs/2006.11239 --framework jax /paper2code https://arxiv.org/abs/2006.11239 --framework pytorch # default /paper2code https://arxiv.org/abs/2006.11239 --framework tensorflow ``` ### With mode flag ``` /paper2code 1706.03762 --mode minimal # architecture only (default) /paper2code 1706.03762 --mode full # includes training loop + data pipeline /paper2code 1706.03762 --mode educational # extra comments + pedagogical notebook ``` ### Bare arxiv ID (no URL required) ``` /paper2code 1706.03762 /paper2code 2106.09685 ``` --- ## Output Structure Every run produces a directory named after the paper slug: ``` attention_is_all_you_need/ ├── README.md # Paper summary + quick-start ├── REPRODUCTION_NOTES.md # Ambiguity audit, unspecified choices, known deviations ├── requirements.txt # Pinned dependencies ├── src/ │ ├── model.py # Architecture — every layer cited to paper section │ ├── loss.py # Loss functions with equation references │ ├── data.py # Dataset skeleton with preprocessing TODOs │ ├── train.py # Training loop (full/educational mode) │ ├── evaluate.py # Metric computation │ └── utils.py # Shared utilities ├── configs/ │ └── base.yaml # All hyperparams — each cited or flagged [UNSPECIFIED] └── notebooks/ └── walkthrough.ipynb # Paper section → code → shape checks ``` --- ## Citation Anchoring Convention The core value of paper2code is traceability. Every non-trivial decision is tagged: | Tag | Meaning | |-----|---------| | `§X.Y` | Directly specified in section X.Y | | `§X.Y, Eq. N` | Implements equation N from section X.Y | | `[UNSPECIFIED]` | Paper doesn't state this — choice made with alternatives listed | | `[PARTIALLY_SPECIFIED]` | Paper mentions it but is ambiguous — quote included | | `[ASSUMPTION]` | Reasonable inference — reasoning explained | | `[FROM_OFFICIAL_CODE]` | Taken from authors' official implementation | ### Example — model.py with citation anchors ```python import torch import torch.nn as nn import math class MultiHeadAttention(nn.Module): """§3.2 — Multi-Head Attention Implements Eq. 4: MultiHead(Q, K, V) = Concat(head_1, ..., head_h) W^O where head_i = Attention(Q W_i^Q, K W_i^K, V W_i^V) """ def __init__(self, d_model: int, num_heads: int, dropout: float = 0.1): super().__init__() # §3.2 — d_model = 512, h = 8 stated in Table 1 assert d_model % num_heads == 0 self.d_k = d_model // num_heads # §3.2 — d_k = d_v = d_model / h = 64