Env And Assets Bootstrap

Name: Env And Assets Bootstrap
Author: lllllllama

lllllllama/ai-paper-reproduction-skill

140k installs
512 repo stars
Updated July 26, 2026
lllllllama/ai-paper-reproduction-skill

env-and-assets-bootstrap is an agent skill that plans conda environments and asset paths before running a README-documented ML reproduction.

About

The env-and-assets-bootstrap skill is the Rigor Setup step in a README-first deep learning reproduction pipeline. It runs after repo intake selects a credible target and before any training or evaluation commands execute. The skill produces conservative environment setup notes, candidate conda commands, asset path plans, checkpoint and dataset source hints, and explicit unresolved dependency or asset risks. It deliberately does not own target selection, full orchestration, paper interpretation, final run reporting, or generic package management outside a specific reproduction goal. Inputs include the target repo path, chosen reproduction objective, relevant README setup steps, and known OS or package constraints. Supporting references env-policy.md and assets-policy.md plus scripts bootstrap_env.py, plan_setup.py, and prepare_assets.py ground setup planning. Developers skip it when the repository already ships a ready-to-run environment that needs no translation from README instructions.

Acts as Rigor Setup after repo intake and before any reproduction run commands.
Outputs conservative conda-first environment notes and candidate commands from README steps.
Plans checkpoint, dataset, and cache directory assumptions with explicit unresolved risks.
Defers target selection, orchestration, and final reporting to sibling reproduction skills.
Uses bootstrap_env.py, plan_setup.py, and prepare_assets.py reference scripts.

Env And Assets Bootstrap by the numbers

139,884 all-time installs (skills.sh)
+29 installs in the week ending Jul 28, 2026 (Skillselion tracking)
Ranked #8 of 2,066 Data Science & ML skills by installs in the Skillselion catalog
Security screen: MEDIUM risk (skills.sh audit)
Data as of Jul 28, 2026 (Skillselion catalog sync)

At a glance

env-and-assets-bootstrap capabilities & compatibility

Capabilities: conservative conda environment planning · checkpoint and dataset path assumptions · readme setup step translation · unresolved dependency risk surfacing · script backed bootstrap and asset prep
Use cases: research · data analysis

From the docs

What env-and-assets-bootstrap says it does

Use this as the Rigor Setup skill.

SKILL.md

This skill prepares environment and asset assumptions.

SKILL.md

It does not own target selection.

SKILL.md

npx skills add https://github.com/lllllllama/ai-paper-reproduction-skill --skill env-and-assets-bootstrap

Add your badge

Show developers this skill is listed on Skillselion. Paste this into your README.

[![Listed on Skillselion](https://skillselion.com/badge/skills/lllllllama/ai-paper-reproduction-skill/env-and-assets-bootstrap.svg)](https://skillselion.com/skills/lllllllama/ai-paper-reproduction-skill/env-and-assets-bootstrap)

Installs	140k
repo stars	★ 512
Security audit	2 / 3 scanners passed
Last updated	July 26, 2026
Repository	lllllllama/ai-paper-reproduction-skill ↗

How do I conservatively prepare conda, checkpoints, and datasets before attempting to reproduce a deep learning paper repo?

Prepare conservative conda environments, checkpoint paths, and dataset cache plans before reproducing a README-documented ML repo.

Who is it for?

Researchers reproducing README-documented ML repos who need setup planning before executing training scripts.

Skip if: Skip when the repo environment is already ready to run or the task is only scanning without execution prep.

When should I use this skill?

After repo intake identifies a reproduction target and before running setup or training commands.

What you get

Documented environment commands, asset path plan, and flagged dependency risks ready for a reproduction run attempt.

Environment setup notes
Asset path plan
Checkpoint and dataset source hints

By the numbers

Clear boundaries list four tasks this skill does not own: selection, orchestration, reporting, and generic conda help.

Files

SKILL.mdMarkdownGitHub ↗

env-and-assets-bootstrap

Use this as the Rigor Setup skill. The installed slug remains env-and-assets-bootstrap for compatibility.

Use the shared operating principles in ../../references/agent-operating-principles.md; this skill should keep setup planning conservative while leaving environment-specific judgment to the model.

When to apply

After repo intake identifies a credible reproduction target.
When environment creation or asset path preparation is needed before running commands.
When the repo depends on checkpoints, datasets, or cache directories.
When the user explicitly wants setup help before any run attempt.

When not to apply

When the repository already ships a ready-to-run environment that does not need translation.
When the task is only to scan and plan.
When the task is only to report results from commands that already ran.
When the request is a generic conda or package-management question outside repo reproduction.

Clear boundaries

This skill prepares environment and asset assumptions.
It does not own target selection.
It does not own final reporting.
It does not perform paper lookup except by forwarding gaps to the optional paper resolver.

Input expectations

target repo path
selected reproduction goal
relevant README setup steps
any known OS or package constraints

Output expectations

conservative environment setup notes
candidate conda commands
asset path plan
checkpoint and dataset source hints
unresolved dependency or asset risks

Notes

Use references/env-policy.md, references/assets-policy.md, scripts/bootstrap_env.py, scripts/plan_setup.py, and scripts/prepare_assets.py. Use scripts/bootstrap_env.sh only as a POSIX wrapper around the Python bootstrapper when a shell entrypoint is more convenient.

display_name: Rigor Setup
short_description: Rigor Setup mode for conservative environment and asset assumptions before a reproduction run.
default_prompt: Prepare a conservative conda-first environment plus checkpoint, dataset, and cache assumptions for this README-documented reproduction target before any run.

#!/usr/bin/env python3
"""Bootstrap a conservative research environment on Windows, macOS, or Linux."""

from __future__ import annotations

import argparse
import shutil
import subprocess
import sys
from pathlib import Path
from typing import Iterable, List, Optional

from plan_setup import ENV_FILES, find_first, parse_env_name, venv_activation_commands


CONDA_ENV_FILES = {"environment.yml", "environment.yaml", "conda.yml"}


def format_command(command: Iterable[str]) -> str:
    return " ".join(str(part) for part in command)


def run_command(command: List[str], *, cwd: Path, dry_run: bool) -> None:
    print(f"+ {format_command(command)}")
    if dry_run:
        return
    subprocess.run(command, cwd=cwd, check=True)


def choose_manager(preferred: str) -> Optional[str]:
    if preferred != "auto":
        if shutil.which(preferred):
            return preferred
        raise FileNotFoundError(f"Requested manager `{preferred}` was not found on PATH.")

    for candidate in ["conda", "mamba"]:
        if shutil.which(candidate):
            return candidate
    return None


def venv_python(env_dir: Path) -> Path:
    if sys.platform.startswith("win"):
        return env_dir / "Scripts" / "python.exe"
    return env_dir / "bin" / "python"


def print_activation_instructions(env_name: Optional[str], using_conda: bool) -> None:
    if using_conda:
        target = env_name or "<env-name>"
        print(f"Activate with: conda activate {target}")
        return

    print("Activate the virtualenv with one of:")
    for item in venv_activation_commands():
        platforms = ", ".join(item.get("platforms", []))
        print(f"  [{platforms}] {item['command']}")


def install_with_manager(manager: str, env_name: str, repo_path: Path, rel_env_file: Optional[str]) -> None:
    if rel_env_file == "requirements.txt":
        run_command(
            [manager, "run", "-n", env_name, "python", "-m", "pip", "install", "-r", rel_env_file],
            cwd=repo_path,
            dry_run=False,
        )
    elif rel_env_file in {"pyproject.toml", "setup.py"}:
        run_command(
            [manager, "run", "-n", env_name, "python", "-m", "pip", "install", "-e", "."],
            cwd=repo_path,
            dry_run=False,
        )


def install_with_venv(env_python: Path, repo_path: Path, rel_env_file: Optional[str], *, dry_run: bool) -> None:
    if rel_env_file == "requirements.txt":
        run_command(
            [str(env_python), "-m", "pip", "install", "-r", rel_env_file],
            cwd=repo_path,
            dry_run=dry_run,
        )
    elif rel_env_file in {"pyproject.toml", "setup.py"}:
        run_command(
            [str(env_python), "-m", "pip", "install", "-e", "."],
            cwd=repo_path,
            dry_run=dry_run,
        )


def main() -> int:
    parser = argparse.ArgumentParser(description="Bootstrap a conservative AI research environment.")
    parser.add_argument("repo", nargs="?", default=".", help="Target repository path.")
    parser.add_argument("env_name", nargs="?", default="repro-env", help="Fallback environment name.")
    parser.add_argument("--python-version", default="3.10", help="Python version to use for conda or mamba environments.")
    parser.add_argument(
        "--manager",
        choices=["auto", "conda", "mamba"],
        default="auto",
        help="Conda-compatible manager to use when available.",
    )
    parser.add_argument("--dry-run", action="store_true", help="Print commands without executing them.")
    args = parser.parse_args()

    repo_path = Path(args.repo).resolve()
    env_file = find_first(repo_path, ENV_FILES)
    rel_env_file = env_file.relative_to(repo_path).as_posix() if env_file else None
    declared_env_name = parse_env_name(env_file) if env_file else None
    resolved_env_name = declared_env_name or args.env_name
    manager = choose_manager(args.manager)

    print(f"Target repo: {repo_path}")
    print(f"Detected environment file: {rel_env_file or 'none'}")

    if env_file and env_file.name in CONDA_ENV_FILES:
        if manager is None:
            raise SystemExit("A conda-compatible manager is required for environment.yml-based setup. Install conda or mamba first.")

        create_command = [manager, "env", "create", "-f", rel_env_file]
        if not declared_env_name:
            create_command.extend(["-n", resolved_env_name])
        run_command(create_command, cwd=repo_path, dry_run=args.dry_run)
        print_activation_instructions(declared_env_name or resolved_env_name, using_conda=True)
        return 0

    if manager is not None:
        run_command(
            [manager, "create", "-y", "-n", resolved_env_name, f"python={args.python_version}"],
            cwd=repo_path,
            dry_run=args.dry_run,
        )
        if not args.dry_run:
            install_with_manager(manager, resolved_env_name, repo_path, rel_env_file)
        print_activation_instructions(resolved_env_name, using_conda=True)
        return 0

    env_dir = repo_path / ".venv"
    run_command([sys.executable, "-m", "venv", str(env_dir)], cwd=repo_path, dry_run=args.dry_run)
    install_with_venv(venv_python(env_dir), repo_path, rel_env_file, dry_run=args.dry_run)
    print_activation_instructions(None, using_conda=False)
    return 0


if __name__ == "__main__":
    raise SystemExit(main())

#!/usr/bin/env bash
set -euo pipefail

SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
PYTHON_BIN="${PYTHON:-python3}"

if ! command -v "${PYTHON_BIN}" >/dev/null 2>&1; then
  PYTHON_BIN="python"
fi

exec "${PYTHON_BIN}" "${SCRIPT_DIR}/bootstrap_env.py" "$@"

#!/usr/bin/env python3
"""Create a conservative environment setup plan for a research repository."""

from __future__ import annotations

import argparse
import json
import re
from pathlib import Path
from typing import Any, Dict, List, Optional


ENV_FILES = [
    "environment.yml",
    "environment.yaml",
    "conda.yml",
    "requirements.txt",
    "pyproject.toml",
    "setup.py",
]
ALL_PLATFORMS = ["windows", "macos", "linux"]


def find_first(repo: Path, candidates: List[str]) -> Optional[Path]:
    for name in candidates:
        path = repo / name
        if path.exists():
            return path
    return None


def parse_env_name(path: Path) -> Optional[str]:
    if path.suffix not in {".yml", ".yaml"}:
        return None
    text = path.read_text(encoding="utf-8", errors="replace")
    match = re.search(r"^\s*name:\s*([A-Za-z0-9._-]+)\s*$", text, flags=re.MULTILINE)
    return match.group(1) if match else None


def command_entry(label: str, command: str, platforms: Optional[List[str]] = None) -> Dict[str, Any]:
    return {
        "label": label,
        "command": command,
        "platforms": list(platforms or ALL_PLATFORMS),
    }


def venv_activation_commands() -> List[Dict[str, Any]]:
    return [
        command_entry("adapted", ".\\.venv\\Scripts\\Activate.ps1", ["windows"]),
        command_entry("adapted", "source .venv/bin/activate", ["macos", "linux"]),
    ]


def append_venv_flow(setup_commands: List[Dict[str, Any]], install_command: Optional[str] = None) -> None:
    setup_commands.append(command_entry("adapted", "python -m venv .venv"))
    setup_commands.extend(venv_activation_commands())
    if install_command:
        setup_commands.append(command_entry("documented", install_command))


def build_setup_commands(repo: Path) -> Dict[str, object]:
    setup_commands: List[Dict[str, Any]] = []
    notes: List[str] = []
    unresolved: List[str] = []

    env_file = find_first(repo, ENV_FILES)
    env_name = parse_env_name(env_file) if env_file else None

    if env_file is None:
        unresolved.append("No top-level environment specification file was found.")
        setup_commands.append(command_entry("inferred", "python -m venv .venv"))
        setup_commands.extend(
            [
                command_entry("inferred", ".\\.venv\\Scripts\\Activate.ps1", ["windows"]),
                command_entry("inferred", "source .venv/bin/activate", ["macos", "linux"]),
            ]
        )
        notes.append("Defaulted to a virtualenv fallback because no environment file was detected.")
        return {
            "environment_file": None,
            "environment_name": None,
            "setup_commands": setup_commands,
            "setup_notes": notes,
            "unresolved_setup_risks": unresolved,
        }

    rel_env_file = env_file.relative_to(repo).as_posix()
    notes.append(f"Detected environment file `{rel_env_file}`.")
    if env_name:
        notes.append(f"Detected conda environment name `{env_name}`.")

    if env_file.name in {"environment.yml", "environment.yaml", "conda.yml"}:
        setup_commands.append(command_entry("documented", f"conda env create -f {rel_env_file}"))
        setup_commands.append(command_entry("adapted", f"conda activate {env_name}" if env_name else "conda activate <env-name>"))
        if not env_name:
            unresolved.append("The conda environment name was not declared and still needs confirmation.")
    elif env_file.name == "requirements.txt":
        append_venv_flow(setup_commands, f"python -m pip install -r {rel_env_file}")
        notes.append("Fell back to a virtualenv plus requirements installation plan.")
    elif env_file.name == "pyproject.toml":
        append_venv_flow(setup_commands, "python -m pip install -e .")
        notes.append("Detected a pyproject-based installation flow.")
    elif env_file.name == "setup.py":
        append_venv_flow(setup_commands, "python -m pip install -e .")
        notes.append("Detected a setup.py-based editable install flow.")

    return {
        "environment_file": rel_env_file,
        "environment_name": env_name,
        "setup_commands": setup_commands,
        "setup_notes": notes,
        "unresolved_setup_risks": unresolved,
    }


def main() -> int:
    parser = argparse.ArgumentParser(description="Create a conservative environment setup plan.")
    parser.add_argument("--repo", required=True, help="Path to the target repository.")
    parser.add_argument("--json", action="store_true", help="Emit JSON output.")
    args = parser.parse_args()

    repo = Path(args.repo).resolve()
    payload = build_setup_commands(repo)
    text = json.dumps(payload, indent=2, ensure_ascii=False)
    print(text)
    return 0


if __name__ == "__main__":
    raise SystemExit(main())

#!/usr/bin/env python3
"""Prepare a conservative asset manifest for reproduction work."""

from __future__ import annotations

import argparse
import json
import re
from pathlib import Path
from typing import Dict, List


COMMON_ASSET_DIRS = ["datasets", "data", "checkpoints", "weights", "cache", ".cache"]
KEYWORDS = ("checkpoint", "weight", "dataset", "cache", "model", "download")
URL_RE = re.compile(r"https?://\S+")
PATH_RE = re.compile(r"[\w./-]+\.(?:ckpt|pth|pt|bin|safetensors|zip|tar|gz|json|yaml)")


def first_existing(root: Path, names: List[str]) -> Path | None:
    for name in names:
        candidate = root / name
        if candidate.exists():
            return candidate
    return None


def collect_text_hints(repo: Path) -> List[Dict[str, str]]:
    hints: List[Dict[str, str]] = []
    readme = first_existing(repo, ["README.md", "README"])
    if readme:
        text = readme.read_text(encoding="utf-8", errors="replace")
        for line in text.splitlines():
            lowered = line.lower()
            if not any(keyword in lowered for keyword in KEYWORDS):
                continue
            urls = URL_RE.findall(line)
            paths = PATH_RE.findall(line)
            if not urls and not paths:
                continue
            hints.append(
                {
                    "source": str(readme.resolve()),
                    "line": line.strip(),
                    "urls": ", ".join(urls) if urls else "",
                    "paths": ", ".join(paths) if paths else "",
                }
            )

    for directory in ["configs", "config"]:
        config_root = repo / directory
        if not config_root.exists():
            continue
        for path in config_root.rglob("*"):
            if not path.is_file() or path.suffix.lower() not in {".py", ".yaml", ".yml", ".json", ".toml"}:
                continue
            text = path.read_text(encoding="utf-8", errors="replace")
            if not any(keyword in text.lower() for keyword in KEYWORDS):
                continue
            matches = PATH_RE.findall(text)
            urls = URL_RE.findall(text)
            if not matches and not urls:
                continue
            hints.append(
                {
                    "source": str(path.resolve()),
                    "line": "config hint",
                    "urls": ", ".join(urls[:3]) if urls else "",
                    "paths": ", ".join(matches[:5]) if matches else "",
                }
            )
    return hints


def prepare_assets(repo: Path, assets_root: Path) -> Dict[str, object]:
    assets_root.mkdir(parents=True, exist_ok=True)
    manifest: List[Dict[str, str]] = []

    for name in COMMON_ASSET_DIRS:
        repo_candidate = repo / name
        manifest.append(
            {
                "asset_group": name,
                "source_hint": str(repo_candidate.resolve()) if repo_candidate.exists() else "not found in repo",
                "target_path": str((assets_root / name).resolve()),
                "status": "present" if repo_candidate.exists() else "missing",
            }
        )

    return {
        "repo_path": str(repo.resolve()),
        "assets_root": str(assets_root.resolve()),
        "manifest": manifest,
        "text_hints": collect_text_hints(repo),
    }


def main() -> int:
    parser = argparse.ArgumentParser(description="Create a conservative asset manifest.")
    parser.add_argument("--repo", required=True, help="Path to the target repository.")
    parser.add_argument("--assets-root", default="artifacts/assets", help="Directory where prepared assets should live.")
    parser.add_argument(
        "--output-json",
        default="artifacts/assets/asset_manifest.json",
        help="Path to write the manifest JSON.",
    )
    args = parser.parse_args()

    repo = Path(args.repo).resolve()
    assets_root = Path(args.assets_root).resolve()
    output_json = Path(args.output_json).resolve()
    output_json.parent.mkdir(parents=True, exist_ok=True)

    data = prepare_assets(repo, assets_root)
    output_json.write_text(json.dumps(data, indent=2, ensure_ascii=False), encoding="utf-8")
    print(json.dumps(data, indent=2, ensure_ascii=False))
    return 0


if __name__ == "__main__":
    raise SystemExit(main())

Related skills

Microsoft FoundryDeploy, evaluate, and continuously improve Microsoft Foundry agents from a single agent interface.478k1.3k

Ai Research ReproductionOrchestrate trustworthy, auditable reproduction of deep learning repositories directly from their READMEs.164k507

Run TrainSafely execute selected deep learning training commands with standardized evidence capture.164k507

Explore RunSafely run isolated exploratory experiments with clear recording and conservative selection before committing changes.164k507

Paper Context ResolverFetch precise reproduction-critical details like dataset splits, preprocessing steps, or evaluation protocols from the original academic paper when the repo README leav141k507

Repo Intake And PlanScan unfamiliar AI research repositories and receive a minimal, trustworthy reproduction target before investing significant time.140k507

Forks & variants (3)

Env And Assets Bootstrap has 3 known copies in the catalog totaling 176k installs. They canonicalize to this original listing.

lllllllama - 176k installs
lllllllama - 29 installs
lllllllama - 11 installs

How it compares

Pre-run ML reproduction setup planner, not full pipeline orchestration or results reporting.

FAQ

Who is env-and-assets-bootstrap for?

Developers reproducing ML paper repos who need conda and asset planning from README instructions.

When should I use env-and-assets-bootstrap?

Before first run when checkpoints, datasets, or conda translation from README steps are uncertain.

Is env-and-assets-bootstrap safe to install?

Review the Security Audits panel on this page before installing in production.

Data Science & MLanalyticspipelines

Env And Assets Bootstrap

About

Env And Assets Bootstrap by the numbers

env-and-assets-bootstrap capabilities & compatibility

What env-and-assets-bootstrap says it does

Add your badge

How do I conservatively prepare conda, checkpoints, and datasets before attempting to reproduce a deep learning paper repo?

Who is it for?

When should I use this skill?

What you get

By the numbers

Files

env-and-assets-bootstrap

When to apply

When not to apply

Clear boundaries

Input expectations

Output expectations

Notes

Assets Policy

Goal

Order of evidence

Behavior

Common asset groups

Reporting

Environment Policy

Default preference

Order of trust

OS guidance

Dependency handling

Out of scope by default

Related skills

Forks & variants (3)

How it compares

FAQ

Who is env-and-assets-bootstrap for?

When should I use env-and-assets-bootstrap?

Is env-and-assets-bootstrap safe to install?

About

Env And Assets Bootstrap by the numbers

env-and-assets-bootstrap capabilities & compatibility

What env-and-assets-bootstrap says it does

Add your badge

How do I conservatively prepare conda, checkpoints, and datasets before attempting to reproduce a deep learning paper repo?

Who is it for?

When should I use this skill?

What you get

By the numbers

Files

env-and-assets-bootstrap

When to apply

When not to apply

Clear boundaries

Input expectations

Output expectations

Notes

Assets Policy

Goal

Order of evidence

Behavior

Common asset groups

Reporting

Environment Policy

Default preference

Order of trust

OS guidance

Dependency handling

Out of scope by default

Related skills

Forks & variants (3)

How it compares

FAQ

Who is env-and-assets-bootstrap for?

When should I use env-and-assets-bootstrap?

Is env-and-assets-bootstrap safe to install?

This week in AI coding