
Esm
Call Biohub-hosted ESMFold2 structure prediction via the EvolutionaryScale SDK with correct auth, installs, and Forge client patterns.
Overview
esm is an agent skill for the Build phase that guides Biohub authentication, SDK installs, and ESMFold2 structure prediction through the EvolutionaryScale forge client APIs.
Install
npx skills add https://github.com/k-dense-ai/scientific-agent-skills --skill esmWhat is this skill?
- Clarifies Forge-to-Biohub migration and when to use biohub.ai for ESMFold2
- ESM_API_KEY from Biohub developer console; never commit keys to git
- Dual install paths: Biohub git pin (e.g. commit c94ed8d) vs PyPI esm==3.2.3 for ESM3/ESM C
- ESMFold2 via SequenceStructureForgeInferenceClient on Biohub API host
- Explicit warning to confirm install source before mixing Forge/Biohub clients in one env
- ESMFold2 is described as built on ESMC 6B in the SKILL.md overview
- PyPI path documents esm==3.2.3 for ESM3/ESM C workflows
Adoption & trust: 514 installs on skills.sh; 27.6k GitHub stars; 2/3 security scanners passed (skills.sh audits).
What problem does it solve?
Hosted structure prediction docs mention both Forge and Biohub, and your agent risks wrong endpoints, mixed package installs, or leaked API keys.
Who is it for?
Indie builders adding protein structure prediction to agents, APIs, or internal bio tools on EvolutionaryScale/Biohub.
Skip if: Teams with no molecular biology use case or who only need generic LLM chat without structure APIs.
When should I use this skill?
When structure prediction should use Biohub/ESMFold2 or when docs reference biohub.ai instead of forge.evolutionaryscale.ai.
What do I get? / Deliverables
After the skill runs, the agent uses ESM_API_KEY correctly, picks the right install source, and invokes ESMFold2 via the documented inference client against Biohub.
- Authenticated client setup snippet for ESMFold2 inference
- Install commands with pinned versions for reproducible environments
Recommended Skills
Journey fit
Structure prediction is an integration task once you are building features that call hosted biology models, not early ideation. ESM_API_KEY, Biohub endpoints, and SequenceStructureForgeInferenceClient are external API wiring—classic integrations subphase.
How it compares
Hosted-model SDK integration skill, not a local open-weight weight runner or a Prism MCP catalog entry.
Common Questions / FAQ
Who is esm for?
Solo and indie builders wiring EvolutionaryScale structure prediction—especially ESMFold2 on Biohub—into Python agents or backend services.
When should I use esm?
Use it in Build integrations when upstream docs reference biohub.ai, you need ESMFold2 all-atom prediction, or you must pin esm from GitHub versus PyPI without mixing clients.
Is esm safe to install?
The skill requires API secrets; review the Security Audits panel on this page and keep keys in environment variables only, never in committed notebooks.
SKILL.md
READMESKILL.md - Esm
# Biohub Platform and ESMFold2 ## Overview EvolutionaryScale is migrating hosted inference from [Forge](https://forge.evolutionaryscale.ai) to the [Biohub platform](https://biohub.ai). The Python SDK still uses `esm.sdk.forge` client classes and "Forge" naming in many places, but newer structure-prediction APIs (ESMFold2) run on Biohub endpoints. Use this reference when you need **all-atom structure prediction** (ESMFold2) or when upstream docs point to `biohub.ai` instead of `forge.evolutionaryscale.ai`. ## Authentication Create API keys in the [Biohub developer console](https://biohub.ai/developer-console/api-keys). Store the key in `ESM_API_KEY` (same env var used by `esm.sdk.client()` on Forge). ```python import os token = os.environ["ESM_API_KEY"] ``` Never commit API keys or paste them into notebooks checked into git. ## Installation For ESMFold2 and latest Biohub SDK features, upstream may recommend installing from the Biohub GitHub repo (pin a specific commit for reproducibility): ```bash uv pip install "esm@git+https://github.com/Biohub/esm.git@c94ed8d" ``` For ESM3/ESM C workflows on PyPI, `uv pip install "esm==3.2.3"` remains the standard path. Confirm which install source your task requires before mixing both in one environment. ## ESMFold2 Structure Prediction ESMFold2 is a structure prediction model built on ESMC 6B, available through `SequenceStructureForgeInferenceClient` with Biohub as the API host. ```python import os from esm.sdk.forge import SequenceStructureForgeInferenceClient from esm.sdk.api import FoldingConfig from esm.utils.structure.input_builder import ProteinInput, StructurePredictionInput client = SequenceStructureForgeInferenceClient( model="esmfold2-fast-2026-05", url="https://biohub.ai", token=os.environ["ESM_API_KEY"], ) sequence = "MSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTFSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK" fold_input = StructurePredictionInput( sequences=[ProteinInput(id="A", sequence=sequence)] ) config = FoldingConfig(num_loops=3, num_sampling_steps=32) result = client.fold_all_atom(fold_input, config=config) with open("result.cif", "w") as f: f.write(result.complex.to_mmcif()) ``` ### Model IDs | Model ID | Use case | |----------|----------| | `esmfold2-fast-2026-05` | Fast single-sequence folding | | Check Biohub docs for additional variants | MSA-augmented or higher-accuracy modes | ## Relationship to Forge (ESM3 / ESM C) | Capability | Typical endpoint | Client | |------------|------------------|--------| | ESM3 generation | `https://forge.evolutionaryscale.ai` | `esm.sdk.client()` or `ESM3ForgeInferenceClient` | | ESM C 6B embeddings (hosted) | Forge | `ESM3ForgeInferenceClient` with `esmc-6b-2024-12` | | ESMFold2 structure prediction | `https://biohub.ai` | `SequenceStructureForgeInferenceClient` | For ESM3 and ESM C cloud usage patterns, see `forge-api.md`. For local open-weight models, see `esm3-api.md` and `esm-c-api.md`. ## Additional Resources - **Biohub:** https://biohub.ai - **Biohub/esm repository:** https://github.com/Biohub/esm - **Tutorials:** https://github.com/Biohub/esm/tree/main/cookbook/tutorials - **ESMC & ESMFold2 preprint:** https://biohub.ai/papers/esm_protein.pdf # ESM C API Reference ## Overview ESM C (Cambrian) is a family of protein language models optimized for representation learning and efficient embedding generation. Designed as a drop-in replacement for ESM2, ESM C provides significant improvements in speed and quality across all model sizes. ## Model Architecture **ESM C Family Models:** | Model ID | Parameters | Layers | Best For | |----------|-----------|--------|----------| | `esmc-300m` | 300M | 30 | Fast inference, lightweight applications | | `esmc-600m` | 600M | 36 | Balanced performance and quality | | `esmc-6b-2024-12` | 6B | 80 |