
Hamelsmu Evals Skills
Design and run LLM eval suites—datasets, rubrics, regression baselines, and failure triage—before shipping agent or prompt changes to production.
/plugin marketplace add hamelsmu/evals-skills| GitHub stars | ★ 1.4k |
|---|---|
| Repository | hamelsmu/evals-skills ↗ |
Plugins in this marketplace
1 plugin - install individually after you add the marketplace.