
Eval Recipes Runner
Run standardized evaluation recipes against LLM prompts, agents, and tool chains to catch regressions, compare model versions, and gate releases with repeatable quality checks.
npx skills add https://github.com/rysweet/amplihack --skill eval-recipes-runner

| Installs | 119 |
|---|---|
| Repository | rysweet/amplihack |