
Model Evaluator
Compare candidate LLM or ML models on accuracy, latency, cost, and failure modes before committing to one model in a prototype or production architecture.
npx skills add https://github.com/jmsktm/claude-settings --skill model-evaluator| Installs | 172 |
|---|---|
| Repository | jmsktm/claude-settings ↗ |