
Deepeval
Run DeepEval-style LLM evaluations—metrics, test cases, regressions, and CI checks—before releasing chatbots, RAG apps, or agent features to production users.
npx skills add https://github.com/confident-ai/deepeval --skill deepeval| Installs | 696 |
|---|---|
| Repository | confident-ai/deepeval ↗ |