
LLM Obs Eval Pipeline
Implement an automated Datadog LLM evaluation pipeline that ingests production or staging traces, runs scorers, stores results, and surfaces regressions on each model or prompt change.
npx skills add https://github.com/datadog-labs/agent-skills --skill llm-obs-eval-pipeline

| Installs | 217 |
|---|---|
| Repository | datadog-labs/agent-skills |
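
The description above names four stages: ingest traces, run scorers, store results, and surface regressions when the model or prompt changes. The following is a minimal sketch of that flow in plain Python, not the skill's actual implementation. It makes no Datadog API calls. The `Trace` record, the two toy scorers, the in-memory results dictionary, and the `tolerance` threshold are all hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Dict, List


@dataclass
class Trace:
    """Hypothetical, simplified trace record for a single LLM call."""
    prompt: str
    response: str
    version: str  # model or prompt version that produced the response


# A scorer maps a trace to a score in [0, 1]. Both scorers are toy examples.
Scorer = Callable[[Trace], float]


def non_empty(trace: Trace) -> float:
    """Score 1.0 when the response contains any non-whitespace text."""
    return 1.0 if trace.response.strip() else 0.0


def concise(trace: Trace, max_words: int = 50) -> float:
    """Score 1.0 when the response has at most `max_words` words."""
    return 1.0 if len(trace.response.split()) <= max_words else 0.0


SCORERS: Dict[str, Scorer] = {"non_empty": non_empty, "concise": concise}


def evaluate(traces: List[Trace]) -> Dict[str, Dict[str, float]]:
    """Run every scorer on every trace; return mean score per version and scorer."""
    collected: Dict[str, Dict[str, List[float]]] = {}
    for trace in traces:
        per_version = collected.setdefault(trace.version, {})
        for name, scorer in SCORERS.items():
            per_version.setdefault(name, []).append(scorer(trace))
    # The returned dict stands in for a results store (assumption: in-memory only).
    return {
        version: {name: mean(values) for name, values in scores.items()}
        for version, scores in collected.items()
    }


def find_regressions(
    results: Dict[str, Dict[str, float]],
    baseline: str,
    candidate: str,
    tolerance: float = 0.05,
) -> List[str]:
    """Return scorer names whose mean dropped by more than `tolerance`."""
    return [
        name
        for name, base_score in results[baseline].items()
        if results[candidate].get(name, 0.0) < base_score - tolerance
    ]


# Hand-written sample traces standing in for ingested production or staging data.
traces = [
    Trace("Summarize the ticket", "User cannot log in.", "v1"),
    Trace("Summarize the log", "Disk is full.", "v1"),
    Trace("Summarize the ticket", "", "v2"),
    Trace("Summarize the log", "Disk is full.", "v2"),
]
results = evaluate(traces)
regressions = find_regressions(results, baseline="v1", candidate="v2")
print(regressions)  # prints ['non_empty']
```

In this sketch the empty `v2` response halves the `non_empty` mean, so that scorer is flagged while `concise` is not. A real pipeline would presumably replace the sample list with traces pulled from the observability backend and persist the per-version means instead of keeping them in memory.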