
Vlm
Integrate vision-language models into agent pipelines for image understanding, OCR, UI screenshot analysis, and multimodal tool routing.
npx skills add https://github.com/answerzhao/agent-skills --skill vlm| Installs | 313 |
|---|---|
| Repository | answerzhao/agent-skills ↗ |