
Glmv Grounding
Implement GLM-V vision grounding to map natural-language references to image regions, enabling multimodal agents with spatial visual understanding.
npx skills add https://github.com/modelscope.cn --skill glmv-grounding| Installs | 188 |
|---|---|
| Repository | modelscope.cn ↗ |