Poster "zero-shot grounding" Papers
2 papers found
iFinder: Structured Zero-Shot Vision-Based LLM Grounding for Dash-Cam Video Reasoning
Manyi Yao, Bingbing Zhuang, Sparsh Garg et al.
NEURIPS 2025posterarXiv:2509.19552
1
citations
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Kevin Qinghong Lin, Linjie Li, Difei Gao et al.
CVPR 2025posterarXiv:2411.17465
128
citations