2025 Oral "visual grounding" Papers
2 papers found
Dual-Stage Value-Guided Inference with Margin-Based Reward Adjustment for Fast and Faithful VLM Captioning
Ankan Deria, Adinath Dukre, feilong tang et al.
NeurIPS 2025oralarXiv:2506.15649
ESCA: Contextualizing Embodied Agents via Scene-Graph Generation
Jiani Huang, Amish Sethi, Matthew Kuo et al.
NeurIPS 2025oralarXiv:2510.15963