NeurIPS "vision-language tasks" Papers
3 papers found
Gatekeeper: Improving Model Cascades Through Confidence Tuning
Stephan Rabanser, Nathalie Rauschmayr, Achin Kulshrestha et al.
NeurIPS 2025posterarXiv:2502.19335
4
citations
Head Pursuit: Probing Attention Specialization in Multimodal Transformers
Lorenzo Basile, Valentino Maiorca, Diego Doimo et al.
NeurIPS 2025spotlightarXiv:2510.21518
2
citations
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang, Chen-Wei Xie, Haiyang Wang et al.
NeurIPS 2025spotlightarXiv:2503.01342
14
citations