Spotlight "vision-language tasks" Papers
2 papers found
Head Pursuit: Probing Attention Specialization in Multimodal Transformers
Lorenzo Basile, Valentino Maiorca, Diego Doimo et al.
NeurIPS 2025spotlightarXiv:2510.21518
2
citations
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang, Chen-Wei Xie, Haiyang Wang et al.
NeurIPS 2025spotlightarXiv:2503.01342
14
citations