"cross-modal interactions" Papers
2 papers found
Explaining Similarity in Vision-Language Encoders with Weighted Banzhaf Interactions
Hubert Baniecki, Maximilian Muschalik, Fabian Fumagalli et al.
NEURIPS 2025posterarXiv:2508.05430
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
Qi Qin, Le Zhuo, Yi Xin et al.
ICCV 2025posterarXiv:2503.21758
55
citations