2025 "multimodal benchmarks" Papers
4 papers found
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights
Zhiyuan Liang, Dongwen Tang, Yuhao Zhou et al.
NeurIPS 2025, poster, arXiv:2506.16406
3 citations
Learning to Instruct for Visual Instruction Tuning
Zhihan Zhou, Feng Hong, Jiaan Luo et al.
NeurIPS 2025, poster, arXiv:2503.22215
3 citations
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs
Haoran Lou, Chunxiao Fan, Ziyan Liu et al.
ICCV 2025, poster, arXiv:2507.00505
MMCR: Benchmarking Cross-Source Reasoning in Scientific Papers
Yang Tian, Zheng Lu, Mingqi Gao et al.
ICCV 2025, poster, arXiv:2503.16856
2 citations