Poster "instruction tuning dataset" Papers
2 papers found
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists
Zhiyang Xu, Minqian Liu, Ying Shen et al.
ICLR 2025posterarXiv:2407.03604
8
citations
Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs
Fangrui Zhu, Hanhui Wang, Yiming Xie et al.
NEURIPS 2025posterarXiv:2506.04220