2025 "multi-modal alignment" Papers
2 papers found
HyperET: Efficient Training in Hyperbolic Space for Multi-modal Large Language Models
Zelin Peng, Zhengqin Xu, Qingyang Liu et al.
NeurIPS 2025oralarXiv:2510.20322
1
citations
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
Yijing Lin, Mengqi Huang, Shuhan Zhuang et al.
ICCV 2025posterarXiv:2503.10406
12
citations