Poster "large multi-modality models" Papers
3 papers found
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Haoning Wu, Zicheng Zhang, Weixia Zhang et al.
ICML 2024poster
Towards Open-ended Visual Quality Comparison
Haoning Wu, Hanwei Zhu, Zicheng Zhang et al.
ECCV 2024posterarXiv:2402.16641
93
citations
Using Left and Right Brains Together: Towards Vision and Language Planning
Jun CEN, Chenfei Wu, Xiao Liu et al.
ICML 2024poster