ICCV "text-image alignment" Papers
3 papers found
LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs
Jiarui Wang, Huiyu Duan, Yu Zhao et al.
ICCV 2025highlightarXiv:2504.08358
15
citations
Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
Shijie Zhou, Ruiyi Zhang, Huaisheng Zhu et al.
ICCV 2025posterarXiv:2507.21391
6
citations
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
Zhengyao Lyu, Tianlin Pan, Chenyang Si et al.
ICCV 2025posterarXiv:2506.07986
5
citations