2025 Poster "benchmark evaluation suite" Papers
2 papers found
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Fanqing Meng, Jin Wang, Chuanhao Li et al.
ICLR 2025posterarXiv:2408.02718
48
citations
MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans
Shubhankar Borse, Seokeon Choi, Sunghyun Park et al.
NEURIPS 2025posterarXiv:2506.20879
2
citations