2025 Poster "dynamic benchmark generation" Papers
2 papers found
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Yue Yang, Shuibo Zhang, Kaipeng Zhang et al.
ICLR 2025, poster, arXiv:2410.08695
15 citations
ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models
Yahan Tu, Rui Hu, Jitao Sang
CVPR 2025, poster, arXiv:2409.09318
3 citations