by Mohan Jiang Papers
2 papers found
Conference
MAC: A Live Benchmark for Multimodal Large Language Models in Scientific Understanding
Mohan Jiang, Jin Gao, Jiahao Zhan et al.
COLM 2025paperarXiv:2508.15802
3
citations
PersonaEval: Are LLM Evaluators Human Enough to Judge Role-Play?
Lingfeng Zhou, Jialing Zhang, Jin Gao et al.
COLM 2025paperarXiv:2508.10014
5
citations