2025 by Tianhao Liang Papers
2 papers found
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Jiacheng Chen, Tianhao Liang, Sherman Siu et al.
ICLR 2025posterarXiv:2410.10563
30
citations
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
Xeron Du, Yifan Yao, Kaijing Ma et al.
NEURIPS 2025posterarXiv:2502.14739
118
citations