Yuansheng Ni
3
Papers
146
Total Citations
1
Affiliations
Affiliations
University of Waterloo
Papers (3)
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
NeurIPS 2025arXiv
118
citations
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
ICLR 2025arXiv
28
citations
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024arXiv
0
citations