Ge Zhang
17 Papers
615 Total Citations
Papers (17)
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models
ICLR 2025
135 citations
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
ECCV 2024
127 citations
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering
AAAI 2025
99 citations
Training Socially Aligned Language Models on Simulated Social Interactions
ICLR 2024
88 citations
General-Reasoner: Advancing LLM Reasoning Across All Domains
NeurIPS 2025
74 citations
OmniBench: Towards The Future of Universal Omni-Language Models
NeurIPS 2025
51 citations
McEval: Massively Multilingual Code Evaluation
ICLR 2025
28 citations
Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment
ICML 2025
9 citations
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
NeurIPS 2025
4 citations
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
ICCV 2025
0 citations
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
ICCV 2025
0 citations
Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP
AAAI 2025
0 citations
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024
0 citations
Improving Depth Completion via Depth Feature Upsampling
CVPR 2024
0 citations
LRRU: Long-short Range Recurrent Updating Networks for Depth Completion
ICCV 2023
0 citations
Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits
NeurIPS 2022
0 citations
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
NeurIPS 2023
0 citations