Ge Zhang

17
Papers
615
Total Citations

Papers (17)

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models

ICLR 2025
135
citations

UniIR: Training and Benchmarking Universal Multimodal Information Retrievers

ECCV 2024
127
citations

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

AAAI 2025
99
citations

Training Socially Aligned Language Models on Simulated Social Interactions

ICLR 2024
88
citations

General-Reasoner: Advancing LLM Reasoning Across All Domains

NeurIPS 2025
74
citations

OmniBench: Towards The Future of Universal Omni-Language Models

NeurIPS 2025
51
citations

McEval: Massively Multilingual Code Evaluation

ICLR 2025
28
citations

Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment

ICML 2025
9
citations

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

NeurIPS 2025
4
citations

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models

ICCV 2025
0
citations

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

ICCV 2025
0
citations

Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP

AAAI 2025
0
citations

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

CVPR 2024
0
citations

Improving Depth Completion via Depth Feature Upsampling

CVPR 2024
0
citations

LRRU: Long-short Range Recurrent Updating Networks for Depth Completion

ICCV 2023
0
citations

Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits

NeurIPS 2022
0
citations

MARBLE: Music Audio Representation Benchmark for Universal Evaluation

NeurIPS 2023
0
citations