Xian Li
Papers: 6
Total Citations: 111
Papers (6)
Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge
ICML 2025
53 citations
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions
NeurIPS 2025
46 citations
Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
CVPR 2025
10 citations
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
NeurIPS 2025
2 citations
MEMORYLLM: Towards Self-Updatable Large Language Models
ICML 2024
0 citations
Self-Rewarding Language Models
ICML 2024
0 citations