by Yuting Ning Papers
3 papers found
A Closed-Form Solution for Fast and Reliable Adaptive Testing
Yan Zhuang, Chenye Ke, Zirui Liu et al.
NeurIPS 2025oral
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Boyu Gou, Zanming Huang, Yuting Ning et al.
NeurIPS 2025poster
20
citations
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Ziru Chen, Shijie Chen, Yuting Ning et al.
ICLR 2025poster