NEURIPS Oral "large language model agents" Papers
2 papers found
Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models
Christopher Chiu, Silviu Pitis, Mihaela van der Schaar
NEURIPS 2025oralarXiv:2510.10278
TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets
Yuzhe YANG, Yifei Zhang, Minghao Wu et al.
NEURIPS 2025oralarXiv:2502.01506
19
citations