NEURIPS Oral "llm-based agents" Papers
2 papers found
Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia
Chandler Smith, Marwa Abdulhai, Manfred Díaz et al.
NEURIPS 2025, oral, arXiv:2512.03318
4 citations
WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch
Zimu Lu, Yunqiao Yang, Houxing Ren et al.
NEURIPS 2025, oral, arXiv:2505.03733
16 citations