NEURIPS Poster "llm-based agents" Papers
5 papers found
Agentic Plan Caching: Test-Time Memory for Fast and Cost-Efficient LLM Agents
Qizheng Zhang, Michael Wornow, Kunle Olukotun
NEURIPS 2025posterarXiv:2506.14852
7
citations
MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants
Zeyu Zhang, Quanyu Dai, Luyu Chen et al.
NEURIPS 2025posterarXiv:2409.20163
13
citations
PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors
Yimeng Chen, Piotr Piękos, Mateusz Ostaszewski et al.
NEURIPS 2025posterarXiv:2507.15550
2
citations
SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based Agents
Yifu Guo, Jiaye Lin, Huacan Wang et al.
NEURIPS 2025posterarXiv:2508.02085
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents
Ibragim Badertdinov, Alexander Golubev, Maksim Nekrashevich et al.
NEURIPS 2025posterarXiv:2505.20411
25
citations