Most Cited 2025 Spotlight Papers by Tim Rocktaeschel
2 papers found
Conference
#1
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri, Bartłomiej Cupiał, Samuel Coward et al.
ICLR 2025, arXiv:2411.13543
74 citations
#2
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Laura Ruis, Maximilian Mozes, Juhan Bae et al.
ICLR 2025, arXiv:2411.12580
28 citations