Most Cited 2025 Poster by Tim Rocktaeschel Papers
2 papers found
Conference
#1
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri, Bartłomiej Cupiał, Samuel Coward et al.
ICLR 2025posterarXiv:2411.13543
70
citations
#2
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Laura Ruis, Maximilian Mozes, Juhan Bae et al.
ICLR 2025posterarXiv:2411.12580
26
citations