"tool use" Papers
3 papers found
Conference
Don’t lie to your friends: Learning what you know from collaborative self-play
Jacob Eisenstein, Reza Aghajani, Adam Fisch et al.
COLM 2025
5 citations
Plancraft: an evaluation dataset for planning with LLM agents
Gautier Dagan, Frank Keller, Alex Lascarides
COLM 2025, arXiv:2412.21033
6 citations
ThoughtTerminator: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models
Xiao Pu, Michael Saxon, Wenyue Hua et al.
COLM 2025, arXiv:2504.13367
23 citations