Paper "agents" Papers
3 papers found
Conference
DoomArena: A framework for Testing AI Agents Against Evolving Security Threats
Léo Boisvert, Abhay Puri, Gabriel Huang et al.
COLM 2025paperarXiv:2504.14064
17
citations
Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?
Anthony GX-Chen, Dongyan Lin, Mandana Samiei et al.
COLM 2025paper
2
citations
Plancraft: an evaluation dataset for planning with LLM agents
Gautier Dagan, Frank Keller, Alex Lascarides
COLM 2025paperarXiv:2412.21033
6
citations