by Sumeet Motwani Papers
2 papers found
REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites
Div Garg, Diego Caples, Andis Draguns et al.
NeurIPS 2025posterarXiv:2504.11543
19
citations
STARC: A General Framework For Quantifying Differences Between Reward Functions
Joar Skalse, Lucy Farnik, Sumeet Motwani et al.
ICLR 2024poster