Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving

23citations

arXiv:2409.18343 PDF

Citations

#315

in ECCV 2024

of 2387 papers

Authors

Data Points

Authors

Zhenghao Peng Wenjie Luo Yiren Lu Tianyi Shen Cole Gulino Ari Seff Justin Fu

Topics

autonomous driving agent behavior modeling reinforcement learning fine-tuning distribution shift motion forecasting simulation evaluation collision rate reduction closed-loop training

Abstract

A major challenge in autonomous vehicle research is modeling agent behaviors, which has critical applications including constructing realistic and reliable simulations for off-board evaluation and forecasting traffic agents motion for onboard planning. While supervised learning has shown success in modeling agents across various domains, these models can suffer from distribution shift when deployed at test-time. In this work, we improve the reliability of agent behaviors by closed-loop fine-tuning of behavior models with reinforcement learning. Our method demonstrates improved overall performance, as well as improved targeted metrics such as collision rate, on the Waymo Open Sim Agents challenge. Additionally, we present a novel policy evaluation benchmark to directly assess the ability of simulated agents to measure the quality of autonomous vehicle planners and demonstrate the effectiveness of our approach on this new benchmark.

Citation History

Jan 26, 2026

Jan 31, 2026