"behavior alignment" Papers
2 papers found
Bootstrap Off-policy with World Model
Guojian Zhan, Likun Wang, Xiangteng Zhang et al.
NeurIPS 2025posterarXiv:2511.00423
1
citations
Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization
Jian-Ting Guo, Yu-Cheng Chen, Ping-Chun Hsieh et al.
NeurIPS 2025posterarXiv:2511.15055