"kl regularization" Papers
3 papers found
Last Iterate Convergence in Monotone Mean Field Games
Noboru Isobe, Kenshi Abe, Kaito Ariu
NeurIPS 2025posterarXiv:2410.05127
Preference Learning with Lie Detectors can Induce Honesty or Evasion
Chris Cundy, Adam Gleave
NeurIPS 2025posterarXiv:2505.13787
4
citations
Iterative Regularized Policy Optimization with Imperfect Demonstrations
Xudong Gong, Feng Dawei, Kele Xu et al.
ICML 2024poster