Nino Vieillard
5
Papers
57
Total Citations
Papers (5)
BOND: Aligning LLMs with Best-of-N Distillation
ICLR 2025
50
citations
Loss Functions and Operators Generated by f-Divergences
ICML 2025
7
citations
WARM: On the Benefits of Weight Averaged Reward Models
ICML 2024
0
citations
Munchausen Reinforcement Learning
NeurIPS 2020
0
citations
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning
NeurIPS 2020
0
citations