Alekh Agarwal
5
Papers
92
Total Citations
Papers (5)
Theoretical guarantees on the best-of-n alignment policy
ICML 2025
89
citations
Design Considerations in Offline Preference-based RL
ICML 2025
3
citations
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
ICML 2024
0
citations
The Non-linear $F$-Design and Applications to Interactive Learning
ICML 2024
0
citations
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
ICML 2024
0
citations