T. Sandholm
5
Papers
92
Total Citations
Papers (5)
Confronting Reward Model Overoptimization with Constrained RLHF
ICLR 2024
73
citations
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
ICLR 2024
12
citations
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence beyond the Minty Property
AAAI 2024, arXiv
3
citations
The Complexity of Symmetric Equilibria in Min-Max Optimization and Team Zero-Sum Games
NeurIPS 2025, arXiv
2
citations
Mediator Interpretation and Faster Learning Algorithms for Linear Correlated Equilibria in General Sequential Games
ICLR 2024
2
citations