T. Sandholm

Papers: 5

Total Citations: 92

Papers (5)

Confronting Reward Model Overoptimization with Constrained RLHF

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence beyond the Minty Property

The Complexity of Symmetric Equilibria in Min-Max Optimization and Team Zero-Sum Games

NeurIPS 2025, arXiv

Mediator Interpretation and Faster Learning Algorithms for Linear Correlated Equilibria in General Sequential Games