"dueling bandits" Papers
5 papers found
Learning Across the Gap: Hybrid Multi-armed Bandits with Heterogeneous Offline and Online Data
Qijia He, Minghan Wang, Xutong Liu et al.
NeurIPS 2025, poster
Non-Stationary Dueling Bandits Under a Weighted Borda Criterion
Joe Suk, Arpit Agarwal
ICLR 2025, poster, arXiv:2403.12950
2 citations
Borda Regret Minimization for Generalized Linear Dueling Bandits
Yue Wu, Tao Jin, Qiwei Di et al.
ICML 2024, poster
Eliciting Kemeny Rankings
Anne-Marie George, Christos Dimitrakakis
AAAI 2024, paper, arXiv:2312.11663
1 citation
Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought
Zhen-Yu Zhang, Siwei Han, Huaxiu Yao et al.
ICML 2024, poster