Paper "thompson sampling" Papers
3 papers found
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Thomy Phan, Taoan Huang, Bistra Dilkina et al.
AAAI 2024paperarXiv:2312.16767
10
citations
Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs
Tianyuan Jin, Hao-Lun Hsu, William Chang et al.
AAAI 2024paperarXiv:2312.15549
The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models
Jongyeong Lee, Chao-Kai Chiang, Masashi Sugiyama
AAAI 2024paperarXiv:2302.14407