NUS
Optimal Clustering with Bandit Feedback
Vincent Y. F. Tan
Adaptivity and Confounding in Multi-Armed Bandit Experiments
Daniel Russo
DeepMind
Reinforcement Learning, Bit by Bit
Xiuyuan (Lucy) Lu
Stanford
Provable Model-based Nonlinear Bandit and Reinforcement Learning
Tengyu Ma
Stanford
Diffusion Asymptotics for Sequential Experiments
Kuang Xu