Adaptivity and Confounding in Multi-Armed Bandit Experiments
Daniel Russo
Deepmind
Reinforcement Learning, Bit by Bit
Xiuyuan (Lucy) Lu
Stanford
Provable Model-based Nonlinear Bandit and Reinforcement Learning
Tengyu Ma
Stanford
Diffusion Asymptotics for Sequential Experiments
Kuang Xu
Deepmind
Lectures on Information Directed Sampling
Tor Lattimore
1
2
3