2 pages
Stanford
Provable Model-based Nonlinear Bandit and Reinforcement Learning
Diffusion Asymptotics for Sequential Experiments