Categories
Berkeley
CMU
Deepmind
NUS
Stanford
Theory
Toronto
University of Washington
Upcoming
2025
Building RLHF around psychological models of human preference
2024
Computing and Planning with Large Generative Models
Challenges in Scalable Training Data Attribution
Recent Advances in Average-Reward Restless Bandits
2023
Continual Subtask Learning
Reinforcement Learning from Static Datasets Algorithms, Analysis and Applications
Insights into Intelligence
2022
Understanding Information-Directed Sampling, When and How to Use It?
Towards Instance-Optimal Algorithms for Reinforcement Learning
Epistemic Neural Networks
Optimal Clustering with Bandit Feedback
Adaptivity and Confounding in Multi-Armed Bandit Experiments
2021
Reinforcement Learning, Bit by Bit
Provable Model-based Nonlinear Bandit and Reinforcement Learning
Diffusion Asymptotics for Sequential Experiments
Lectures on Information Directed Sampling