Archives

Categories

Berkeley

CMU

Deepmind

NUS

Stanford

Theory

Toronto

University of Washington

Upcoming

2025

Building RLHF around psychological models of human preference

2024

Computing and Planning with Large Generative Models

Featured image of post Computing and Planning with Large Generative Models

Challenges in Scalable Training Data Attribution

Featured image of post Challenges in Scalable Training Data Attribution

Recent Advances in Average-Reward Restless Bandits

Featured image of post Recent Advances in Average-Reward Restless Bandits

2023

Continual Subtask Learning

Featured image of post Continual Subtask Learning

Reinforcement Learning from Static Datasets Algorithms, Analysis and Applications

Insights into Intelligence

Featured image of post Insights into Intelligence

2022

Understanding Information-Directed Sampling, When and How to Use It?

Towards Instance-Optimal Algorithms for Reinforcement Learning

Epistemic Neural Networks

Optimal Clustering with Bandit Feedback

Adaptivity and Confounding in Multi-Armed Bandit Experiments

2021

Reinforcement Learning, Bit by Bit

Provable Model-based Nonlinear Bandit and Reinforcement Learning

Diffusion Asymptotics for Sequential Experiments

Lectures on Information Directed Sampling