A Doubly Robust Approach to Sparse Reinforcement Learning
Improved Algorithms for Multi-Class Multi-Period Packing Problems with Bandit Feedback
Double Doubly Robust Thompson Sampling with Generalized Linear Payoffs
Doubly Robust Thompson Sampling with Linear Payoffs