Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits

ICML 2020