Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets

NeurIPS 2020