Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets