Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization

NeurIPS 2020