An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods

NeurIPS 2020