An Operator View of Policy Gradients

ICML 2020