MCTS as Regularized Policy Optimization

ICML 2020