Learning to Score Behaviors for Guided Policy Optimization

ICML 2020