Weighted Bellman Backups for Improved Signal-to-Noise in Q-Updates

NeurIPS 2020