Agnostic $Q$-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity

NeurIPS 2020