Accountable Off-Policy Evaluation via a Kernelized Bellman Statistics

ICML 2020