Efficient continuous-action contextual bandits (via extreme classification)

ICML 2020