Reinforcement learning that optimizes for what real people want