Representations for Stable Off-Policy RL