Inferring DQN structure for high-dimensional continuous control