Minimax Value Interval for Off-policy Evaluation and Policy Optimization