Designing Optimal Dynamic Treatment Regimes: A Causal RL Approach