On Training Effective Reinforcement Learning Agents for Real-time Power Grid Operation and Control