Autonomous navigation of stratospheric balloons using reinforcement learning