On The Evaluation of Machine Translation Systems Trained With Back-Translation