A Call for More Rigor in Unsupervised Cross-lingual Learning