Can Wikipedia Categories Improve Masked Language Model Pretraining?

ACL 2020