Exploring the Limits of Simple Learners in Knowledge Distillation for Document Classification with DocBERT

ACL 2020