Distilling Knowledge Learned in BERT for Text Generation

ACL 2020