BERT Loses Patience: Fast and Robust Inference with Early Exit

NeurIPS 2020