How does BERT's attention change when you fine-tune? An analysis methodology and a case study in negation scope

ACL 2020