PoWER-BERT: Accelerating BERT Inference via Progressive Word-vector Elimination