Combining Subword Representations into Word-level Representations in the Transformer Architecture

ACL 2020