Historical perspective on extreme classification in language modeling