Estimating Mutual Information Between Dense Word Embeddings