Learning Probabilistic Sentence Representations from Paraphrases