Bayesian Hierarchical Words Representation Learning