Sinkhorn Divergences: Bridging the gap between Optimal Transport and MMD