A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis