Decoding Emotions: How Temporal Modelling Enhances Recognition Accuracy