Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation (2403.20289v1)
Abstract: Emotion Recognition in Conversation (ERC) involves detecting the underlying emotion behind each utterance within a conversation. Effectively generating representations for utterances remains a significant challenge in this task. Recent works propose various models to address this issue, but they still struggle with differentiating similar emotions such as excitement and happiness. To alleviate this problem, We propose an Emotion-Anchored Contrastive Learning (EACL) framework that can generate more distinguishable utterance representations for similar emotions. To achieve this, we utilize label encodings as anchors to guide the learning of utterance representations and design an auxiliary loss to ensure the effective separation of anchors for similar emotions. Moreover, an additional adaptation process is proposed to adapt anchors to serve as effective classifiers to improve classification performance. Across extensive experiments, our proposed EACL achieves state-of-the-art emotion recognition performance and exhibits superior performance on similar emotions. Our code is available at https://github.com/Yu-Fangxu/EACL.
- Iemocap: Interactive emotional dyadic motion capture database. Language resources and evaluation, 42:335–359.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597–1607. PMLR.
- Simcse: Simple contrastive learning of sentence embeddings. arXiv preprint arXiv:2104.08821.
- Cosmic: Commonsense knowledge for emotion identification in conversations. arXiv preprint arXiv:2010.02795.
- Dialoguegcn: A graph convolutional neural network for emotion recognition in conversation. arXiv preprint arXiv:1908.11540.
- Supervised contrastive learning for pre-trained language model fine-tuning. arXiv preprint arXiv:2011.01403.
- Label confusion learning to enhance text classification models. In Proceedings of the AAAI conference on artificial intelligence, volume 35, pages 12929–12936.
- Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 9729–9738.
- Deberta: Decoding-enhanced bert with disentangled attention. arXiv preprint arXiv:2006.03654.
- Supervised adversarial contrastive learning for emotion recognition in conversations. arXiv preprint arXiv:2306.01505.
- Dialoguecrn: Contextual reasoning networks for emotion recognition in conversations. arXiv preprint arXiv:2106.01978.
- Mmgcn: Multimodal fusion via deep graph convolution network for emotion recognition in conversation. arXiv preprint arXiv:2107.06779.
- Relation-aware graph attention networks with relational position encodings for emotion recognition in conversations. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7360–7370.
- Speaker-aware interactive graph attention network for emotion recognition in conversation. ACM Transactions on Asian and Low-Resource Language Information Processing, 22(12):1–18.
- Improved universal sentence embeddings with prompt-based contrastive learning and energy-based learning. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 3021–3035.
- Cogmen: Contextualized gnn based multimodal emotion recognition. arXiv preprint arXiv:2205.02455.
- Exploring balanced feature spaces for representation learning. In International Conference on Learning Representations.
- Decoupling representation and classifier for long-tailed recognition. arXiv preprint arXiv:1910.09217.
- Supervised contrastive learning. Advances in neural information processing systems, 33:18661–18673.
- Joosung Lee. 2022. The emotion is not one-hot encoding: Learning with grayscale label for emotion recognition in conversation. arXiv preprint arXiv:2206.07359.
- Joosung Lee and Wooin Lee. 2021. Compm: Context modeling with speaker’s pre-trained memory tracking for emotion recognition in conversation. arXiv preprint arXiv:2108.11626.
- Instructerc: Reforming emotion recognition in conversation with a retrieval multi-task llms framework. arXiv preprint arXiv:2309.11911.
- Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:1910.13461.
- Contrast and generation make bart a good dialogue emotion recognizer. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 11002–11010.
- Targeted supervised contrastive learning for long-tailed recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6918–6928.
- Emocaps: Emotion capsule based model for conversational emotion recognition. arXiv preprint arXiv:2203.13504.
- Dialogueein: Emotion interaction network for dialogue affective analysis. In Proceedings of the 29th International Conference on Computational Linguistics, pages 684–693.
- Dialoguernn: An attentive rnn for emotion detection in conversations. In Proceedings of the AAAI conference on artificial intelligence, volume 33, pages 6818–6825.
- Long-tail learning via logit adjustment. arXiv preprint arXiv:2007.07314.
- Decoupled training for long-tailed classification with stochastic representations. arXiv preprint arXiv:2304.09426.
- Is discourse role important for emotion recognition in conversation? In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 11121–11129.
- Meld: A multimodal multi-party dataset for emotion recognition in conversations. arXiv preprint arXiv:1810.02508.
- Directed acyclic graph network for conversational emotion recognition. arXiv preprint arXiv:2105.12907.
- Supervised prototypical contrastive learning for emotion recognition in conversation. arXiv preprint arXiv:2210.08713.
- Context or knowledge is not always necessary: A contrastive learning framework for emotion recognition in conversations. In Findings of the Association for Computational Linguistics: ACL 2023, pages 14054–14067.
- Contrastive learning-enhanced nearest neighbor mechanism for multi-label text classification. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 672–679.
- Cluster-level contrastive learning for emotion recognition in conversations. IEEE Transactions on Affective Computing.
- Hybrid curriculum learning for emotion recognition in conversation. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 11595–11603.
- Sayyed M Zahiri and Jinho D Choi. 2017. Emotion detection on tv show transcripts with sequence-based convolutional neural networks. arXiv preprint arXiv:1708.04299.
- Dualgats: Dual graph attention networks for emotion recognition in conversations. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 7395–7408.
- Mimicking the thinking process for emotion recognition in conversation with prompts and paraphrasing. arXiv preprint arXiv:2306.06601.
- Dialoguellm: Context and emotion knowledge-tuned llama models for emotion recognition in conversations. arXiv preprint arXiv:2310.11374.
- Label anchored contrastive learning for language understanding. arXiv preprint arXiv:2205.10227.
- Label-driven denoising framework for multi-label few-shot aspect category detection. arXiv preprint arXiv:2210.04220.
- Dieu: A dynamic interaction emotion unit for emotion recognition in conversation. ACM Transactions on Asian and Low-Resource Language Information Processing, 22(10):1–18.
- Is chatgpt equipped with emotional dialogue capabilities? arXiv preprint arXiv:2304.09582.
- Knowledge-enriched transformer for emotion detection in textual conversations. arXiv preprint arXiv:1909.10681.
- Balanced contrastive learning for long-tailed visual recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6908–6917.
- Fangxu Yu (4 papers)
- Junjie Guo (18 papers)
- Zhen Wu (79 papers)
- Xinyu Dai (116 papers)