Multi-loop graph convolutional network for multimodal conversational emotion recognition

Cited by: 2
Authors
Ren, Minjie [1]
Huang, Xiangdong [1]
Li, Wenhui [1]
Liu, Jing [1]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
Funding
China Postdoctoral Science Foundation
Keywords
Conversational emotion recognition; Multi-modal sentiment analysis; Graph convolutional network
DOI
10.1016/j.jvcir.2023.103846
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Emotion recognition in conversations (ERC) has gained increasing research attention in recent years due to its wide applications in emerging tasks such as social media analysis, dialog generation, and recommender systems. Because the constituent utterances in a conversation are closely related semantically, their emotional states are also closely related. We argue that this correlation can guide the emotion recognition of individual utterances. Accordingly, we propose a novel approach named Semantic-correlation Graph Convolutional Network (SC-GCN) that exploits this correlation for the ERC task in multimodal scenarios. Specifically, we first introduce a hierarchical fusion module to model the dynamics among the textual, acoustic, and visual features and fuse the multimodal information. Afterward, we construct a graph structure based on the speaker and temporal dependencies of the dialog. We put forward a novel multi-loop architecture that explores the semantic correlations via the self-attention mechanism and enhances the correlation information over multiple loops. Through the graph convolution process, the proposed SC-GCN obtains a refined representation of each utterance, which is used for the final prediction. Extensive experiments on two benchmark datasets demonstrate the superiority of SC-GCN.
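The graph step described in the abstract can be illustrated with a small sketch. This is not the authors' SC-GCN: the abstract does not specify the edge rule, window size, or layer weights, so everything below is an assumption made for illustration. The hypothetical `build_adjacency` links two utterances when they lie within `window` turns of each other (temporal dependency) or share a speaker (speaker dependency), and the hypothetical `propagate` performs one weight-free averaging step over neighbours in place of a learned graph convolution.

```python
from typing import List


def build_adjacency(speakers: List[str], window: int = 1) -> List[List[float]]:
    """Build a dense adjacency matrix over utterances (illustrative rule).

    Utterances i and j are linked when they are at most `window` turns apart
    or were spoken by the same speaker. Both rules are assumptions.
    """
    n = len(speakers)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            near = abs(i - j) <= window  # also adds the self-loop (i == j)
            same_speaker = speakers[i] == speakers[j]
            if near or same_speaker:
                adj[i][j] = 1.0
    return adj


def row_normalize(adj: List[List[float]]) -> List[List[float]]:
    """Scale each row to sum to 1 so propagation averages over neighbours."""
    return [[value / sum(row) for value in row] for row in adj]


def propagate(adj_norm: List[List[float]],
              features: List[List[float]]) -> List[List[float]]:
    """One propagation step without learned weights: A_norm @ X."""
    n = len(features)
    dim = len(features[0])
    return [
        [sum(adj_norm[i][k] * features[k][c] for k in range(n))
         for c in range(dim)]
        for i in range(n)
    ]


if __name__ == "__main__":
    # Toy dialog: two speakers alternating, one scalar feature per utterance.
    speakers = ["A", "B", "A", "B", "A"]
    features = [[1.0], [0.0], [1.0], [0.0], [1.0]]
    adj_norm = row_normalize(build_adjacency(speakers, window=1))
    print(propagate(adj_norm, features))
```

A full model would replace the fixed averaging with learned weight matrices and, following the abstract, reweight edges by self-attention over several loops; those parts are left out here because the abstract gives no details.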
Pages: 10
Related papers
50 in total
  • [31] DGSNet: Dual Graph Structure Network for Emotion Recognition in Multimodal Conversations
    Tang, Shimin
    Wang, Changjian
    Tian, Fengyu
    Xu, Kele
    Xu, Minpeng
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 78 - 85
  • [32] MMDAG: Multimodal Directed Acyclic Graph Network for Emotion Recognition in Conversation
    Xu, Shuo
    Jia, Yuxiang
    Niu, Changyong
    Zan, Hongying
    LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6802 - 6807
  • [33] Multimodal Decoupled Distillation Graph Neural Network for Emotion Recognition in Conversation
    Dai, Yijing
    Li, Yingjian
    Chen, Dongpeng
    Li, Jinxing
    Lu, Guangming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9910 - 9924
  • [34] Emotion Recognition in Conversation Based on a Dynamic Complementary Graph Convolutional Network
    Yang, Zhenyu
    Li, Xiaoyang
    Cheng, Yuhu
    Zhang, Tong
    Wang, Xuesong
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 1567 - 1579
  • [35] Multi-View Hierarchical Attention Graph Convolutional Network with Domain Adaptation for EEG Emotion Recognition
    Li, Chao
    Wang, Feng
    Bian, Ning
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, NETWORK SECURITY AND COMMUNICATION TECHNOLOGY, CNSCT 2024, 2024, : 624 - 630
  • [36] Emotion recognition using multi-scale EEG features through graph convolutional attention network
    Cao, Liwen
    Zhao, Wenfeng
    Sun, Biao
    NEURAL NETWORKS, 2025, 184
  • [37] Multimodal speech emotion recognition and classification using convolutional neural network techniques
    Christy, A.
    Vaithyasubramanian, S.
    Jesudoss, A.
    Praveena, M. D. Anto
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02) : 381 - 388
  • [38] Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition
    Zhang, Shiqing
    Zhang, Shiliang
    Huang, Tiejun
    Gao, Wen
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 281 - 284
  • [40] Graph Convolutional Neural Network Based Emotion Recognition with Brain Functional Connectivity Network
    Gao, Pengzhi
    Zheng, Xiangwei
    Wang, Tao
    Zhang, Yuang
    International Journal of Crowd Science, 2024, 8 (04) : 195 - 204