Multi-loop graph convolutional network for multimodal conversational emotion recognition

Cited by: 2
Authors
Ren, Minjie [1]
Huang, Xiangdong [1]
Li, Wenhui [1]
Liu, Jing [1]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
Funding
China Postdoctoral Science Foundation
Keywords
Conversational emotion recognition; Multi-modal sentiment analysis; Graph convolutional network
DOI
10.1016/j.jvcir.2023.103846
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Emotion recognition in conversations (ERC) has gained increasing research attention in recent years because of its wide range of applications in emerging tasks such as social media analysis, dialog generation, and recommender systems. Because the constituent utterances of a conversation are closely related semantically, their emotional states are also closely related. We argue that this correlation can guide the emotion recognition of individual utterances. Accordingly, we propose a novel approach, the Semantic-correlation Graph Convolutional Network (SC-GCN), that exploits this correlation for the ERC task in the multimodal setting. Specifically, we first introduce a hierarchical fusion module to model the dynamics among the textual, acoustic, and visual features and to fuse the multimodal information. We then construct a graph based on the speaker and temporal dependencies of the dialog. We put forward a novel multi-loop architecture that explores the semantic correlations through the self-attention mechanism and enhances the correlation information over multiple loops. Through the graph convolution process, SC-GCN obtains a refined representation of each utterance, which is used for the final prediction. Extensive experiments on two benchmark datasets demonstrate the superiority of SC-GCN.
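The abstract gives no equations, so the following is only a minimal sketch of the general idea it describes: a dialog graph built from speaker and temporal dependencies, with attention-weighted propagation repeated over several loops. It is not the authors' SC-GCN. The function names, the `window` and `loops` parameters, the edge rule (same speaker or within `window` turns), and the use of unlearned dot-product attention are all assumptions made for illustration. The hierarchical fusion module and the classifier are omitted, and the input features are assumed to be already fused.

```python
import math

def build_adjacency(speakers, window=1):
    """Assumed edge rule: connect utterances i and j when they are at most
    `window` turns apart (temporal) or share a speaker; self-loops included."""
    n = len(speakers)
    return [[1 if abs(i - j) <= window or speakers[i] == speakers[j] else 0
             for j in range(n)] for i in range(n)]

def attention_weights(feats, adj):
    """Row-wise softmax over dot-product scores, restricted to graph edges."""
    n = len(feats)
    weights = []
    for i in range(n):
        scores = [sum(a * b for a, b in zip(feats[i], feats[j])) if adj[i][j] else None
                  for j in range(n)]
        top = max(s for s in scores if s is not None)  # for numerical stability
        exps = [math.exp(s - top) if s is not None else 0.0 for s in scores]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights

def propagate(feats, weights):
    """Replace each utterance vector with the weighted sum of its neighbours."""
    n, d = len(feats), len(feats[0])
    return [[sum(weights[i][k] * feats[k][c] for k in range(n)) for c in range(d)]
            for i in range(n)]

def multi_loop_refine(feats, speakers, window=1, loops=2):
    """Recompute attention and propagate `loops` times (hypothetical loop scheme)."""
    adj = build_adjacency(speakers, window)
    for _ in range(loops):
        feats = propagate(feats, attention_weights(feats, adj))
    return feats

if __name__ == "__main__":
    # Toy two-speaker dialog with made-up 2-dimensional utterance features.
    speakers = ["A", "B", "A", "B"]
    feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
    for row in multi_loop_refine(feats, speakers):
        print([round(v, 3) for v in row])
```

A trained model would use learned projection matrices for the attention scores and the graph convolution; the sketch leaves them out so that it stays dependency-free and shows only the data flow.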
Pages: 10