Speaker-Aware Interactive Graph Attention Network for Emotion Recognition in Conversation

被引:0
|
作者
Jia, Zhaohong [1 ]
Shi, Yunwei [2 ]
Liu, Weifeng [1 ]
Huang, Zhenhua [1 ]
Sun, Xiao [3 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
[2] Anhui Univ, Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
[3] Hefei Univ Technol, Inst Artificial Intelligence, Sch Comp Sci & Informat Engn, Hefei Comprehens Natl Sci Ctr, Hefei 230601, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Emotion recognition in conversation; text classification; natural language processing; CONVOLUTIONAL NETWORK;
D O I
10.1145/3627806
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, Emotion Recognition in Conversation (ERC) has attracted much attention and has become a hot topic in the field of natural language processing. Conversation is conducted in chronological order; current utterance is more likely influenced by nearby utterances. At the same time, speaker dependency also plays a core role in the conversation dynamic. The combined effect of the sequence-aware information and the speaker-aware information makes the emotion's dynamic change. However, past works used simple information fusion methods to model the two kinds of information but ignored their interactive influence. Thus, we propose a novel method entitled SIGAT (Speaker-aware Interactive Graph Attention Network) to solve the problem. The core module is a mutual interactive module in which a dual-connection (self-connection and interact-connection) graph attention network is constructed. The advantage of SIGAT is modeling the speaker-aware and sequence-aware information in a unified graph and updating them simultaneously. In this way, we model the interactive influence of them and obtain the final representations, which have richer contextual clues. Experimental results on the four public datasets demonstrate that SIGAT outperforms the state-of-the-art models.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Speaker-aware cognitive network with cross-modal attention for multimodal emotion recognition in conversation
    Guo, Lili
    Song, Yikang
    Ding, Shifei
    KNOWLEDGE-BASED SYSTEMS, 2024, 296
  • [2] SAPBERT: Speaker-Aware Pretrained BERT for Emotion Recognition in Conversation
    Lim, Seunguook
    Kim, Jihie
    ALGORITHMS, 2023, 16 (01)
  • [3] Interactive Multimodal Attention Network for Emotion Recognition in Conversation
    Ren, Minjie
    Huang, Xiangdong
    Shi, Xiaoqi
    Nie, Weizhi
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1046 - 1050
  • [4] Speaker-Aware Speech Emotion Recognition by Fusing Amplitude and Phase Information
    Guo, Lili
    Wang, Longbiao
    Dang, Jianwu
    Liu, Zhilei
    Guan, Haotian
    MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 14 - 25
  • [5] Speaker-aware Cross-modal Fusion Architecture for Conversational Emotion Recognition
    Zhao, Huan
    Li, Bo
    Zhang, Zixing
    INTERSPEECH 2023, 2023, : 2718 - 2722
  • [6] Static and Dynamic Speaker Modeling based on Graph Neural Network for Emotion Recognition in Conversation
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 247 - 253
  • [7] A speaker-aware multiparty dialogue discourse parser with heterogeneous graph neural network
    Li, Jiaqi
    Liu, Ming
    Wang, Yuxin
    Zhang, Daxing
    Qin, Bing
    COGNITIVE SYSTEMS RESEARCH, 2023, 79 : 15 - 23
  • [8] Speaker-Aware Speech Enhancement with Self-Attention
    Lin, Ju
    Van Wijngaarden, Adriaan J.
    Smith, Melissa C.
    Wang, Kuang-Ching
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 486 - 490
  • [9] Residual Relation-Aware Attention Deep Graph-Recurrent Model for Emotion Recognition in Conversation
    Duong, Anh-Quang
    Ho, Ngoc-Huynh
    Pant, Sudarshan
    Kim, Seungwon
    Kim, Soo-Hyung
    Yang, Hyung-Jeong
    IEEE ACCESS, 2024, 12 (2349-2360): : 2349 - 2360
  • [10] A Contextual Attention Network for Multimodal Emotion Recognition in Conversation
    Wang, Tana
    Hou, Yaqing
    Zhou, Dongsheng
    Zhang, Qiang
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,