Modeling Sentiment-Speaker-Dependency for Emotion Recognition in Conversation

Authors
Ge, Lin [1]
Huang, Faliang [1]
Li, Qi [1]
Ye, Yihua [1]
Affiliation
[1] Nanning Normal Univ, Sch Comp & Informat Engn, Nanning, Peoples R China
Keywords
Emotion recognition in conversation; Speaker dependency; Sentiment dependency
DOI
10.1109/IJCNN60899.2024.10650672
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Emotion Recognition in Conversations (ERC) plays a crucial role in the development of human-machine interaction. A conversation is a multi-party, multi-emotion, multi-turn process of information propagation. However, existing works focus on designing models and algorithms that learn better representations of dialogue context and speakers, and rarely consider a key element of this process: the strong correlation between emotional states and sentiment polarity, and their interdependence. To address this issue, we propose a novel model, named S2D-ERC (Sentiment-Speaker-Dependency for Emotion Recognition in Conversation), for the ERC task. The proposed model represents a conversation as a directed acyclic graph and encodes both speaker and sentiment dependencies between utterances with heterogeneous edges. Additionally, to capture the dynamics of information interaction in the conversational context, we employ a cross-attention mechanism in which latent representations of speaker and sentiment are learned with two different directions of information flow. Experimental results on two benchmarks show that our model outperforms state-of-the-art models.
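The graph construction mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `build_dag_edges`, the `window` parameter, the edge-type labels, and the rule that each utterance links only to a few preceding utterances are all assumptions made for illustration. The sketch only shows how edges that always point from an earlier utterance to a later one keep the graph acyclic, and how speaker and sentiment relations can be carried as distinct edge types.

```python
from typing import List, Tuple

Edge = Tuple[int, int, str]  # (source utterance, target utterance, edge type)


def build_dag_edges(speakers: List[str], polarities: List[str], window: int = 2) -> List[Edge]:
    """Hypothetical helper: link each utterance to up to `window` earlier ones.

    Every edge runs from a lower index to a higher index, so no cycle can form.
    Edge types are illustrative labels, not taken from the paper.
    """
    edges: List[Edge] = []
    for j in range(len(speakers)):
        for i in range(max(0, j - window), j):
            # Speaker dependency: same or different speaker.
            if speakers[i] == speakers[j]:
                edges.append((i, j, "same_speaker"))
            else:
                edges.append((i, j, "diff_speaker"))
            # Sentiment dependency: added only when polarity labels match.
            if polarities[i] == polarities[j]:
                edges.append((i, j, "same_sentiment"))
    return edges


# Toy three-utterance dialogue between speakers A and B.
edges = build_dag_edges(["A", "B", "A"], ["pos", "neg", "pos"])
for edge in edges:
    print(edge)
```

In a full model, such typed edges would typically feed a graph neural network layer that treats each edge type separately; the abstract does not specify those details, so they are left out here.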
Pages: 8