SAPBERT: Speaker-Aware Pretrained BERT for Emotion Recognition in Conversation

Cited by: 2
Authors
Lim, Seunguook [1]
Kim, Jihie [1]
Affiliations
[1] Dongguk Univ Seoul, Dept Artificial Intelligence, 30 Pildong Ro 1 Gil, Seoul 04620, South Korea
Keywords
natural language processing; emotion recognition in conversation; dialogue modeling; pre-training; hierarchical BERT
DOI
10.3390/a16010008
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Emotion recognition in conversation (ERC) is receiving increasing attention as interactions between humans and machines grow across services such as chatbots and virtual assistants. Because emotional expressions within a conversation can depend heavily on the contextual information of the participating speakers, it is important to capture self-dependency and inter-speaker dynamics. In this study, we propose a new pre-trained model, SAPBERT, that learns to identify speakers in a conversation in order to capture speaker-dependent contexts and address the ERC task. SAPBERT is pre-trained with three training objectives: Speaker Classification (SC), Masked Utterance Regression (MUR), and Last Utterance Generation (LUG). We investigate whether our pre-trained speaker-aware model can be leveraged to capture speaker-dependent contexts for ERC tasks. Experiments show that our proposed approach outperforms baseline models, demonstrating the effectiveness and validity of our method.
Pages: 16