SAPBERT: Speaker-Aware Pretrained BERT for Emotion Recognition in Conversation

被引:2
|
作者
Lim, Seunguook [1 ]
Kim, Jihie [1 ]
机构
[1] Dongguk Univ Seoul, Dept Artificial Intelligence, 30 Pildong Ro 1 Gil, Seoul 04620, South Korea
关键词
natural language processing; motion recognition in conversation; dialogue modeling; pre-training; hierarchical BERT;
D O I
10.3390/a16010008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition in conversation (ERC) is receiving more and more attention, as interactions between humans and machines increase in a variety of services such as chat-bot and virtual assistants. As emotional expressions within a conversation can heavily depend on the contextual information of the participating speakers, it is important to capture self-dependency and inter-speaker dynamics. In this study, we propose a new pre-trained model, SAPBERT, that learns to identify speakers in a conversation to capture the speaker-dependent contexts and address the ERC task. SAPBERT is pre-trained with three training objectives including Speaker Classification (SC), Masked Utterance Regression (MUR), and Last Utterance Generation (LUG). We investigate whether our pre-trained speaker-aware model can be leveraged for capturing speaker-dependent contexts for ERC tasks. Experiments show that our proposed approach outperforms baseline models through demonstrating the effectiveness and validity of our method.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Speaker-Aware Long Short-Term Memory Multi-Task Learning for Speech Recognition
    Pironkov, Gueorgui
    Dupont, Stephane
    Dutoit, Thierry
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1911 - 1915
  • [22] SPEAKER-AWARE TRAINING OF LSTM-RNNS FOR ACOUSTIC MODELLING
    Tan, Tian
    Qian, Yanmin
    Yu, Dong
    Kundu, Souvik
    Lu, Liang
    Sim, Khe Chai
    Xiao, Xiong
    Zhang, Yu
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5280 - 5284
  • [23] Low-Resource Speech Synthesis with Speaker-Aware Embedding
    Yang, Li-Jen
    Yeh, I-Ping
    Chien, Jen-Tzung
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 235 - 239
  • [24] Who is Speaking? Speaker-Aware Multiparty Dialogue Act Classification
    Qamar, Ayesha
    Pyarelal, Adarsh
    Huang, Ruihong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10122 - 10135
  • [25] OPTIMIZATION OF SPEAKER-AWARE MULTICHANNEL SPEECH EXTRACTION WITH ASR CRITERION
    Zmolikova, Katerina
    Delcroix, Marc
    Kinoshita, Keisuke
    Higuchi, Takuya
    Nakatani, Tomohiro
    Cernocky, Jan
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6702 - 6706
  • [26] Context-and Sentiment-Aware Networks for Emotion Recognition in Conversation
    Tu G.
    Wen J.
    Liu C.
    Jiang D.
    Cambria E.
    IEEE Transactions on Artificial Intelligence, 2022, 3 (05): : 699 - 708
  • [27] HIERARCHICAL SPEAKER-AWARE SEQUENCE-TO-SEQUENCE MODEL FOR DIALOGUE SUMMARIZATION
    Lei, Yuejie
    Yan, Yuanmeng
    Zeng, Zhiyuan
    He, Keqing
    Zhang, Ximing
    Xu, Weiran
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7823 - 7827
  • [28] Leveraging speaker-aware structure and factual knowledge for faithful dialogue summarization
    Zhao, Lulu
    Xu, Weiran
    Zhang, Chunyun
    Guo, Jun
    KNOWLEDGE-BASED SYSTEMS, 2022, 245
  • [29] Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue
    Liu, Longxiang
    Zhang, Zhuosheng
    Zhao, Hai
    Zhou, Xi
    Zhou, Xiang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13406 - 13414
  • [30] Static and Dynamic Speaker Modeling based on Graph Neural Network for Emotion Recognition in Conversation
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 247 - 253