AN INTERACTION-AWARE ATTENTION NETWORK FOR SPEECH EMOTION RECOGNITION IN SPOKEN DIALOGS

被引:0
|
作者
Yeh, Sung-Lin [1 ]
Lin, Yun-Shao
Lee, Chi-Chun
机构
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu, Taiwan
关键词
speech emotion recognition; interaction; attention mechanism; spoken dialogs;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Obtaining robust speech emotion recognition (SER) in scenarios of spoken interactions is critical to the developments of next generation human-machine interface. Previous research has largely focused on performing SER by modeling each utterance of the dialog in isolation without considering the transactional and dependent nature of the human-human conversation. In this work, we propose an interaction-aware attention network (IAAN) that incorporate contextual information in the learned vocal representation through a novel attention mechanism. Our proposed method achieves 66.3% accuracy (7.9% over baseline methods) in four class emotion recognition and is also the current state-of-art recognition rates obtained on the benchmark database.
引用
收藏
页码:6685 / 6689
页数:5
相关论文
共 50 条
  • [1] Negative Emotion Recognition in Spoken Dialogs
    Zhang, Xiaodong
    Wang, Houfeng
    Li, Li
    Zhao, Maoxiang
    Li, Quanzhong
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 103 - 115
  • [2] General Interaction-Aware Neural Network for Action Recognition
    Gao, Jialin
    Li, Jiani
    Wang, Guanshuo
    Yuan, Yufeng
    Zhou, Xi
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 93 - 106
  • [3] CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION
    Ramet, Gaetan
    Garner, Philip N.
    Baeriswyl, Michael
    Lazaridis, Alexandros
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 126 - 131
  • [4] APIN: Amplitude- and phase-aware interaction network for speech emotion recognition
    Guo, Lili
    Li, Jie
    Ding, Shifei
    Dang, Jianwu
    SPEECH COMMUNICATION, 2025, 169
  • [5] Attention Based Fully Convolutional Network for Speech Emotion Recognition
    Zhang, Yuanyuan
    Du, Jun
    Wang, Zirui
    Zhang, Jianshu
    Tu, Yanhui
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1771 - 1775
  • [6] A Joint Network Based on Interactive Attention for Speech Emotion Recognition
    Hu, Ying
    Hou, Shijing
    Yang, Huamin
    Huang, Hao
    He, Liang
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1715 - 1720
  • [7] Constrained Tuple Extraction with Interaction-Aware Network
    Xue, Xiaojun
    Zhang, Chunxia
    Xu, Tianxiang
    Niu, Zhendong
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 11430 - 11444
  • [8] A DIALOGICAL EMOTION DECODER FOR SPEECH EMOTION RECOGNITION IN SPOKEN DIALOG
    Yeh, Sung-Lin
    Lin, Yun-Shao
    Lee, Chi-Chun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6479 - 6483
  • [9] Sparse temporal aware capsule network for robust speech emotion recognition
    Zhang, Huiyun
    Huang, Heming
    Zhao, Puyang
    Yu, Zhenbao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 144
  • [10] Context-Aware Attention Network for Human Emotion Recognition in Video
    Liu, Xiaodong
    Wang, Miao
    ADVANCES IN MULTIMEDIA, 2020, 2020