AN INTERACTION-AWARE ATTENTION NETWORK FOR SPEECH EMOTION RECOGNITION IN SPOKEN DIALOGS

被引:0
|
作者
Yeh, Sung-Lin [1 ]
Lin, Yun-Shao
Lee, Chi-Chun
机构
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu, Taiwan
关键词
speech emotion recognition; interaction; attention mechanism; spoken dialogs;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Obtaining robust speech emotion recognition (SER) in scenarios of spoken interactions is critical to the developments of next generation human-machine interface. Previous research has largely focused on performing SER by modeling each utterance of the dialog in isolation without considering the transactional and dependent nature of the human-human conversation. In this work, we propose an interaction-aware attention network (IAAN) that incorporate contextual information in the learned vocal representation through a novel attention mechanism. Our proposed method achieves 66.3% accuracy (7.9% over baseline methods) in four class emotion recognition and is also the current state-of-art recognition rates obtained on the benchmark database.
引用
收藏
页码:6685 / 6689
页数:5
相关论文
共 50 条
  • [31] Decision-Making for Autonomous Vehicles With Interaction-Aware Behavioral Prediction and Social-Attention Neural Network
    Li, Xiao
    Liu, Kaiwen
    Tseng, H. Eric
    Girard, Anouck
    Kolmanovsky, Ilya
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024,
  • [32] Empirical Interpretation of Speech Emotion Perception with Attention Based Model for Speech Emotion Recognition
    Jalal, Md Asif
    Milner, Rosanna
    Hain, Thomas
    INTERSPEECH 2020, 2020, : 4113 - 4117
  • [33] SIA-Net: Scalable Interaction-Aware Network for Vehicle Trajectory Prediction Based on Self-Attention
    Huang, Junan
    Huang, Zhiqiu
    Shen, Guohua
    Xu, Heng
    Hua, Gaoyang
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 780 - 787
  • [34] Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification
    Du, Yang
    Yuan, Chunfeng
    Li, Bing
    Zhao, Lili
    Li, Yangxi
    Hu, Weiming
    COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 388 - 404
  • [35] Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification
    Hu, Weiming
    Liu, Haowei
    Du, Yang
    Yuan, Chunfeng
    Li, Bing
    Maybank, Stephen John
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 7010 - 7028
  • [36] DEEP CONVOLUTIONAL RECURRENT NEURAL NETWORK WITH ATTENTION MECHANISM FOR ROBUST SPEECH EMOTION RECOGNITION
    Huang, Che-Wei
    Narayanan, Shrikanth
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 583 - 588
  • [37] DSTCNet: Deep Spectro-Temporal-Channel Attention Network for Speech Emotion Recognition
    Guo, Lili
    Ding, Shifei
    Wang, Longbiao
    Dang, Jianwu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 188 - 197
  • [38] DSTCNet: Deep Spectro-Temporal-Channel Attention Network for Speech Emotion Recognition
    Guo, Lili
    Ding, Shifei
    Wang, Longbiao
    Dang, Jianwu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 188 - 197
  • [39] TWACapsNet: a capsule network with two-way attention mechanism for speech emotion recognition
    Wen, Xin-Cheng
    Liu, Kun-Hong
    Luo, Yan
    Ye, Jiaxin
    Chen, Liyan
    SOFT COMPUTING, 2023, 28 (15-16) : 8701 - 8713
  • [40] Speech Emotion Recognition Using Cascaded Attention Network with Joint Loss for Discrimination of Confusions
    Yang Liu
    Haoqin Sun
    Wenbo Guan
    Yuqi Xia
    Zhen Zhao
    Machine Intelligence Research, 2023, 20 : 595 - 604