Dimensional emotion recognition based on two stream CNN fusion attention mechanism

被引:1
|
作者
Qi, Mei [1 ]
Zhang, Hairong [1 ]
机构
[1] Anhui Open Univ, Sch Informat & Construct Engn, 3 JiuHuashan Rd, Hefei 230022, Anhui, Peoples R China
关键词
Two stream CNN; sharing and global attention mechanism; dimensional emotion;
D O I
10.1117/12.2678902
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Aiming at the problem that discrete emotion recognition cannot depict continuous emotion changes, in order to capture high-level dimensional emotional information, this paper integrates attention mechanism into the two stream CNN model and proposes a Two Stream Convolutional Neural Network with Shared and Global attention mechanism (TSCNN-SGA). TSCNN-SGA uses the same structure of CNN network structure to extract the static stream of expression images and dynamic stream of expression sequences features respectively, firstly, in the dynamic and static dual flow feature extraction network, the output feature map of the previous convolution layer group is used to cascade to calculate the shared attention weight of the next layer group, secondly, the two stream convolution feature map with shared attention is cascaded, the attention weights of different positions are mapped onto the cascaded feature map and weighted, finally, the shared weight matrix in the convolution end of TSCNN-SSA and the global attention mechanism after the two stream feature cascade work together to obtain the depth space-time feature, which is input to the bidirectional long-short time network to obtain the final dimensional sentiment prediction value. Compared with different baseline methods, the average value of the proposed method's concordance correlation coefficient (CCC) in the arousal-valence space reached 0.576, which can effectively identify dimensional emotions.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Gesture Recognition Based on Two Stream CNN with Local Attention Mechanism
    Wang, Wentong
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 321 - 322
  • [2] Violin Music Emotion Recognition with Fusion of CNN-BiGRU and Attention Mechanism
    Ma, Sihan
    Zhou, Ruohua
    INFORMATION, 2024, 15 (04)
  • [3] Multi-Stream Convolution-Recurrent Neural Networks Based on Attention Mechanism Fusion for Speech Emotion Recognition
    Tao, Huawei
    Geng, Lei
    Shan, Shuai
    Mai, Jingchao
    Fu, Hongliang
    ENTROPY, 2022, 24 (08)
  • [4] Combined CNN LSTM with attention for speech emotion recognition based on feature-level fusion
    Liu Y.
    Chen A.
    Zhou G.
    Yi J.
    Xiang J.
    Wang Y.
    Multimedia Tools and Applications, 2024, 83 (21) : 59839 - 59859
  • [5] A speech emotion recognition method for the elderly based on feature fusion and attention mechanism
    Jian, Qijian
    Xiang, Min
    Huang, Wei
    THIRD INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION; NETWORK AND COMPUTER TECHNOLOGY (ECNCT 2021), 2022, 12167
  • [6] A multimodal fusion emotion recognition method based on multitask learning and attention mechanism
    Xie, Jinbao
    Wang, Jiyu
    Wang, Qingyan
    Yang, Dali
    Gu, Jinming
    Tang, Yongqiang
    Varatnitski, Yury I.
    NEUROCOMPUTING, 2023, 556
  • [7] Signals Recognition by CNN Based on Attention Mechanism
    Tian, Feng
    Wang, Li
    Xia, Meng
    ELECTRONICS, 2022, 11 (13)
  • [8] An efficient speech emotion recognition based on a dual-stream CNN-transformer fusion network
    Tellai M.
    Gao L.
    Mao Q.
    International Journal of Speech Technology, 2023, 26 (02) : 541 - 557
  • [9] A Model for EEG-Based Emotion Recognition: CNN-Bi-LSTM with Attention Mechanism
    Huang, Zhentao
    Ma, Yahong
    Wang, Rongrong
    Li, Weisu
    Dai, Yongsheng
    ELECTRONICS, 2023, 12 (14)
  • [10] Multichannel Fusion Based on modified CNN for Image Emotion Recognition
    Zhao, Juntao
    Journal of Computers (Taiwan), 2022, 33 (01) : 13 - 19