Two-Stream Attention Network for Pain Recognition from Video Sequences

Cited by: 23
Authors
Thiam, Patrick [1, 2]
Kestler, Hans A. [1]
Schwenker, Friedhelm [2]
Affiliations
[1] Ulm Univ, Inst Med Syst Biol, Albert Einstein Allee 11, D-89081 Ulm, Germany
[2] Ulm Univ, Inst Neural Informat Proc, James Franck Ring, D-89081 Ulm, Germany
Keywords
convolutional neural networks; long short-term memory recurrent neural networks; information fusion; pain recognition
DOI
10.3390/s20030839
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Several approaches have been proposed for the analysis of pain-related facial expressions. These approaches range from common classification architectures based on a set of carefully designed handcrafted features, to deep neural networks characterised by an autonomous extraction of relevant facial descriptors and simultaneous optimisation of a classification architecture. In the current work, an end-to-end approach based on attention networks for the analysis and recognition of pain-related facial expressions is proposed. The method combines both spatial and temporal aspects of facial expressions through a weighted aggregation of attention-based neural networks' outputs, based on sequences of Motion History Images (MHIs) and Optical Flow Images (OFIs). Each input stream is fed into a specific attention network consisting of a Convolutional Neural Network (CNN) coupled to a Bidirectional Long Short-Term Memory (BiLSTM) Recurrent Neural Network (RNN). An attention mechanism generates a single weighted representation of each input stream (MHI sequence and OFI sequence), which is subsequently used to perform specific classification tasks. Simultaneously, a weighted aggregation of the classification scores specific to each input stream is performed to generate a final classification output. The assessment conducted on both the BioVid Heat Pain Database (Part A) and SenseEmotion Database points at the relevance of the proposed approach, as its classification performance is on par with state-of-the-art classification approaches proposed in the literature.
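The two fusion steps named in the abstract, attention-weighted pooling of each input stream and weighted aggregation of the per-stream classification scores, can be illustrated with a minimal sketch. It is not the authors' implementation: in the paper the per-frame features come from a CNN coupled to a BiLSTM and the attention scores are learned, whereas here the features, attention scores, class scores, and stream weights are all hypothetical placeholder numbers, and the function names are invented for illustration.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(features, attention_scores):
    """Collapse a sequence of per-frame feature vectors into one vector.

    Each frame gets a weight softmax(attention_scores)[t]; the result is
    the weighted sum of the frame features. In a trained model the scores
    would be produced by a learned layer (assumption: given as input here).
    """
    weights = softmax(attention_scores)
    dim = len(features[0])
    return [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]


def aggregate_scores(stream_scores, stream_weights):
    """Weighted aggregation of per-stream class scores.

    stream_weights are normalised to sum to 1 (assumption: non-negative
    weights; how the paper obtains them is not stated in the abstract).
    """
    total = sum(stream_weights)
    norm = [w / total for w in stream_weights]
    n_classes = len(stream_scores[0])
    return [sum(w * s[c] for w, s in zip(norm, stream_scores)) for c in range(n_classes)]


# Hypothetical per-frame features for one stream (3 frames, 2 dimensions).
mhi_features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
mhi_attention = [0.1, 0.1, 2.0]  # the last frame receives most of the weight
mhi_representation = attention_pool(mhi_features, mhi_attention)

# Hypothetical class scores (no pain, pain) from the MHI and OFI streams.
mhi_scores = [0.8, 0.2]
ofi_scores = [0.4, 0.6]
final_scores = aggregate_scores([mhi_scores, ofi_scores], [1.0, 1.0])
print(mhi_representation, final_scores)
```

With equal stream weights the aggregation reduces to a plain average of the two streams' scores; unequal weights let one stream dominate the final decision.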
Pages: 19
Related papers
50 in total
  • [41] Human action recognition using two-stream attention based LSTM networks
    Dai, Cheng
    Liu, Xingang
    Lai, Jinfeng
    APPLIED SOFT COMPUTING, 2020, 86
  • [42] Recognition and location of marine animal sounds using two-stream ConvNet with attention
    Hu, Shaoxiang
    Hou, Rong
    Liao, Zhiwu
    Chen, Peng
    FRONTIERS IN MARINE SCIENCE, 2023, 10
  • [43] Automated Video Behavior Recognition of Pigs Using Two-Stream Convolutional Networks
    Zhang, Kaifeng
    Li, Dan
    Huang, Jiayun
    Chen, Yifei
    SENSORS, 2020, 20 (04)
  • [44] Two-Stream Action Recognition-Oriented Video Super-Resolution
    Zhang, Haochen
    Liu, Dong
    Xiong, Zhiwei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 8798-8807
  • [45] Human Action Recognition Based on Improved Two-Stream Convolution Network
    Wang, Zhongwen
    Lu, Haozhu
    Jin, Junlan
    Hu, Kai
    APPLIED SCIENCES-BASEL, 2022, 12 (12)
  • [46] Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
    Xiong, Xin
    Min, Weidong
    Han, Qing
    Wang, Qi
    Zha, Cheng
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [47] Human Action Recognition Based on a Two-stream Convolutional Network Classifier
    Silva, Vincius de Oliveira
    Vidal, Flavio de Barros
    Soares Romariz, Alexandre Ricardo
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017: 774-778
  • [48] Two-Stream Gait Network for Cross-View Gait Recognition
    Wang K.
    Lei Y.
    Zhang J.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2020, 33 (05): 383-392
  • [49] Novel method for the recognition of Jinnan cattle action using bottleneck attention enhanced two-stream neural network
    Hao, Wangli
    Han, Meng
    Zhang, Kai
    Zhang, Li
    Hao, Wangbao
    Li, Fuzhong
    Liu, Zhenyu
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND BIOLOGICAL ENGINEERING, 2024, 17 (03): 203-210
  • [50] A simulated two-stream network via multilevel distillation of reviewed features and decoupled logits for video action recognition
    Gao, Zitao
    Liu, Xiangjian
    Wang, Anna K.
    Lin, Liyu
    VISUAL COMPUTER, 2024: 3907-3923