Separable 3D residual attention network for human action recognition

被引:1
|
作者
Zhang, Zufan [1 ]
Peng, Yue [1 ]
Gan, Chenquan [1 ]
Abate, Andrea Francesco [2 ]
Zhu, Lianxiang [3 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[2] Univ Salerno, Dept Comp Sci, Via Giovanni Paolo II 132, I-84084 Fisciano, SA, Italy
[3] Xian Shiyou Univ, Sch Comp Sci, Xian 710065, Peoples R China
关键词
Human computer interaction; Human action recognition; Residual network; Attention mechanism; Multi-stage training strategy; SPATIAL-TEMPORAL ATTENTION; FEATURES; LSTM;
D O I
10.1007/s11042-022-12972-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an important research issue in computer vision, human action recognition has been regarded as a crucial mean of communication and interaction between humans and computers. To help computers automatically recognize human behaviors and accurately understand human intentions, this paper proposes a separable three-dimensional residual attention network (defined as Sep-3D RAN), which is a lightweight network and can extract the informative spatial-temporal representations for the applications of video-based human computer interaction. Specifically, Sep-3D RAN is constructed via stacking multiple separable three-dimensional residual attention blocks, in which each standard three-dimensional convolution is approximated as a cascaded two-dimensional spatial convolution and a one-dimensional temporal convolution, and then a dual attention mechanism is built by embedding a channel attention sub-module and a spatial attention sub-module sequentially in each residual block, thereby acquiring more discriminative features to improve the model guidance capability. Furthermore, a multi-stage training strategy is used for Sep-3D RAN training, which can relieve the over-fitting effectively. Finally, experimental results demonstrate that the performance of Sep-3D RAN can surpass the existing state-of-the-art methods.
引用
收藏
页码:5435 / 5453
页数:19
相关论文
共 50 条
  • [41] Recognition of Human Continuous Action with 3D CNN
    Yu, Gang
    Li, Ting
    COMPUTER VISION SYSTEMS, ICVS 2017, 2017, 10528 : 314 - 322
  • [42] HUMAN ACTION RECOGNITION IN 3D MOTION SEQUENCES
    Kelgeorgiadis, Konstantinos
    Nikolaidis, Nikos
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 2205 - 2209
  • [43] Multifeature Selection for 3D Human Action Recognition
    Huang, Min
    Su, Song-Zhi
    Zhang, Hong-Bo
    Cai, Guo-Rong
    Gong, Dongying
    Cao, Donglin
    Li, Shao-Zi
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (02)
  • [44] 3D Human Action Recognition with Skeleton Orientation Vectors and Stacked Residual Bi-LSTM
    Wan, Xiaoyi
    Xing, Tengfei
    Ji, Yi
    Gong, Shengrong
    Liu, Chunping
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 571 - 576
  • [45] Global Spatio-Temporal Attention for Action Recognition Based on 3D Human Skeleton Data
    Han, Yun
    Chung, Sheng-Luen
    Xiao, Qiang
    Lin, Wei You
    Su, Shun-Feng
    IEEE ACCESS, 2020, 8 : 88604 - 88616
  • [46] Accurate recognition of human abnormal behaviours using adaptive 3D residual attention network with gated recurrent units (GRU) in the video sequences
    Balakrishnan, T. Suresh
    Jayalakshmi, D.
    Geetha, P.
    Raj, T. Saju
    Hemavathi, R.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2024, 12 (01):
  • [47] DAN: Deep-Attention Network for 3D Shape Recognition
    Nie, Weizhi
    Zhao, Yue
    Song, Dan
    Gao, Yue
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4371 - 4383
  • [48] Human Action Recognition Using Non-separable Oriented 3D Dual-Tree Complex Wavelets
    Minhas, Rashid
    Baradarani, Aryaz
    Seifzadeh, Sepideh
    Wu, Q. M. Jonathan
    COMPUTER VISION - ACCV 2009, PT III, 2010, 5996 : 226 - +
  • [49] Video action recognition method based on attention residual network and LSTM
    Zhang, Yu
    Dong, Pengyue
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3611 - 3616
  • [50] Residual Gating Fusion Network for Human Action Recognition
    Zhang, Junxuan
    Hu, Haifeng
    BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 79 - 86