AMA: attention-based multi-feature aggregation module for action recognition

被引:1
|
作者
Yu, Mengyun [1 ]
Chen, Ying [1 ]
机构
[1] Jiangnan Univ, Minist Educ, Key Lab Adv Proc Control Light Ind, Wuxi 214000, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Channel excitation; Spatial-temporal aggregation; Convolution neural network; FRAMEWORK;
D O I
10.1007/s11760-022-02268-2
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Spatial information learning, temporal modeling and channel relationships capturing are important for action recognition in videos. In this work, an attention-based multi-feature aggregation (AMA) module that encodes the above features in a unified module is proposed, which contains a spatial-temporal aggregation (STA) structure and a channel excitation (CE) structure. STA mainly employs two convolutions to model spatial and temporal features, respectively. The matrix multiplication in STA has the ability of capturing long-range dependencies. The CE learns the importance of each channel, so as to bias the allocation of available resources toward the informative features. AMA module is simple yet efficient enough that can be inserted into a standard ResNet architecture without any modification. In this way, the representation of the network can be enhanced. We equip ResNet-50 with AMA module to build an effective AMA Net with limited extra computation cost, only 1.002 times that of ResNet-50. Extensive experiments indicate that AMA Net outperforms the state-of-the-art methods on UCF101 and HMDB51, which is 6.2% and 10.0% higher than the baseline. In short, AMA Net achieves the high accuracy of 3D convolutional neural networks and maintains the complexity of 2D convolutional neural networks simultaneously.
引用
收藏
页码:619 / 626
页数:8
相关论文
共 50 条
  • [41] A multi-feature fusion method based on bilstm-attention-crf for chinese named entity recognition
    Zhang, Zhiyuan
    Sun, Shuihua
    Xu, Shiao
    Xu, Fan
    Liu, Jianhua
    Journal of Network Intelligence, 2021, 6 (03): : 518 - 534
  • [42] Attention-Based Multimodal Image Feature Fusion Module for Transmission Line Detection
    Choi, Hyeyeon
    Yun, Jong Pil
    Kim, Bum Jun
    Jang, Hyeonah
    Kim, Sang Woo
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (11) : 7686 - 7695
  • [43] Attention-based variable-size feature compression module for edge inference
    Li, Shibao
    Ma, Chenxu
    Zhang, Yunwu
    Li, Longfei
    Wang, Chengzhi
    Cui, Xuerong
    Liu, Jianhang
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (06): : 8469 - 8484
  • [44] Attention-based variable-size feature compression module for edge inference
    Shibao Li
    Chenxu Ma
    Yunwu Zhang
    Longfei Li
    Chengzhi Wang
    Xuerong Cui
    Jianhang Liu
    The Journal of Supercomputing, 2024, 80 : 8469 - 8484
  • [45] Traffic lights detection and recognition based on multi-feature fusion
    Wenhao Wang
    Shanlin Sun
    Mingxin Jiang
    Yunyang Yan
    Xiaobing Chen
    Multimedia Tools and Applications, 2017, 76 : 14829 - 14846
  • [46] Multi-feature gait recognition with DNN based on sEMG signals
    Yao, Ting
    Gao, Farong
    Zhang, Qizhong
    Ma, Yuliang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (04) : 3521 - 3542
  • [47] Traffic lights detection and recognition based on multi-feature fusion
    Wang, Wenhao
    Sun, Shanlin
    Jiang, Mingxin
    Yan, Yunyang
    Chen, Xiaobing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (13) : 14829 - 14846
  • [48] Chinese Named Entity Recognition Based on Multi-feature Fusion
    Sun, Zhenxiang
    Sun, Runyuan
    Liang, Zhifeng
    Su, Zhuang
    Yu, Yongxin
    Wu, Shuainan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 670 - 681
  • [49] Multi-Feature Encoder for Radar-Based Gesture Recognition
    Sun, Yuliang
    Fei, Tai
    Li, Xibo
    Warnecke, Alexander
    Warsitz, Ernst
    Pohl, Nils
    2020 IEEE INTERNATIONAL RADAR CONFERENCE (RADAR), 2020, : 351 - 356
  • [50] Human behavior recognition based on multi-feature fusion of image
    Xu Song
    Hongyu Zhou
    Guoying Liu
    Cluster Computing, 2019, 22 : 9113 - 9121