AMA: attention-based multi-feature aggregation module for action recognition

Cited by: 1
Authors
Yu, Mengyun [1]
Chen, Ying [1]
Affiliations
[1] Jiangnan Univ, Minist Educ, Key Lab Adv Proc Control Light Ind, Wuxi 214000, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Action recognition; Channel excitation; Spatial-temporal aggregation; Convolution neural network; FRAMEWORK;
DOI
10.1007/s11760-022-02268-2
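Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code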
0808 ; 0809 ;
Abstract
Spatial information learning, temporal modeling, and channel relationship capturing are all important for action recognition in videos. This work proposes an attention-based multi-feature aggregation (AMA) module that encodes these features in a single unified module, consisting of a spatial-temporal aggregation (STA) structure and a channel excitation (CE) structure. STA mainly employs two convolutions to model spatial and temporal features, respectively, and its matrix multiplication is able to capture long-range dependencies. CE learns the importance of each channel so as to bias the allocation of available resources toward the informative features. The AMA module is simple yet efficient, and it can be inserted into a standard ResNet architecture without any modification, thereby enhancing the representation of the network. Equipping ResNet-50 with the AMA module yields an effective AMA Net at limited extra computation cost, only 1.002 times that of ResNet-50. Extensive experiments indicate that AMA Net outperforms state-of-the-art methods on UCF101 and HMDB51, scoring 6.2% and 10.0% higher than the baseline, respectively. In short, AMA Net achieves the high accuracy of 3D convolutional neural networks while maintaining the complexity of 2D convolutional neural networks.
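The abstract names two mechanisms without giving their equations: a channel excitation that learns per-channel importance, and a matrix multiplication that captures long-range dependencies. The sketch below illustrates both ideas in plain Python under loudly stated assumptions. It is not the authors' implementation: the channel gate is a generic squeeze-and-excitation style stand-in, the long-range step is a generic dot-product attention over frames, and every function name, weight matrix, and tensor layout here is hypothetical.

```python
import math


def sigmoid(x):
    """Logistic function used to squash gate values into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def channel_excitation(features, w1, w2):
    """Assumed squeeze-and-excitation style gate (not taken from the paper).

    features: C channels, each a flat list of activations.
    w1: hidden x C weights, w2: C x hidden weights (hypothetical parameters).
    """
    # Squeeze: summarize each channel by its mean activation.
    squeezed = [sum(ch) / len(ch) for ch in features]
    # Excite: small bottleneck with ReLU, then a sigmoid gate per channel.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    gates = [sigmoid(sum(w * h for w, h in zip(row, hidden))) for row in w2]
    # Rescale: informative channels keep more of their activation.
    scaled = [[g * v for v in ch] for g, ch in zip(gates, features)]
    return gates, scaled


def temporal_attention(frames):
    """Assumed dot-product attention across frames (not taken from the paper).

    frames: T feature vectors of equal length D.
    """
    # Matrix multiplication F * F^T: every frame is compared with every other,
    # so distant frames can interact in a single step.
    scores = [[sum(a * b for a, b in zip(fi, fj)) for fj in frames] for fi in frames]
    weights = [softmax(row) for row in scores]
    dim = len(frames[0])
    # Second multiplication: each output frame is a weighted mix of all frames.
    mixed = [
        [sum(w * fj[d] for w, fj in zip(row, frames)) for d in range(dim)]
        for row in weights
    ]
    return weights, mixed


if __name__ == "__main__":
    gates, _ = channel_excitation(
        [[1.0, 2.0, 3.0], [0.0, 0.0, 0.0]], [[1.0, 0.0]], [[1.0], [-1.0]]
    )
    print("channel gates:", gates)
    weights, _ = temporal_attention([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
    print("attention weights for frame 0:", weights[0])
```

In this toy setup the gate for the active channel comes out larger than the gate for the all-zero channel, and each row of the attention weights sums to 1. A real module would operate on convolutional feature maps with learned weights.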
Pages: 619 - 626
Page count: 8
Related papers
50 in total
  • [21] EEG FEATURE EXTRACTION AND RECOGNITION BASED ON MULTI-FEATURE FUSION
    Sun, Jian
    Wu, Quanyu
    Gao, Nan
    Pan, Lingjiao
    Tao, Weige
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2024, 36 (06):
  • [22] A Novel Human Action Recognition Algorithm Based on Decision Level Multi-Feature Fusion
    Song Wei
    Liu Ningning
    Yang Guosheng
    Yang Pei
    CHINA COMMUNICATIONS, 2015, 12 (02) : 93 - 102
  • [23] A Novel Human Action Recognition Algorithm Based on Decision Level Multi-Feature Fusion
    SONG Wei
    LIU Ningning
    YANG Guosheng
    YANG Pei
    China Communications, 2015, (S2) : 93 - 102
  • [24] A Novel Human Action Recognition Algorithm Based on Decision Level Multi-Feature Fusion
    SONG Wei
    LIU Ningning
    YANG Guosheng
    YANG Pei
    China Communications, 2015, 12 (S2) : 93 - 102
  • [25] Attention-based Pyramid Aggregation Network for Visual Place Recognition
    Zhu, Yingying
    Wang, Jiong
    Xie, Lingxi
    Zheng, Liang
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 99 - 107
  • [26] FRUIT RECOGNITION BASED ON MULTI-FEATURE AND MULTI-DECISION
    Wang, Xiaohua
    Huang, Wei
    Jin, Chao
    Hu, Min
    Ren, Fuji
    2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 113 - 117
  • [27] Multi-head attention-based two-stream EfficientNet for action recognition
    Zhou, Aihua
    Ma, Yujun
    Ji, Wanting
    Zong, Ming
    Yang, Pei
    Wu, Min
    Liu, Mingzhe
    MULTIMEDIA SYSTEMS, 2023, 29 (02) : 487 - 498
  • [28] Multi-head attention-based two-stream EfficientNet for action recognition
    Aihua Zhou
    Yujun Ma
    Wanting Ji
    Ming Zong
    Pei Yang
    Min Wu
    Mingzhe Liu
    Multimedia Systems, 2023, 29 : 487 - 498
  • [29] Attention-based network for effective action recognition from multi-view video
    Hoang-Thuyen Nguyen
    Thi-Oanh Nguyen
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 971 - 980
  • [30] Multi-Feature based Hand-Gesture Recognition
    Herath, H. M. S. P. B.
    Ekanayake, M. P. B.
    Godaliyadda, G. M. R. I.
    Wijayakulasooriya, J. V.
    2015 FIFTEENTH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2015, : 63 - 68