Fine-Grained Action Recognition Based on Temporal Pyramid Excitation Network

被引:0
|
作者
Zhou, Xuan [1 ]
Yi, Jianping [2 ]
机构
[1] Xian Traff Engn Inst, Sch Mech & Elect Engn, Xian 710300, Peoples R China
[2] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
来源
关键词
Fine-grained action recognition; temporal pyramid excitation module; temporal receptive; multi-excitation module;
D O I
10.32604/iasc.2023.034855
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining more discriminative temporal features to enrich temporal context representation is considered the key to fine-grained action recognition. Previous action recognition methods utilize a fixed spatiotemporal window to learn local video representation. However, these methods failed to capture complex motion patterns due to their limited receptive field. To solve the above problems, this paper proposes a lightweight Temporal Pyramid Excitation (TPE) module to capture the short, medium, and longterm temporal context. In this method, Temporal Pyramid (TP) module can effectively expand the temporal receptive field of the network by using the multi-temporal kernel decomposition without significantly increasing the computational cost. In addition, the Multi Excitation module can emphasize temporal importance to enhance the temporal feature representation learning. TPE can be integrated into ResNet50, and building a compact video learning framework-TPENet. Extensive validation experiments on several challenging benchmark (Something-Something V1, Something-Something V2, UCF-101, and HMDB51) datasets demonstrate that our method achieves a preferable balance between computation and accuracy.
引用
收藏
页码:2103 / 2116
页数:14
相关论文
共 50 条
  • [31] FineAction: A Fine-Grained Video Dataset for Temporal Action Localization
    Liu, Yi
    Wang, Limin
    Wang, Yali
    Ma, Xiao
    Qiao, Yu
    IEEE Transactions on Image Processing, 2022, 31 : 6937 - 6950
  • [32] FineAction: A Fine-Grained Video Dataset for Temporal Action Localization
    Liu, Yi
    Wang, Limin
    Wang, Yali
    Ma, Xiao
    Qiao, Yu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6937 - 6950
  • [33] Pipelining Localized Semantic Features for Fine-Grained Action Recognition
    Zhou, Yang
    Ni, Bingbing
    Yan, Shuicheng
    Moulin, Pierre
    Tian, Qi
    COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 481 - 496
  • [34] Hand Detection and Tracking in Videos for Fine-Grained Action Recognition
    Do, Nga H.
    Yanai, Keiji
    COMPUTER VISION - ACCV 2014 WORKSHOPS, PT I, 2015, 9008 : 19 - 34
  • [35] Fine-grained Temporal Attention Network for EEG-based Seizure Detection
    Jeong, Seungwoo
    Jeon, Eunjin
    Ko, Wonjun
    Suk, Heung-Il
    2021 9TH IEEE INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE (BCI), 2021, : 329 - 332
  • [36] Fine-Grained Radio Frequency Fingerprint Recognition Network Based on Attention Mechanism
    Zhang, Yulan
    Hu, Jun
    Jiang, Rundong
    Lin, Zengrong
    Chen, Zengping
    ENTROPY, 2024, 26 (01)
  • [37] Fine-grained vehicle type detection and recognition based on dense attention network
    Ke, Xiao
    Zhang, Yufeng
    NEUROCOMPUTING, 2020, 399 : 247 - 257
  • [38] Fine-Grained Ship Recognition With Spatial-Aligned Feature Pyramid Network and Adaptive Prototypical Contrastive Learning
    Li, Yangfan
    Chen, Liang
    Li, Wei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [39] Fine-grained traffic video vehicle recognition based orientation estimation and temporal information
    Anqi Hu
    Zhengxing Sun
    Qian Li
    Yechao Xu
    Yihuan Zhu
    Sheng Zhang
    Multimedia Tools and Applications, 2023, 82 : 13745 - 13763
  • [40] Fine-grained traffic video vehicle recognition based orientation estimation and temporal information
    Hu, Anqi
    Sun, Zhengxing
    Li, Qian
    Xu, Yechao
    Zhu, Yihuan
    Zhang, Sheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 13745 - 13763