Fine-Grained Action Recognition Based on Temporal Pyramid Excitation Network

被引:0
|
作者
Zhou, Xuan [1 ]
Yi, Jianping [2 ]
机构
[1] Xian Traff Engn Inst, Sch Mech & Elect Engn, Xian 710300, Peoples R China
[2] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
来源
关键词
Fine-grained action recognition; temporal pyramid excitation module; temporal receptive; multi-excitation module;
D O I
10.32604/iasc.2023.034855
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining more discriminative temporal features to enrich temporal context representation is considered the key to fine-grained action recognition. Previous action recognition methods utilize a fixed spatiotemporal window to learn local video representation. However, these methods failed to capture complex motion patterns due to their limited receptive field. To solve the above problems, this paper proposes a lightweight Temporal Pyramid Excitation (TPE) module to capture the short, medium, and longterm temporal context. In this method, Temporal Pyramid (TP) module can effectively expand the temporal receptive field of the network by using the multi-temporal kernel decomposition without significantly increasing the computational cost. In addition, the Multi Excitation module can emphasize temporal importance to enhance the temporal feature representation learning. TPE can be integrated into ResNet50, and building a compact video learning framework-TPENet. Extensive validation experiments on several challenging benchmark (Something-Something V1, Something-Something V2, UCF-101, and HMDB51) datasets demonstrate that our method achieves a preferable balance between computation and accuracy.
引用
收藏
页码:2103 / 2116
页数:14
相关论文
共 50 条
  • [41] Multi-stream I3D Network for Fine-grained Action Recognition
    You, Jian
    Shi, Ping
    Bao, Xiaojie
    PROCEEDINGS OF 2018 IEEE 4TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2018), 2018, : 611 - 614
  • [42] Fine-Grained Activity Recognition Based on Features of Action Subsegments and Incremental Broad Learning
    Chen, Shi
    Wu, Sheng
    Zhu, Licai
    Yang, Hao
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT I, 2022, 13155 : 100 - 114
  • [43] Global Topology Constraint Network for Fine-Grained Vehicle Recognition
    Xiang, Ye
    Fu, Ying
    Huang, Hua
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (07) : 2918 - 2929
  • [44] Fine-grained Vehicle Recognition by Deep Convolutional Neural Network
    Huang, Kun
    Zhang, Bailing
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 465 - 470
  • [45] Feature Correlation Residual Network for Fine-Grained Image Recognition
    Xu, Jiazhen
    Wei, Yantao
    Deng, Wei
    IEEE ACCESS, 2020, 8 : 214322 - 214331
  • [46] Fine-Grained Spatial-Temporal Gait Recognition Network Based on Millimeter-Wave Radar Point Cloud
    Xue, Shikun
    Du, Lan
    Shi, Yu
    Chen, Xiaoyang
    Xie, Meng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 16
  • [47] SPViM: Sparse Pyramid Video Representation Learning Framework for Fine-Grained Action Retrieval
    Wang, Lutong
    Yang, Chenglei
    Luan, Hongqiu
    Gai, Wei
    Geng, Wenxiu
    Zheng, Yawen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14866 : 323 - 334
  • [48] Collaborative Representation based Fine-grained Species Recognition
    Chakraborti, Tapabrata
    McCane, Brendan
    Mills, Steven
    Pal, Umapada
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2016, : 42 - 47
  • [49] Temporal Pyramid Pooling-Based Convolutional Neural Network for Action Recognition
    Wang, Peng
    Cao, Yuanzhouhan
    Shen, Chunhua
    Liu, Lingqiao
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (12) : 2613 - 2622
  • [50] Spatial-temporal pyramid based Convolutional Neural Network for action recognition
    Zheng, Zhenxing
    An, Gaoyun
    Wu, Dapeng
    Ruan, Qiuqi
    NEUROCOMPUTING, 2019, 358 : 446 - 455