Fine-Grained Action Recognition Based on Temporal Pyramid Excitation Network

Cited by: 0
Authors
Zhou, Xuan [1]
Yi, Jianping [2]
Affiliations
[1] Xian Traff Engn Inst, Sch Mech & Elect Engn, Xian 710300, Peoples R China
[2] Xian Polytech Univ, Sch Elect & Informat, Xian 710048, Peoples R China
Source
Keywords
Fine-grained action recognition; temporal pyramid excitation module; temporal receptive; multi-excitation module;
DOI
10.32604/iasc.2023.034855
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Mining more discriminative temporal features to enrich temporal context representation is considered the key to fine-grained action recognition. Previous action recognition methods use a fixed spatiotemporal window to learn local video representations. However, these methods fail to capture complex motion patterns because of their limited receptive field. To solve this problem, this paper proposes a lightweight Temporal Pyramid Excitation (TPE) module that captures short-, medium-, and long-term temporal context. In this method, the Temporal Pyramid (TP) module expands the temporal receptive field of the network through multi-temporal kernel decomposition without significantly increasing the computational cost. In addition, the Multi Excitation module emphasizes temporal importance to enhance temporal feature representation learning. TPE can be integrated into ResNet50 to build a compact video learning framework, TPENet. Extensive validation experiments on several challenging benchmark datasets (Something-Something V1, Something-Something V2, UCF-101, and HMDB51) demonstrate that our method achieves a preferable balance between computation and accuracy.
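The abstract does not spell out how the multi-temporal kernel decomposition is built. As a rough illustration only, the sketch below assumes one common reading: a cascade of small stride-1 temporal convolutions in which each stage reuses the previous stage's output, so successive branches see short, medium, and long temporal context. The function names, the kernel size of 3, and the depthwise weight count are all hypothetical choices for this example and are not taken from the paper.

```python
def receptive_field(kernel_sizes):
    # Stride-1, undilated 1D convolutions applied in sequence:
    # each kernel of size k widens the receptive field by k - 1 frames.
    return 1 + sum(k - 1 for k in kernel_sizes)


def cascade_summary(num_stages, kernel=3):
    # Hypothetical cascade (an assumption, not the paper's design):
    # stage i is applied to the output of stage i - 1.
    fields = [receptive_field([kernel] * (i + 1)) for i in range(num_stages)]
    # Temporal weights per channel for depthwise convolutions, bias ignored.
    cascaded_weights = kernel * num_stages
    # Cost if each branch instead used one independent kernel as wide
    # as its receptive field.
    separate_weights = sum(fields)
    return fields, cascaded_weights, separate_weights


fields, cascaded, separate = cascade_summary(3)
print(fields, cascaded, separate)  # [3, 5, 7] 9 15
```

Under these assumptions, three cascaded kernel-3 stages cover 3, 5, and 7 frames with 9 temporal weights per channel, whereas three independent kernels of sizes 3, 5, and 7 would need 15. This is only meant to make the "wider receptive field at modest cost" claim concrete; the actual TP module may differ.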
Pages: 2103-2116
Number of pages: 14
Related papers
50 in total
  • [21] Aircraft target detection and fine-grained recognition based on RHTC network
    Cao X.
    Zou H.
    Cheng F.
    Li R.
    He S.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (12): 3439 - 3451
  • [22] WAVELET-DECOUPLING CONTRASTIVE ENHANCEMENT NETWORK FOR FINE-GRAINED SKELETON-BASED ACTION RECOGNITION
    Chang, Haochen
    Chen, Jing
    Li, Yilin
    Chen, Jixiang
    Zhang, Xiaofeng
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024: 4060 - 4064
  • [23] A Fine-Grained Vehicle Behavior Recognition Framework: Struct Segment Temporal Convolutional Network
    Yan, Guozhi
    Liu, Kai
    Hu, Junbo
    Jin, Feiyu
    Zhang, Hao
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023: 3404 - 3410
  • [24] Fine-grained lung nodule segmentation with pyramid deconvolutional neural network
    Zhao, Xinzhuo
    Sun, Wenqing
    Qian, Wei
    Qi, Shouliang
    Sun, Jianjun
    Zhang, Bo
    Yang, Zhigang
    MEDICAL IMAGING 2019: COMPUTER-AIDED DIAGNOSIS, 2019, 10950
  • [25] Context Sensitive Network for weakly-supervised fine-grained temporal action localization
    Dong, Cerui
    Liu, Qinying
    Wang, Zilei
    Zhang, Yixin
    Zhao, Feng
    NEURAL NETWORKS, 2025, 185
  • [26] Faster-slow network fused with enhanced fine-grained features for action recognition
    Wu, Xuegang
    Zhu, Jiawei
    Yang, Liu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 105
  • [27] Hierarchical gate network for fine-grained visual recognition
    Chen, Ying
    Song, Jie
    Song, Mingli
    NEUROCOMPUTING, 2022, 470 : 170 - 181
  • [28] Fine-Grained Skeleton-Based Human Action Recognition for Figure Skating
    Wei, Zhihong
    Qin, Jianshu
    Lie, Bangmao
    PROCEEDINGS OF THE 2024 27TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024: 2686 - 2691
  • [29] Fine-grained Human Action Recognition Based on Zero-Shot Learning
    Zhao, Yahui
    Shi, Ping
    You, Jian
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019: 294 - 297
  • [30] Local Temporal Bilinear Pooling for Fine-Grained Action Parsing
    Zhang, Yan
    Tang, Siyu
    Muandet, Krikamol
    Jarvers, Christian
    Neumann, Heiko
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 11997 - 12007