Hybrid attentive prototypical network for few-shot action recognition

被引:1
|
作者
Ruan, Zanxi [1 ]
Wei, Yingmei [1 ]
Guo, Yanming [1 ]
Xie, Yuxiang [1 ]
机构
[1] Natl Univ Def Technol, Lab Big Data & Decis, Changsha, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot action recognition; Few-shot learning; Video understanding; Metric learning;
D O I
10.1007/s40747-024-01571-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most previous few-shot action recognition works tend to process video temporal and spatial features separately, resulting in insufficient extraction of comprehensive features. In this paper, a novel hybrid attentive prototypical network (HAPN) framework for few-shot action recognition is proposed. Distinguished by its joint processing of temporal and spatial information, the HAPN framework strategically manipulates these dimensions from feature extraction to the attention module, consequently enhancing its ability to perform action recognition tasks. Our framework utilizes the R(2+1)D backbone network, coupling the extraction of integrated temporal and spatial features to ensure a comprehensive understanding of video content. Additionally, our framework introduces the novel Residual Tri-dimensional Attention (ResTriDA) mechanism, specifically designed to augment feature information across the temporal, spatial, and channel dimensions. ResTriDA dynamically enhances crucial aspects of video features by amplifying significant channel-wise features for action distinction, accentuating spatial details vital for capturing the essence of actions within frames, and emphasizing temporal dynamics to capture movement over time. We further propose a prototypical attentive matching module (PAM) built on the concept of metric learning to resolve the overfitting issue common in few-shot tasks. We evaluate our HAPN framework on three classical few-shot action recognition datasets: Kinetics-100, UCF101, and HMDB51. The results indicate that our framework significantly outperformed state-of-the-art methods. Notably, the 1-shot task, demonstrated an increase of 9.8% in accuracy on UCF101 and improvements of 3.9% on HMDB51 and 12.4% on Kinetics-100. These gains confirm the robustness and effectiveness of our approach in leveraging limited data for precise action recognition.
引用
收藏
页码:8249 / 8272
页数:24
相关论文
共 50 条
  • [21] Few-shot classification with prototypical neural network for hospital flow recognition under uncertainty
    Chang, Mike C.
    Alaeddini, Adel
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2024, 13 (01):
  • [22] Hybrid Relation Guided Set Matching for Few-shot Action Recognition
    Wang, Xiang
    Zhang, Shiwei
    Qing, Zhiwu
    Tang, Mingqian
    Zuo, Zhengrong
    Gao, Changxin
    Jin, Rong
    Sang, Nong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19916 - 19925
  • [23] A Pairwise Attentive Adversarial Spatiotemporal Network for Cross-Domain Few-Shot Action Recognition-R2
    Gao, Zan
    Guo, Leming
    Guan, Weili
    Liu, Anan
    Ren, Tongwei
    Chen, Shengyong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 767 - 782
  • [24] Dual Prototypical Network for Robust Few-shot Image Classification
    Song, Qi
    Peng, Zebin
    Ji, Luchen
    Yang, Xiaochen
    Li, Xiaoxu
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 533 - 537
  • [25] Bidirectional Matching Prototypical Network for Few-Shot Image Classification
    Fu, Wen
    Zhou, Li
    Chen, Jie
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 982 - 986
  • [26] Adaptive adversarial prototyping network for few-shot prototypical translation*
    Phaphuangwittayakul, Aniwat
    Ying, Fangli
    Guo, Yi
    Santisookrat, Surachai
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 94
  • [27] Global Prototypical Network for Few-Shot Hyperspectral Image Classification
    Zhang, Chengye
    Yue, Jun
    Qin, Qiming
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 (13) : 4748 - 4759
  • [28] Revisiting Prototypical Network for Cross Domain Few-Shot Learning
    Zhou, Fei
    Wang, Peng
    Zhang, Lei
    Wei, Wei
    Zhang, Yanning
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 20061 - 20070
  • [29] Semantic Transportation Prototypical Network for Few-shot Intent Detection
    Xu, Weiyuan
    Zhou, Peilin
    You, Chenyu
    Zou, Yuexian
    INTERSPEECH 2021, 2021, : 251 - 255
  • [30] Reweighted Regularized Prototypical Network for Few-Shot Fault Diagnosis
    Li, Kang
    Shang, Chao
    Ye, Hao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 6206 - 6217