Multidimensional Prototype Refactor Enhanced Network for Few-Shot Action Recognition

被引:12
|
作者
Liu, Shuwen [1 ]
Jiang, Min [1 ]
Kong, Jun [2 ]
机构
[1] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Jiangsu, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Prototypes; Training; Feature extraction; Optimization; Image recognition; Face recognition; Visualization; Few-shot action recognition; prototype enhancement; similarity optimization; temporal modeling;
D O I
10.1109/TCSVT.2022.3175923
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Few-shot action recognition classifies new actions with only few training samples, of which the mainstream methods adopt class means to obtain prototypes as the representations of each category. However, affected by sample capacity and extreme samples, mean-of-class prototypes can't well represent the average level of samples. In this paper, we enhance the prototypes from multiple dimensions for better classification. We firstly propose a novel similarity optimization mechanism where Prototype Aggregation Adaptive Loss (PAAL) is designed to deeply mine the similarity between samples and prototypes for enhancing the ability of inter-class differential detail identification. Secondly, for mitigating the impact of the samples on class prototypes, we refactor the prototype calculation formula with Cross-Enhanced Prototype (CEP) to narrow intra-class differences in which Reweighted Similarity Attention (RSA) is designed to update prototypes. Finally, Dynamic Temporal Transformation (DTT) is proposed to alleviate inconsistent distribution of temporal information for obtaining better video-level descriptors. Extensive experiments on standard benchmark datasets demonstrate that our proposed method achieves the state-of-the-art results.
引用
收藏
页码:6955 / 6966
页数:12
相关论文
共 50 条
  • [1] Multimodal Prototype-Enhanced Network for Few-Shot Action Recognition
    Ni, Xinzhe
    Liu, Yong
    Wen, Hao
    Ji, Yatai
    Xiao, Jing
    Yang, Yujiu
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1 - 10
  • [2] Compound Prototype Matching for Few-Shot Action Recognition
    Huang, Yifei
    Yang, Lijin
    Sato, Yoichi
    COMPUTER VISION - ECCV 2022, PT IV, 2022, 13664 : 351 - 368
  • [3] Reconstructed Prototype Network Combined with CDC-TAGCN for Few-Shot Action Recognition
    Wu, Aihua
    Ding, Songyu
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [4] CLIP-guided Prototype Modulating for Few-shot Action Recognition
    Wang, Xiang
    Zhang, Shiwei
    Cen, Jun
    Gao, Changxin
    Zhang, Yingya
    Zhao, Deli
    Sang, Nong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (06) : 1899 - 1912
  • [5] Hybrid attentive prototypical network for few-shot action recognition
    Ruan, Zanxi
    Wei, Yingmei
    Guo, Yanming
    Xie, Yuxiang
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (06) : 8249 - 8272
  • [6] Intermediate prototype network for few-shot segmentation
    Luo, Xiaoliu
    Duan, Zhao
    Zhang, Taiping
    SIGNAL PROCESSING, 2023, 203
  • [7] Contrastive prototype network with prototype augmentation for few-shot classification
    Jiang, Mengjuan
    Fan, Jiaqing
    He, Jiangzhen
    Du, Weidong
    Wang, Yansong
    Li, Fanzhang
    INFORMATION SCIENCES, 2025, 686
  • [8] Pointer-prototype fusion network for few-shot named entity recognition
    Zhao Haiying
    Guo Xuan
    The Journal of China Universities of Posts and Telecommunications, 2023, 30 (05) : 32 - 41
  • [9] Pointer-prototype fusion network for few-shot named entity recognition
    Haiying, Zhao
    Xuan, Guo
    Journal of China Universities of Posts and Telecommunications, 2023, 30 (05): : 32 - 41
  • [10] Gaussian Prototype Rectification For Few-shot Image Recognition
    Lin, Jinfu
    Shen, Junmin
    He, Xiaojian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,