Multidimensional Prototype Refactor Enhanced Network for Few-Shot Action Recognition

被引:12
|
作者
Liu, Shuwen [1 ]
Jiang, Min [1 ]
Kong, Jun [2 ]
机构
[1] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Jiangsu, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Prototypes; Training; Feature extraction; Optimization; Image recognition; Face recognition; Visualization; Few-shot action recognition; prototype enhancement; similarity optimization; temporal modeling;
D O I
10.1109/TCSVT.2022.3175923
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Few-shot action recognition classifies new actions with only few training samples, of which the mainstream methods adopt class means to obtain prototypes as the representations of each category. However, affected by sample capacity and extreme samples, mean-of-class prototypes can't well represent the average level of samples. In this paper, we enhance the prototypes from multiple dimensions for better classification. We firstly propose a novel similarity optimization mechanism where Prototype Aggregation Adaptive Loss (PAAL) is designed to deeply mine the similarity between samples and prototypes for enhancing the ability of inter-class differential detail identification. Secondly, for mitigating the impact of the samples on class prototypes, we refactor the prototype calculation formula with Cross-Enhanced Prototype (CEP) to narrow intra-class differences in which Reweighted Similarity Attention (RSA) is designed to update prototypes. Finally, Dynamic Temporal Transformation (DTT) is proposed to alleviate inconsistent distribution of temporal information for obtaining better video-level descriptors. Extensive experiments on standard benchmark datasets demonstrate that our proposed method achieves the state-of-the-art results.
引用
收藏
页码:6955 / 6966
页数:12
相关论文
共 50 条
  • [41] VDARN: Video Disentangling Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition
    Su, Yong
    Xing, Meng
    An, Simin
    Peng, Weilong
    Feng, Zhiyong
    AD HOC NETWORKS, 2021, 113
  • [42] Prototype Reinforcement for Few-Shot Learning
    Xu, Liheng
    Xie, Qian
    Jiang, Baoqing
    Zhang, Jiashuo
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4912 - 4916
  • [43] Knowledge Graph Transfer Network for Few-Shot Recognition
    Chen, Riquan
    Chen, Tianshui
    Hui, Xiaolu
    Wu, Hefeng
    Li, Guanbin
    Lin, Liang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10575 - 10582
  • [44] Overall positive prototype for few-shot open-set recognition
    Sun, Liang-Yu
    Chu, Wei-Ta
    PATTERN RECOGNITION, 2024, 151
  • [45] Task-Aware Dual-Representation Network for Few-Shot Action Recognition
    Wang, Xiao
    Ye, Weirong
    Qi, Zhongang
    Wang, Guangge
    Wu, Jianping
    Shan, Ying
    Qie, Xiaohu
    Wang, Hanzi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5932 - 5946
  • [46] Motion-modulated Temporal Fragment Alignment Network For Few-Shot Action Recognition
    Wu, Jiamin
    Zhang, Tianzhu
    Zhang, Zhe
    Wu, Feng
    Zhang, Yongdong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9141 - 9150
  • [47] Parallel Attention Interaction Network for Few-Shot Skeleton-based Action Recognition
    Liu, Xingyu
    Zhou, Sanping
    Wang, Le
    Hua, Gang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1379 - 1388
  • [48] Enhanced prototypical network for few-shot relation extraction
    Wen, Wen
    Liu, Yongbin
    Ouyang, Chunping
    Lin, Qiang
    Chung, Tonglee
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
  • [49] Active Exploration of Multimodal Complementarity for Few-Shot Action Recognition
    Wanyan, Yuyang
    Yang, Xiaoshan
    Chen, Chaofan
    Xu, Changsheng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6492 - 6502
  • [50] VISUAL TEMPO CONTRASTIVE LEARNING FOR FEW-SHOT ACTION RECOGNITION
    Wang, Guangge
    Ye, Weirong
    Wang, Xiao
    Jin, Rongrong
    Wang, Hanzi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1096 - 1100