Multidimensional Prototype Refactor Enhanced Network for Few-Shot Action Recognition

被引:12
|
作者
Liu, Shuwen [1 ]
Jiang, Min [1 ]
Kong, Jun [2 ]
机构
[1] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Jiangsu, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Prototypes; Training; Feature extraction; Optimization; Image recognition; Face recognition; Visualization; Few-shot action recognition; prototype enhancement; similarity optimization; temporal modeling;
D O I
10.1109/TCSVT.2022.3175923
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Few-shot action recognition classifies new actions with only few training samples, of which the mainstream methods adopt class means to obtain prototypes as the representations of each category. However, affected by sample capacity and extreme samples, mean-of-class prototypes can't well represent the average level of samples. In this paper, we enhance the prototypes from multiple dimensions for better classification. We firstly propose a novel similarity optimization mechanism where Prototype Aggregation Adaptive Loss (PAAL) is designed to deeply mine the similarity between samples and prototypes for enhancing the ability of inter-class differential detail identification. Secondly, for mitigating the impact of the samples on class prototypes, we refactor the prototype calculation formula with Cross-Enhanced Prototype (CEP) to narrow intra-class differences in which Reweighted Similarity Attention (RSA) is designed to update prototypes. Finally, Dynamic Temporal Transformation (DTT) is proposed to alleviate inconsistent distribution of temporal information for obtaining better video-level descriptors. Extensive experiments on standard benchmark datasets demonstrate that our proposed method achieves the state-of-the-art results.
引用
收藏
页码:6955 / 6966
页数:12
相关论文
共 50 条
  • [31] Matching Compound Prototypes for Few-Shot Action Recognition
    Huang, Yifei
    Yang, Lijin
    Chen, Guo
    Zhang, Hongjie
    Lu, Feng
    Sato, Yoichi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 3977 - 4002
  • [32] Label-Description Enhanced Network for Few-Shot Named Entity Recognition
    Zhang, Xinyue
    Gao, Hui
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 444 - 455
  • [33] Hierarchical compositional representations for few-shot action recognition
    Li, Changzhen
    Zhang, Jie
    Wu, Shuzhe
    Jin, Xin
    Shan, Shiguang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [34] Advances in Few-Shot Action Recognition: A Comprehensive Review
    Ruan, Zanxi
    Wei, Yingmei
    Yuan, Yifei
    Li, Yu
    Guo, Yanming
    Xie, Yuxiang
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 390 - 398
  • [35] A Generative Approach to Zero-Shot and Few-Shot Action Recognition
    Mishra, Ashish
    Verma, Vinay Kumar
    Reddy, M. Shiva Krishna
    Arulkumar, S.
    Rai, Piyush
    Mittal, Anurag
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 372 - 380
  • [36] Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation
    Liu, Jie
    Bao, Yanqi
    Xie, Guo-Sen
    Xiong, Huan
    Sonke, Jan-Jakob
    Gavves, Efstratios
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11543 - 11552
  • [37] Cycle association prototype network for few-shot semantic segmentation
    Hao, Zhuangzhuang
    Shao, Ji
    Gong, Bo
    Yang, Jingwen
    Jing, Ling
    Chen, Yingyi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [38] Mutual Learning Prototype Network for Few-Shot Text Classification
    Liu, Jun
    Qin, Xiaorui
    Tao, Jian
    Dong, Hongfei
    Li, Xiaoxu
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2024, 47 (03): : 30 - 35
  • [39] Two-Stream Prototype Learning Network for Few-Shot Face Recognition Under Occlusions
    Yang, Xingyu
    Han, Mengya
    Luo, Yong
    Hu, Han
    Wen, Yonggang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1555 - 1563
  • [40] Prototype Completion for Few-Shot Learning
    Zhang, Baoquan
    Li, Xutao
    Ye, Yunming
    Feng, Shanshan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12250 - 12268