Multidimensional Prototype Refactor Enhanced Network for Few-Shot Action Recognition

被引:12
|
作者
Liu, Shuwen [1 ]
Jiang, Min [1 ]
Kong, Jun [2 ]
机构
[1] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Jiangsu, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Prototypes; Training; Feature extraction; Optimization; Image recognition; Face recognition; Visualization; Few-shot action recognition; prototype enhancement; similarity optimization; temporal modeling;
D O I
10.1109/TCSVT.2022.3175923
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Few-shot action recognition classifies new actions with only few training samples, of which the mainstream methods adopt class means to obtain prototypes as the representations of each category. However, affected by sample capacity and extreme samples, mean-of-class prototypes can't well represent the average level of samples. In this paper, we enhance the prototypes from multiple dimensions for better classification. We firstly propose a novel similarity optimization mechanism where Prototype Aggregation Adaptive Loss (PAAL) is designed to deeply mine the similarity between samples and prototypes for enhancing the ability of inter-class differential detail identification. Secondly, for mitigating the impact of the samples on class prototypes, we refactor the prototype calculation formula with Cross-Enhanced Prototype (CEP) to narrow intra-class differences in which Reweighted Similarity Attention (RSA) is designed to update prototypes. Finally, Dynamic Temporal Transformation (DTT) is proposed to alleviate inconsistent distribution of temporal information for obtaining better video-level descriptors. Extensive experiments on standard benchmark datasets demonstrate that our proposed method achieves the state-of-the-art results.
引用
收藏
页码:6955 / 6966
页数:12
相关论文
共 50 条
  • [21] Semantic-Guided Relation Propagation Network for Few-shot Action Recognition
    Wang, Xiao
    Ye, Weirong
    Qi, Zhongang
    Zhao, Xun
    Wang, Guangge
    Shan, Ying
    Wang, Hanzi
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 816 - 825
  • [22] Spatiotemporal Orthogonal Projection Capsule Network for Incremental Few-Shot Action Recognition
    Feng, Yangbo
    Gao, Junyu
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9825 - 9838
  • [23] Learning Similarity: Feature-Aligning Network for Few-shot Action Recognition
    Tan, Shaoqing
    Yang, Ruoyu
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [24] Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition
    Wang, Xiao
    Yan, Yan
    Hu, Hai-Miao
    Li, Bo
    Wang, Hanzi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1257 - 1271
  • [25] Multi-level semantic-assisted prototype learning for Few-Shot Action Recognition
    Liu, Dan
    Xia, Qing
    Meng, Fanrong
    Ye, Mao
    Zhang, Jianwei
    NEUROCOMPUTING, 2025, 636
  • [26] On the Importance of Spatial Relations for Few-shot Action Recognition
    Zhang, Yilun
    Fu, Yuqian
    Ma, Xingjun
    Qi, Lizhe
    Chen, Jingjing
    Wu, Zuxuan
    Jiang, Yu-Gang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2243 - 2251
  • [27] An Enhanced Prototypical Network Architecture for Few-Shot Handwritten Urdu Character Recognition
    Sahay, Rajat
    Coustaty, Mickael
    IEEE ACCESS, 2023, 11 : 33682 - 33696
  • [28] Task Adaptive Modeling for Few-shot Action Recognition
    Wang, Jiayi
    Jin, Yi
    Feng, Songhe
    Li, Yidong
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [29] Anomalous Action Recognition Research for Few-shot Learning
    Qi, Yufei
    Liu, Ting
    Fu, Yuzhuo
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1306 - 1310
  • [30] Elastic temporal alignment for few-shot action recognition
    Pan, Fei
    Xu, Chunlei
    Zhang, Hongjie
    Guo, Jie
    Guo, Yanwen
    IET COMPUTER VISION, 2023, 17 (01) : 39 - 50