Spatio-Temporal Self-supervision for Few-Shot Action Recognition

被引:0
|
作者
Yu, Wanchuan [1 ]
Guo, Hanyu [1 ]
Yan, Yan [1 ]
Li, Jie [2 ]
Wang, Hanzi [1 ]
机构
[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Xiamen, Peoples R China
[2] Xidian Univ, Sch Elect Engn, Video & Image Proc Syst Lab, Xian, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I | 2024年 / 14425卷
基金
中国国家自然科学基金;
关键词
Few-shot learning; Action recognition; Self-supervised learning;
D O I
10.1007/978-981-99-8429-9_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot action recognition aims to classify unseen action classes with limited labeled training samples. Most current works follow the metric learning technology to learn a good embedding and an appropriate comparison metric. Due to the limited labeled data, the generalization of embedding networks is limited when employing the meta-learning process with episodic tasks. In this paper, we aim to repurpose self-supervised learning to learn a more generalized few-shot embedding model. Specifically, a Spatio-Temporal Self-supervision (STS) framework for few-shot action recognition is proposed to generate self-supervision loss at the spatial and temporal levels as auxiliary losses. By this means, the proposed STS can provide a robust representation for few-shot action recognition. Furthermore, we propose a Spatio-Temporal Aggregation (STA) module that accounts for the spatial information relationship among all frames within a video sequence to achieve optimal video embedding. Experiments on several challenging few-shot action recognition benchmarks show the effectiveness of the proposed method in achieving state-of-the-art performance for few-shot action recognition.
引用
收藏
页码:84 / 96
页数:13
相关论文
共 50 条
  • [41] Task Adaptive Modeling for Few-shot Action Recognition
    Wang, Jiayi
    Jin, Yi
    Feng, Songhe
    Li, Yidong
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [42] Matching Compound Prototypes for Few-Shot Action Recognition
    Huang, Yifei
    Yang, Lijin
    Chen, Guo
    Zhang, Hongjie
    Lu, Feng
    Sato, Yoichi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 3977 - 4002
  • [43] Anomalous Action Recognition Research for Few-shot Learning
    Qi, Yufei
    Liu, Ting
    Fu, Yuzhuo
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1306 - 1310
  • [44] EdgeFont: Enhancing style and content representations in few-shot font generation with multi-scale edge self-supervision
    Wang, Yefei
    Xiong, Kangyue
    Yuan, Yiyang
    Zeng, Jinshan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 262
  • [45] Enhancing Few-Shot Action Recognition Using Skeleton Temporal Alignment and Adversarial Training
    Xu, Qingyang
    Yang, Jianjun
    Zhang, Hongyi
    Jie, Xin
    Bandara, Danushka
    IEEE ACCESS, 2024, 12 : 31745 - 31755
  • [46] Hierarchical Task-aware Temporal Modeling and Matching for few-shot action recognition
    Zhan, Yucheng
    Pan, Yijun
    Wu, Siying
    Zhang, Yueyi
    Sun, Xiaoyan
    NEUROCOMPUTING, 2025, 624
  • [47] Hierarchical compositional representations for few-shot action recognition
    Li, Changzhen
    Zhang, Jie
    Wu, Shuzhe
    Jin, Xin
    Shan, Shiguang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [48] Advances in Few-Shot Action Recognition: A Comprehensive Review
    Ruan, Zanxi
    Wei, Yingmei
    Yuan, Yifei
    Li, Yu
    Guo, Yanming
    Xie, Yuxiang
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 390 - 398
  • [49] Motion-modulated Temporal Fragment Alignment Network For Few-Shot Action Recognition
    Wu, Jiamin
    Zhang, Tianzhu
    Zhang, Zhe
    Wu, Feng
    Zhang, Yongdong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9141 - 9150
  • [50] A Generative Approach to Zero-Shot and Few-Shot Action Recognition
    Mishra, Ashish
    Verma, Vinay Kumar
    Reddy, M. Shiva Krishna
    Arulkumar, S.
    Rai, Piyush
    Mittal, Anurag
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 372 - 380