UPL-Net: Uncertainty-aware prompt learning network for semi-supervised action recognition

被引:0
|
作者
Yang, Shu [1 ]
Li, Ya-Li [1 ]
Wang, Shengjin [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
关键词
Semi-supervised learning; Prompt learning; Vision-language pre-training; Action recognition; Uncertainty estimation;
D O I
10.1016/j.neucom.2024.129126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on understanding human behavior in videos by reframing the traditional video classification task as a transfer learning problem centered on visual concepts. Unlike existing action recognition approaches that rely solely on single-modal representations and video classifiers, our method leverages an uncertainty- aware prompt learning network (UPL-Net). This network is designed to extract spatiotemporal features that are pertinent to action-related concepts in videos while ensuring that the visual concepts derived from images are preserved. Furthermore, we introduce an uncertainty-guided semi-supervised learning strategy that harnesses unlabeled videos to enhance the model's generalizability. Extensive experiments conducted on benchmark datasets, namely UCF and HMDB, demonstrate the superiority of our approach over state-of-the-art semi- supervised action recognition methods. Notably, under a 1% labeling rate on the UCF dataset, our method achieves a significant improvement of 12.8%, underscoring its effectiveness in leveraging limited labeled data and abundant unlabeled videos for improved performance.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Multi-head co-training: An uncertainty-aware and robust semi-supervised learning framework
    Chen, Mingcai
    Wang, Chongjun
    KNOWLEDGE-BASED SYSTEMS, 2024, 302
  • [22] Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for Semi-Supervised Text Recognition
    Patel, Gaurav
    Allebach, Jan
    Qiu, Qiang
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6169 - 6179
  • [23] Uncertainty Aware Graph Gaussian Process for Semi-Supervised Learning
    Liu, Zhao-Yang
    Li, Shao-Yuan
    Chen, Songcan
    Hu, Yao
    Huang, Sheng-Jun
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4957 - 4964
  • [24] UNCERTAINTY-AWARE SEMI-SUPERVISED FRAMEWORK FOR AUTOMATIC SEGMENTATION OF MACULAR EDEMA IN OCT IMAGES
    Liu, Xiaoming
    Wang, Shaocheng
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 1453 - 1456
  • [25] Uncertainty-aware deep co-training for semi-supervised medical image segmentation
    Zheng, Xu
    Fu, Chong
    Xie, Haoyu
    Chen, Jialei
    Wang, Xingwei
    Sham, Chiu-Wing
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 149
  • [26] Uncertainty-aware pseudo-label and consistency for semi-supervised medical image segmentation
    Lu, Liyun
    Yin, Mengxiao
    Fu, Liyao
    Yang, Feng
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [27] Uncertainty-Aware Self-Training for Semi-Supervised Event Temporal Relation Extraction
    Cao, Pengfei
    Zuo, Xinyu
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    Bi, Wei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2900 - 2904
  • [28] Evaluation of semi-supervised learning method on action recognition
    Shen, Haoquan
    Yan, Yan
    Xu, Shicheng
    Ballas, Nicolas
    Chen, Wenzhi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (02) : 523 - 542
  • [29] 3D Semi-Supervised Learning with Uncertainty-Aware Multi-View Co-Training
    Xia, Yingda
    Liu, Fengze
    Yang, Dong
    Cai, Jinzheng
    Yu, Lequan
    Zhu, Zhuotun
    Xu, Daguang
    Yuille, Alan
    Roth, Holger
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 3635 - 3644
  • [30] Evaluation of semi-supervised learning method on action recognition
    Haoquan Shen
    Yan Yan
    Shicheng Xu
    Nicolas Ballas
    Wenzhi Chen
    Multimedia Tools and Applications, 2015, 74 : 523 - 542