EMPC: Efficient multi-view parallel co-learning for semi-supervised action recognition

被引:1
|
作者
Tong, Anyang [1 ,2 ]
Tang, Chao [1 ,2 ]
Wang, Wenjian [3 ]
机构
[1] Hefei Univ, Sch Artificial Intelligence & Big Data, Hefei 230601, Anhui, Peoples R China
[2] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Anhui, Peoples R China
[3] Shanxi Univ, Sch Comp & Informat Sci, Taiyuan 030006, Shanxi, Peoples R China
关键词
Action recognition; Semi-supervised learning; Temporal gradient; Co-learning; Dropout;
D O I
10.1016/j.eswa.2024.124634
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning (SSL) is an effective approach to address the challenge of limited labeled data in action recognition. Existing methods have explored temporal augmentation and consistent learning, which have received widespread attention. However, these methods come with an exponential increase in computational effort, and neglect the potential for divergence and collaboration between modalities. Additionally, the models may exhibit randomness during pseudo-label evaluation and inconsistency between training and inference. To address these challenges, we propose an efficient multi-view parallel co-learning (EMPC) method for semisupervised action recognition. First, we explore the temporal gradient (TG) and create a new view that contains rich motion history information, called the historical temporal gradient (HTG). Second, inspired by the working mechanism of Dropout, we assemble a low-computational multi-functional committee (MFC) and perform pseudo-label editing based on two evaluation criteria: confidence and consistency. We further design a new regularization strategy based on MFC, called mean regularized dropout (MR-Drop), which measures and reduces the output distribution's uncertainty between sub-models to improve the model's performance. Finally, based on the complementary information between RGB and HTG views, we build an efficient parallel network with multi-view feature sharing and pseudo-label collaboration. We evaluate EMPC on three public datasets: UCF-101, HMDB-51, and Kinetics-100. The experimental results demonstrate that EMPC achieves better classification performance with a limited amount of labeled data and a large amount of unlabeled data.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Research on multi-view semi-supervised learning algorithm based on Co-learning
    Wang, Xing-Qi
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1276 - 1280
  • [2] Human Action Recognition Based on Multi-view Semi-supervised Learning
    Tang C.
    Wang W.
    Wang X.
    Zhang C.
    Zou L.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (04): : 376 - 384
  • [3] Regularized extreme learning machine for multi-view semi-supervised action recognition
    Iosifidis, Alexandros
    Tefas, Anastasios
    Pitas, Ioannis
    NEUROCOMPUTING, 2014, 145 : 250 - 262
  • [4] Co-GCN for Multi-View Semi-Supervised Learning
    Li, Shu
    Li, Wen-Tao
    Wang, Wei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4691 - 4698
  • [5] View Construction for Multi-view Semi-supervised Learning
    Sun, Shiliang
    Jin, Feng
    Tu, Wenting
    ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT I, 2011, 6675 : 595 - 601
  • [6] Multi-view classification with semi-supervised learning for SAR target recognition
    Zhang, Yukun
    Guo, Xiansheng
    Ren, Haohao
    Li, Lin
    SIGNAL PROCESSING, 2021, 183
  • [7] Efficient multi-view semi-supervised feature selection
    Zhang, Chenglong
    Jiang, Bingbing
    Wang, Zidong
    Yang, Jie
    Lu, Yangfeng
    Wu, Xingyu
    Sheng, Weiguo
    INFORMATION SCIENCES, 2023, 649
  • [8] Multi-view Learning for Semi-supervised Sentiment Classification
    Su, Yan
    Li, Shoushan
    Ju, Shengfeng
    Zhou, Guodong
    Li, Xiaojun
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 13 - 16
  • [9] Multi-view semi-supervised learning for image classification
    Zhu, Songhao
    Sun, Xian
    Jin, Dongliang
    NEUROCOMPUTING, 2016, 208 : 136 - 142
  • [10] A Multi-view Regularization Method for Semi-supervised Learning
    Wang, Jiao
    Luo, Siwei
    Li, Yan
    ADVANCES IN NEURAL NETWORKS - ISNN 2010, PT 1, PROCEEDINGS, 2010, 6063 : 444 - 449