EMPC: Efficient multi-view parallel co-learning for semi-supervised action recognition

被引:1
|
作者
Tong, Anyang [1 ,2 ]
Tang, Chao [1 ,2 ]
Wang, Wenjian [3 ]
机构
[1] Hefei Univ, Sch Artificial Intelligence & Big Data, Hefei 230601, Anhui, Peoples R China
[2] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Anhui, Peoples R China
[3] Shanxi Univ, Sch Comp & Informat Sci, Taiyuan 030006, Shanxi, Peoples R China
关键词
Action recognition; Semi-supervised learning; Temporal gradient; Co-learning; Dropout;
D O I
10.1016/j.eswa.2024.124634
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning (SSL) is an effective approach to address the challenge of limited labeled data in action recognition. Existing methods have explored temporal augmentation and consistent learning, which have received widespread attention. However, these methods come with an exponential increase in computational effort, and neglect the potential for divergence and collaboration between modalities. Additionally, the models may exhibit randomness during pseudo-label evaluation and inconsistency between training and inference. To address these challenges, we propose an efficient multi-view parallel co-learning (EMPC) method for semisupervised action recognition. First, we explore the temporal gradient (TG) and create a new view that contains rich motion history information, called the historical temporal gradient (HTG). Second, inspired by the working mechanism of Dropout, we assemble a low-computational multi-functional committee (MFC) and perform pseudo-label editing based on two evaluation criteria: confidence and consistency. We further design a new regularization strategy based on MFC, called mean regularized dropout (MR-Drop), which measures and reduces the output distribution's uncertainty between sub-models to improve the model's performance. Finally, based on the complementary information between RGB and HTG views, we build an efficient parallel network with multi-view feature sharing and pseudo-label collaboration. We evaluate EMPC on three public datasets: UCF-101, HMDB-51, and Kinetics-100. The experimental results demonstrate that EMPC achieves better classification performance with a limited amount of labeled data and a large amount of unlabeled data.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Semi-supervised Multi-view Sentiment Analysis
    Lazarova, Gergana
    Koychev, Ivan
    COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT I, 2015, 9329 : 181 - 190
  • [42] Semi-supervised multi-view concept decomposition
    Jiang, Qi
    Zhou, Guoxu
    Zhao, Qibin
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [43] Semi-supervised Deep Multi-view Stereo
    Xu, Hongbin
    Chen, Weitao
    Liu, Yang
    Zhou, Zhipeng
    Xiao, Haihong
    Sun, Baigui
    Xie, Xuansong
    Kang, Wenxiong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4616 - 4625
  • [44] Latent Multi-view Semi-Supervised Classification
    Bo, Xiaofan
    Kang, Zhao
    Zhao, Zhitong
    Su, Yuanzhang
    Chen, Wenyu
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 348 - 362
  • [45] Safe Multi-view Co-training for Semi-supervised Regression
    Liu, Li Yan
    Huang, Peng
    Min, Fan
    2022 IEEE 9TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2022, : 56 - 65
  • [46] MMatch: Semi-Supervised Discriminative Representation Learning for Multi-View Classification
    Wang, Xiaoli
    Fu, Liyong
    Zhang, Yudong
    Wang, Yongli
    Li, Zechao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6425 - 6436
  • [47] Semi-supervised Unified Latent Factor learning with multi-view data
    Jiang, Yu
    Liu, Jing
    Li, Zechao
    Lu, Hanqing
    MACHINE VISION AND APPLICATIONS, 2014, 25 (07) : 1635 - 1645
  • [48] Semi-supervised Multi-view Manifold Discriminant Intact Space Learning
    Han, Lu
    Wu, Fei
    Jing, Xiao-Yuan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (09): : 4317 - 4335
  • [49] Semi-Supervised Learning and Feature Fusion for Multi-view Data Clustering
    Salman, Hadi
    Zhan, Justin
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 645 - 650
  • [50] Semi-supervised Unified Latent Factor learning with multi-view data
    Yu Jiang
    Jing Liu
    Zechao Li
    Hanqing Lu
    Machine Vision and Applications, 2014, 25 : 1635 - 1645