The Staged Knowledge Distillation in Video Classification: Harmonizing Student Progress by a Complementary Weakly Supervised Framework

Cited by: 2
Authors
Wang, Chao [1]
Tang, Zheng [2]
Affiliations
[1] China Academy of Railway Sciences, Beijing 100081, People's Republic of China
[2] NVIDIA, Redmond, WA 98052, USA
Keywords
Training; Uncertainty; Correlation; Generators; Data models; Task analysis; Computational modeling; Knowledge distillation; weakly supervised learning; teacher-student architecture; substage learning process; video classification; label-efficient learning
DOI
10.1109/TCSVT.2023.3294977
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
In the context of label-efficient learning on video data, the distillation method and the structural design of the teacher-student architecture have a significant impact on knowledge distillation. However, the relationship between these factors has been overlooked in previous research. To address this gap, we propose a new weakly supervised learning framework for knowledge distillation in video classification that is designed to improve the efficiency and accuracy of the student model. Our approach leverages the concept of substage-based learning to distill knowledge based on the combination of student substages and the correlation of corresponding substages. We also employ the progressive cascade training method to address the accuracy loss caused by the large capacity gap between the teacher and the student. Additionally, we propose a pseudo-label optimization strategy to improve the initial data labels. To optimize the loss functions of different distillation substages during the training process, we introduce a new loss method based on feature distribution. We conduct extensive experiments on both real and simulated data sets, demonstrating that our proposed approach outperforms existing distillation methods on video classification tasks. Our proposed substage-based distillation approach has the potential to inform future research on label-efficient learning for video data.
Pages: 6646-6660
Number of pages: 15