Enhanced Residual Networks via Mixed Knowledge Fraction

被引:0
|
作者
Tang S. [1 ]
Ye P. [1 ]
Lin W. [1 ]
Chen T. [1 ]
机构
[1] School of Information Science and Technology, Fudan University, Shanghai
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Deep Learning; Knowledge Distillation; Network Enhancement; Neural Network; Residual Network;
D O I
10.16451/j.cnki.issn1003-6059.202404004
中图分类号
学科分类号
摘要
Methods such as stimulative training and group knowledge based training are employed to collect group knowledge from shallow subnets in residual networks for self-distillation, thereby enhancing network performance. However, the group knowledge acquired by these methods suffers from issues such as slow updating and difficulties in combining with DataMix techniques. To address these issues, enhanced residual networks via mixed knowledge fraction(MKF) are proposed. The mixed knowledge is decomposed and modeled as quadratic programming by minimizing the fraction loss, and thus high-quality group knowledge is obtained from the mixed knowledge. To improve the robustness and diversity of the knowledge, a compound DataMix technique is proposed to construct a composite data augmentation method. Different from high-precision optimization algorithms with poor efficiency, a simple and efficient linear knowledge fraction technique is designed. The previous group knowledge is taken as knowledge bases, and the mixed knowledge is decomposed based on the knowledge bases. The enhanced group knowledge is then adopted to distill sampled subnetworks. Experiments on mainstream residual networks and classification datasets verify the effectiveness of MKF. © 2024 Science Press. All rights reserved.
引用
收藏
页码:328 / 338
页数:10
相关论文
共 51 条
  • [1] Wightman R., Touvron H., Jegou H., ResNet Strikes Back: An Improved Training Procedure in Timm [C/OL]
  • [2] Ding X.H., Zhang X.Y., Ma N.N., Et al., RepVGG: Making VGG-Style ConvNets Great Again, Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13728-13737, (2021)
  • [3] He K.M., Zhang X.Y., Ren S.Q., Et al., Deep Residual Learning for Image Recognition, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)
  • [4] Srivastava S., Sharma G., OmniVec: Learning Robust Representations with Cross Modal Sharing, Proc of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1225-1237, (2024)
  • [5] Ronneberger O., Fischer P., Brox T., U-Net: Convolutional Networks for Biomedical Image Segmentation, Proc of the 18th International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 234-241, (2015)
  • [6] Ye P., Li B.P., Chen T., Et al., Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation, International Journal of Computer Vision, 130, 11, pp. 2674-2694, (2022)
  • [7] Wang W.H., Dai J.F., Chen Z., Et al., InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions, Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14408-14419, (2023)
  • [8] Zhao H.S., Shi J.P., Qi X.J., Et al., Pyramid Scene Parsing Network, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6230-6239, (2017)
  • [9] Girshick R., Fast R-CNN, Proc of the IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)
  • [10] Ren S.Q., He K.M., Girshick R., Et al., Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)