Learning Student Networks via Feature Embedding

被引:52
|
作者
Chen, Hanting [1 ]
Wang, Yunhe [2 ]
Xu, Chang [3 ]
Xu, Chao [1 ]
Tao, Dacheng [3 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci EECS, Cooperat Medianet Innovat Ctr, Key Lab Machine Percept,Minist Educ, Beijing 100871, Peoples R China
[2] Huawei Technol Co Ltd, Noahs Ark Lab, Beijing 100085, Peoples R China
[3] Univ Sydney, Fac Engn, Sch Comp Sci, Darlington, NSW 2008, Australia
基金
中国国家自然科学基金; 澳大利亚研究理事会;
关键词
Knowledge engineering; Convolution; Training; Learning systems; Computational complexity; Graphics processing units; Mobile handsets; Deep learning; knowledge distillation (KD); teacher-student learning;
D O I
10.1109/TNNLS.2020.2970494
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional neural networks have been widely used in numerous applications, but their demanding storage and computational resource requirements prevent their applications on mobile devices. Knowledge distillation aims to optimize a portable student network by taking the knowledge from a well-trained heavy teacher network. Traditional teacher-student-based methods used to rely on additional fully connected layers to bridge intermediate layers of teacher and student networks, which brings in a large number of auxiliary parameters. In contrast, this article aims to propagate information from teacher to student without introducing new variables that need to be optimized. We regard the teacher-student paradigm from a new perspective of feature embedding. By introducing the locality preserving loss, the student network is encouraged to generate the low-dimensional features that could inherit intrinsic properties of their corresponding high-dimensional features from the teacher network. The resulting portable network, thus, can naturally maintain the performance as that of the teacher network. Theoretical analysis is provided to justify the lower computation complexity of the proposed method. Experiments on benchmark data sets and well-trained networks suggest that the proposed algorithm is superior to state-of-the-art teacher-student learning methods in terms of computational and storage complexity.
引用
收藏
页码:25 / 35
页数:11
相关论文
共 50 条
  • [31] Video Manipulation Detection via Recurrent Residual Feature Learning Networks
    Howard, Matthew J.
    Williamson, Alexander S.
    Norouzi, Narges
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [32] Learning Optical Flow via Deformable Convolution and Feature Pyramid Networks
    Zhou Haiyun
    Xiang Xuezhi
    Zhang Rongfang
    Zhai Mingliang
    Ali, Syed Masroor
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 26 - 30
  • [33] Learning Markov Blankets for Continuous or Discrete Networks via Feature Selection
    Deng, Houtao
    Davila, Saylisse
    Runger, George
    Tuv, Eugene
    ENSEMBLES IN MACHINE LEARNING APPLICATIONS, 2011, 373 : 117 - +
  • [34] Centrality informed embedding of networks for temporal feature extraction
    Oggier, Frederique
    Datta, Anwitaman
    SOCIAL NETWORK ANALYSIS AND MINING, 2021, 11 (01)
  • [35] Centrality informed embedding of networks for temporal feature extraction
    Frédérique Oggier
    Anwitaman Datta
    Social Network Analysis and Mining, 2021, 11
  • [36] RED: Learning the role embedding in networks via Discrete-time quantum walk
    Wang, Xin
    Jian, Songlei
    Lu, Kai
    Zhang, Yi
    Liu, Kai
    APPLIED INTELLIGENCE, 2022, 52 (02) : 1493 - 1507
  • [37] RED: Learning the role embedding in networks via Discrete-time quantum walk
    Xin Wang
    Songlei Jian
    Kai Lu
    Yi Zhang
    Kai Liu
    Applied Intelligence, 2022, 52 : 1493 - 1507
  • [38] Feature Extraction via Sparse Difference Embedding (SDE)
    Wan, Minghua
    Lai, Zhihui
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (07): : 3594 - 3607
  • [39] Reaching the Rest: Embedding Sustainability in Undergraduate Student Learning
    Robinson, John
    Ariga, Ayako
    Cameron, Sean
    Wang, Ryan
    JOURNAL OF INTEGRATIVE ENVIRONMENTAL SCIENCES, 2022, 19 (01) : 171 - 187
  • [40] EMBEDDING AN ONLINE CAREER DEVELOPMENT PROGRAM INTO STUDENT LEARNING
    Thomson, Alison
    AUSTRALIAN JOURNAL OF CAREER DEVELOPMENT, 2010, 19 (03) : 6 - 14