A Double Knowledge Distillation Model for Remote Sensing Image Scene Classification

被引:0
|
作者
Li D. [1 ]
Nan Y. [1 ]
Liu Y. [1 ]
机构
[1] School of Telecommunication and Information Engineering, Xi’an University of Posts and Telecommunications, Xi’an
基金
中国国家自然科学基金;
关键词
Dual Attention (DA); Knowledge distillation; Remote sensing image classification; Spatial Structure (SS);
D O I
10.11999/JEIT221017
中图分类号
学科分类号
摘要
In order to improve the accuracy of light-weight Convolutional Neural Networks (CNN) in the classification task of Remote Sensing Images (RSI) scene, a Double Knowledge Distillation (DKD) model combined with Dual-Attention (DA) and Spatial Structure (SS) is designed in this paper. First, new DA and SS modules are constructed and introduced into ResNet101 and lightweight CNN designed as teacher and student networks respectively. Then, a DA distillation loss function is constructed to transfer DA knowledge from teacher network to student network, so as to enhance their ability to extract local features from RSI. Finally, constructing a SS distillation loss function, migrating the semantic extraction ability in the teacher network to the student network in the form of a spatial structure to enhance its ability to express the high -level semantics of the RSI. The experimental results based on two standard data sets AID and NWPU-45 show that the performance of the student network after knowledge distillation is improved by 7.57% and 7.28% respectively under the condition of 20% training proportion, and the performance is still better than other methods under the condition of fewer parameters. © 2023 Science Press. All rights reserved.
引用
收藏
页码:3558 / 3567
页数:9
相关论文
共 28 条
  • [1] MA Shaopeng, LIANG Lu, TENG Shaohua, A lightweight hyperspectral remote sensing image classification method[J], Journal of Guangdong University of Technology, 38, 3, pp. 29-35, (2021)
  • [2] Deng PAN, ZHANG Meng, ZHANG Bo, A generic FCN-based approach for the road-network extraction from VHR remote sensing images–using OpenStreetMap as benchmarks[J], IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 14, pp. 2662-2673, (2021)
  • [3] JIANG Yanan, ZHANG Xin, ZHANG Chunlei, Et al., Classification of remote sensing images based on multi-scale feature fusion using local binary patterns[J], Remote Sensing for Natural Resources, 33, 3, pp. 36-44, (2021)
  • [4] CHAIB S, Yanfeng GU, YAO Hongxun, An informative feature selection method based on sparse PCA for VHR scene classification[J], IEEE Geoscience and Remote Sensing Letters, 13, 2, pp. 147-151, (2016)
  • [5] LI Yanfu, FAN Xijian, YANG Xubing, Et al., Remote sensing image classification framework based on self-attention convolutional neural network[J], Journal of Beijing Forestry University, 43, 10, pp. 81-88, (2021)
  • [6] XU Kejie, HUANG Hong, DENG Peifang, Et al., Deep feature aggregation framework driven by graph convolutional network for scene classification in remote sensing[J], IEEE Transactions on Neural Networks and Learning Systems, 33, 10, pp. 5751-5765, (2022)
  • [7] CHEN Sibao, WEI Qingsong, WANG Wenzhong, Et al., Remote sensing scene classification via multi-branch local attention network[J], IEEE Transactions on Image Processing, 31, pp. 99-109, (2021)
  • [8] CHEN Xi, XING Zhiqiang, CHENG Yuyang, Introduction to model compression knowledge distillation[C], 2021 6th International Conference on Intelligent Computing and Signal Processing, pp. 1464-1467, (2021)
  • [9] HE Kaiming, ZHANG Xiangyu, REN Shaoqing, Et al., Deep residual learning for image recognition[C], 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)
  • [10] LUO Yana, WANG Zhongsheng, An improved ResNet algorithm based on CBAM[C], 2021 International Conference on Computer Network, Electronic and Automation, pp. 121-125, (2021)