Class-centric Knowledge Distillation for RSI Scene Classification

被引:0
|
作者
Liu X. [1 ]
Liu Z. [1 ]
Lin Y. [1 ]
Wang S. [1 ]
Zuo X. [1 ]
机构
[1] Institute of Geospatial Information, Information Engineering University, Zhengzhou
关键词
class center; convolutional neural network; deep learning; knowledge distillation; model compression; remote sensing; Reproducing Kernel Hilbert Space; scene classification;
D O I
10.12082/dqxxkx.2023.220781
中图分类号
学科分类号
摘要
Convolutional neural networks have been widely used in the task of Remote Sensing Image Scene Classification (RSISC) and have achieved extraordinary performance. However, these excellent models have large volume and high computational cost, which cannot be deployed to resource- constrained edge devices. Moreover, in the RSISC task, the existing knowledge distillation method is directly applied to the compression model, ignoring the intra-class diversity and inter-class similarity of scene data. To this end, we propose a novel class- centric knowledge distillation method, which aims to obtain a compact, efficient, and accurate network model for RSISC. The proposed class-centric knowledge distillation framework for remote sensing image scene classification consists of two streams, teacher network flow and student network flow. Firstly, the remote sensing image scene classification dataset is sent into the teacher network pre-trained on a large-scale dataset to fine-tune the parameters. Then, the class- centric knowledge of the hidden layer is extracted from the adjusted teacher network and transferred to the student network based on the designed class center distillation loss, which is realized by constraining the distance of the distribution center of similar features extracted by the teacher and student network, so that the student network can learn the powerful feature extraction ability of the teacher network. The distillation process is combined with the truth tag supervision. Finally, the trained student network is used for scene prediction from remote sensing images alone. To evaluate the proposed method, we design a comparison experiment with eight advanced distillation methods on classical remote sensing image scene classification with different training ratios and different teacher- student architectures. Our results show that: compared to the best performance of other distillation methods, in the case of the teacher- student network belonging to the same series, the overall classification accuracy of our proposed method is increased by 1.429% and 2.74%, respectively, with a given training ratio of 80% and 60%; and in the case of teacher-student networks belonging to different series, the classification accuracy is increased by 0.238% and 0.476%, respectively, with the two given ratios. Additionally, supplementary experiments are also carried out on a small data set of RSC11 with few classes and few samples, a multi-scale data set of RSSCN7 with few classes and multiple books, and a large complex data set of AID with many classes of heterogeneous samples. The results show that the proposed method has good generalization ability. Trough the comparison experiments with similar techniques, it is found that the proposed method can maintain excellent performance in challenging categories through confusion matrix, and the proposed distillation loss function can better deal with noise through testing error curve. And visualization analysis also shows that the proposed method can effectively deal with the problems of intra-class diversity and inter-class similarity in remote sensing image scenes. © 2023 Journal of Geo-information Science. All rights reserved.
引用
收藏
页码:1050 / 1063
页数:13
相关论文
共 48 条
  • [1] Ghazouani F, Farah I R, Solaiman B., A multi-level semantic scene interpretation strategy for change interpretation in remote sensing imagery[J], IEEE Transactions on Geoscience and Remote Sensing, 57, 11, pp. 8775-8795, (2019)
  • [2] Hu F, Xia G S, Hu J W, Et al., Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery[J], Remote Sensing, 7, 11, pp. 14680-14707, (2015)
  • [3] Gu Y T, Wang Y T, Li Y S., A survey on deep learning-driven remote sensing image scene understanding: Scene classification, scene retrieval and scene-guided object detection[J], Applied Sciences, 9, 10, (2019)
  • [4] Ojala T, Pietikainen M, Maenpaa T., Multiresolution gray-scale and rotation invariant texture classification with local binary patterns[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 24, 7, pp. 971-987, (2002)
  • [5] Zhu Q Q, Zhong Y F, Zhao B, Et al., Bag-of-visual-words scene classifier with local and global features for high spatial resolution remote sensing imagery[J], IEEE Geoscience and Remote Sensing Letters, 13, 6, pp. 747-751, (2016)
  • [6] Romero A, Gatta C, Camps-Valls G., Unsupervised deep feature extraction for remote sensing image classification[J], IEEE Transactions on Geoscience and Remote Sensing, 54, 3, pp. 1349-1362, (2016)
  • [7] Cheng G, Xie X X, Han J W, Et al., Remote sensing image scene classification meets deep learning: Challenges, methods, benchmarks, and opportunities[J], IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 13, pp. 3735-3756, (2020)
  • [8] Guo Z H, Liu W., Land type interpretation authenticity check of vector patch supported by deep learning and remote sensing image[J], Journal of Geo-Information Science, 22, 10, pp. 2051-2061, (2020)
  • [9] Yu D H, Zhang B M, Zhao C, Et al., Scene classification of remote sensing image using ensemble convolutional neural network[J], Journal of Remote Sensing, 24, 6, pp. 717-727, (2020)
  • [10] Sun H M, Lin Y W, Zou Q, Et al., Convolutional neural networks based remote sensing scene classification under clear and cloudy environments[C], 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pp. 713-720, (2021)