Interpretable Object Detection Method for Remote Sensing Image Based on Deep Reinforcement Learning

被引:0
|
作者
Zhao J. [1 ,2 ,3 ]
Zhang D. [1 ,2 ]
Zhou Y. [1 ,2 ]
Chen S. [1 ,2 ]
Tang J. [1 ,2 ]
Yao R. [1 ,2 ]
机构
[1] School of Computer Science and Technology, China University of Mining and Technology, Xuzhou
[2] Engineering Research Center of Mine Digitization of Ministry of Education, China University of Mining and Technology, Xuzhou
[3] Innovation Research Center of Disaster Intelligent Prevention and Emergency Rescue, China University of Mining and Technology, Xuzhou
基金
中国国家自然科学基金;
关键词
Deep Reinforcement Learning; Object Detection; Remote Sensing Image; Reward Function;
D O I
10.16451/j.cnki.issn1003-6059.202109001
中图分类号
学科分类号
摘要
With the rapid development of remote sensing technology, object detection for remote sensing image is widely applied in many fields,such as resource exploration, urban planning and natural disaster assessment. Aiming at the complex background and the small target scale of remote sensing images, an interpretable object detection method for remote sensing image based on deep reinforcement learning is proposed. Firstly, deep reinforcement learning is applied to the region proposal network in faster region-convolutional neural network to improve the detection accuracy of remote sensing images by modifying the excitation function. Secondly, the detection speed and portability of the model are improved by lightening the original backbone network with a large number of parameters. Finally, the interpretability of the hidden layer representation in the model is quantified using the network anatomy method to endow the model with an interpretable concept of human understanding. Experiments on three public remote sensing datasets show that the performance of the proposed method is improved and the effectiveness of the proposed method is verified by the improved network anatomy method. © 2021, Science Press. All right reserved.
引用
收藏
页码:777 / 786
页数:9
相关论文
共 36 条
  • [1] ZHANG H, LIU W J, LU H H., Feature Extraction for Hyperspectral Remote Sensing Image Based on Local Fisher Discriminant Analysis with Wavelet Kernel, Pattern Recognition and Artificial Intelligence, 32, 7, pp. 624-632, (2019)
  • [2] CHU J, ZHU X Y, LENG L, Et al., Target Detector with Channel Attention and Residual Learning, Pattern Recognition and Artificial Intelligence, 33, 10, pp. 889-897, (2020)
  • [3] GIRSHICK R, DONAHUE J, DARRELL T, Et al., Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
  • [4] GIRSHICK R., Fast R-CNN, Proc of the IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)
  • [5] REN S Q, HE K M, GIRSHICK R, Et al., Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)
  • [6] REDMON J, FARHADI A., YOLO9000: Better, Faster, Stronger, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517-6525, (2017)
  • [7] LIU W, ANGUELOV D, ERHAN D, Et al., SSD: Single Shot Multibox Detector, Proc of the European Conference on Computer Vision, pp. 21-37, (2016)
  • [8] REDMON J, DIVVALA S, GIRSHICK R, Et al., You Only Look Once: Unified, Real-Time Object Detection, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)
  • [9] LAW H, DENG J., CornerNet: Detecting Objects as Paired Keypoints, International Journal of Computer Vision, 128, pp. 642-656, (2020)
  • [10] ZHOU X Y, ZHUO J C, KRAHENBUHL P., Bottom-Up Object Detection by Grouping Extreme and Center Points, Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 850-859, (2019)