Weakly Supervised Object Real-time Detection Based on High-resolution Class Activation Mapping Algorithm

被引:0
|
作者
Sun H. [1 ]
Shi Y. [1 ,2 ]
Zhang J. [1 ]
Wang R. [1 ]
Wang Y. [3 ]
机构
[1] College of Information Engineering and Automation, Civil Aviation University of China, Tianjin
[2] College of Artificial Intelligence, Nankai University, Tianjin
[3] Tianjin Binhai International Airport Co., Ltd., Tianjin
关键词
Class Activation Mapping(CAM) algorithm; Contrastive layer-wise relevance propagation theory; Object detection; Object-Aware Loss function(OA-Loss); Weakly supervised localization;
D O I
10.11999/JEIT230268
中图分类号
学科分类号
摘要
Thanks to the development of deep learning technology, object detection techniques have gained wide attention in various vision tasks. However, obtaining bounding box annotations for objects requires high time and labor costs, which hinders the application of object detection technology in practical scenarios. Therefore, a weakly supervised real-time object detection method based on high resolution class activation mapping algorithm is proposed, using only image class labels to reduce the dependence of network on object instance labels. It subdivides object detection into two subtasks: weakly supervised object localization and real-time object detection. In weakly supervised object localization task, a novel High Resolution Class Activation Mapping(HR-CAM) algorithm based on contrastive layer-wise relevance propagation theory is designed. It can obtain high quality class activation maps and generate pseudo detection annotation box. In real-time detection task, Single Shot multibox Detector(SSD) network as object detector is selected and an Object-Aware Loss function(OA-Loss) based on the class activation maps is designed. It can jointly supervise the training process of the SSD network with generated pseudo detection annotation box, to improve the networks' detection performance for objects. The experimental results show that the method proposed in this paper can achieve accurate and efficient object detection on the CUB200 and TJAB52 datasets, verifying the effectiveness and superiority of this method. © 2024 Science Press. All rights reserved.
引用
收藏
页码:1051 / 1059
页数:8
相关论文
共 32 条
  • [1] REN Shaoqing, HE Kaiming, GIRSHICK R, Et al., Faster RCNN: Towards real-time object detection with region proposal networks[C], The 28th International Conference on Neural Information Processing Systems, pp. 1137-1149, (2015)
  • [2] LIU Wei, ANGUELOV D, ERHAN D, Et al., SSD: Single shot multiBox detector[C], 14th European Conference on Computer Vision, pp. 21-37, (2016)
  • [3] REDMON J, DIVVALA S, GIRSHICK R, Et al., You only look once: Unified, real-time object detection[C], The 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)
  • [4] REDMON J, FARHADI A., YOLO9000: Better, faster, stronger[C], The IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517-6525, (2017)
  • [5] REDMON J, FARHADI A., YOLOv3: An incremental improvement, (2018)
  • [6] BOCHKOVSKIY A, WANG C Y, LIAO H Y M., YOLOv4: Optimal speed and accuracy of object detection, (2020)
  • [7] WANG Rui, SHI Yulong, SUN Hui, Et al., Lightweight-based high resolution bird flocking recognition deep learning network[J], Journal of Huazhong University of Science and Technology (Nature Science Edition), 51, 5, pp. 81-87, (2023)
  • [8] WANG Rui, LI Jinming, SHI Yulong, Et al., Vision-based path planning algorithm of unmanned bird-repelling vehicles in airports, (2022)
  • [9] CARBONNEAU M A, CHEPLYGINA V, GRANGER E, Et al., Multiple instance learning: A survey of problem characteristics and applications[J], Pattern Recognition, 77, pp. 329-353, (2018)
  • [10] CHENG Shuai, SUN Junxi, CAO Yonggang, Et al., Target tracking based on multiple instance deep learning[J], Journal of Electronics & Information Technology, 37, 12, pp. 2906-2912, (2015)