Pre-locate net for object detection in high-resolution images

Cited by: 0
Authors
ZHANG, Yunhao [1]
XU, Ting-Bing [1]
WEI, Zhenzhong [1]
Affiliations
[1] Key Laboratory of Precision Opto-mechatronics Technology, Ministry of Education, School of Instrumentation and Optoelectronic Engineering, Beihang University, Beijing 100083, China
Source
Chinese Journal of Aeronautics | 2022 / Vol. 35 / No. 10
Keywords
652.3 Aircraft Instruments and Equipment - 716.2 Radar Systems and Equipment - 723.2 Data Processing and Image Processing;
DOI
Not available
Chinese Library Classification (CLC) number
Subject classification code
Abstract
Small-object detection has long been a challenge. In industry, high-megapixel cameras are used to address this problem; however, current detectors are inefficient on high-resolution images. In this work, we propose a new module called Pre-Locate Net, a plug-and-play structure that can be combined with most popular detectors. Inspired by classification, we use it to obtain candidate regions in images, which greatly reduces the amount of computation and thus enables rapid detection in high-resolution images. Pre-Locate Net consists mainly of two parts: candidate region classification and behavior classification. Candidate region classification is used to obtain a candidate region, and behavior classification is used to estimate the scale of an object. Different follow-up processing is applied according to the estimated scale to balance the variance of the network input. Unlike popular candidate region generation methods, we abandon bounding-box regression and adopt classification instead, so that a candidate region can be predicted in the shallow network. We build a high-resolution dataset of aircraft and landing gears covering complex scenes to verify the effectiveness of our method. Compared to state-of-the-art detectors (e.g., Guided Anchoring, Libra R-CNN, and FSAF), our method achieves the best mAP of 94.5 on 1920 × 1080 images at 16.7 FPS. © 2021 Chinese Society of Aeronautics and Astronautics
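The idea of obtaining a candidate region by classification rather than by bounding-box regression can be illustrated with a minimal sketch. This is not the authors' implementation: the grid size, the threshold, the function names (`cell_scores`, `candidate_region`, `pick_processing`), and the mean-intensity score that stands in for a learned classifier are all assumptions made for illustration. The scale rule at the end is likewise only a placeholder for what the abstract calls behavior classification.

```python
import numpy as np


def cell_scores(image, grid=(9, 16)):
    """Score every grid cell of the image.

    Stand-in for a learned candidate-region classifier (assumption):
    the score is simply the mean intensity of the cell.
    """
    h, w = image.shape[:2]
    rows, cols = grid
    ys = np.linspace(0, h, rows + 1, dtype=int)  # cell row boundaries
    xs = np.linspace(0, w, cols + 1, dtype=int)  # cell column boundaries
    scores = np.zeros(grid)
    for r in range(rows):
        for c in range(cols):
            scores[r, c] = image[ys[r]:ys[r + 1], xs[c]:xs[c + 1]].mean()
    return scores, ys, xs


def candidate_region(scores, ys, xs, threshold):
    """Return (x0, y0, x1, y1) enclosing all positive cells, or None."""
    rr, cc = np.nonzero(scores > threshold)
    if rr.size == 0:
        return None
    return (int(xs[cc.min()]), int(ys[rr.min()]),
            int(xs[cc.max() + 1]), int(ys[rr.max() + 1]))


def pick_processing(box, small_side=256):
    """Hypothetical scale rule: small regions are cropped as-is,
    larger ones are cropped and then downsampled before detection."""
    x0, y0, x1, y1 = box
    return "crop" if max(x1 - x0, y1 - y0) <= small_side else "crop_and_downsample"


# Synthetic 1920 x 1080 frame with one bright 100 x 100 "object".
image = np.zeros((1080, 1920), dtype=float)
image[400:500, 900:1000] = 1.0

scores, ys, xs = cell_scores(image)
box = candidate_region(scores, ys, xs, threshold=0.05)
x0, y0, x1, y1 = box
fraction = (x1 - x0) * (y1 - y0) / (image.shape[0] * image.shape[1])
print(box, pick_processing(box), round(fraction, 4))
```

In this toy frame the downstream detector would only need to process the cropped candidate region, a small fraction of the full image, which is the source of the computational saving the abstract describes.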
Pages: 313 - 325
Related papers
50 in total
  • [1] Pre-locate net for object detection in high-resolution images
    Zhang, Yunhao
    Xu, Ting-Bing
    Wei, Zhenzhong
    CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (10) : 313 - 325
  • [2] Pre-locate net for object detection in high-resolution images
    Yunhao ZHANG
    Tingbing XU
    Zhenzhong WEI
    Chinese Journal of Aeronautics, 2022, (10) : 313 - 325
  • [3] Pre-locate net for object detection in high-resolution images
    Yunhao ZHANG
    Tingbing XU
    Zhenzhong WEI
    Chinese Journal of Aeronautics, 2022, 35 (10) : 313 - 325
  • [4] ANALYSIS OF HIGH-RESOLUTION AERIAL IMAGES FOR OBJECT DETECTION
    TRIVEDI, MM
    BOKIL, AG
    TAKLA, MB
    MAKSYMONKO, GB
    BROACH, JT
    ADVANCES IN IMAGE COMPRESSION AND AUTOMATIC TARGET RECOGNITION, 1989, 1099 : 58 - 65
  • [5] ESOD: Efficient Small Object Detection on High-Resolution Images
    Liu, Kai
    Fu, Zhihang
    Jin, Sheng
    Chen, Ze
    Zhou, Fan
    Jiang, Rongxin
    Chen, Yaowu
    Ye, Jieping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 183 - 195
  • [6] Hierarchical object detection for very high-resolution satellite images
    Wu, Zhi-Ze
    Wang, Xiao-Feng
    Zou, Le
    Xu, Li-Xiang
    Li, Xin-Lu
    Weise, Thomas
    APPLIED SOFT COMPUTING, 2021, 113
  • [7] Cascade multi-scale object detection on high-resolution images
    Novoselov, Alexey
    Dyakov, Oleg
    Kostromin, Igor
    Pogibelskiy, Dmitry
    2019 INTERNATIONAL CONFERENCE ON ENGINEERING AND TELECOMMUNICATION (ENT), 2019
  • [8] High-Resolution Polar Network for Object Detection in Remote Sensing Images
    He, Xu
    Ma, Shiping
    He, Linyuan
    Ru, Le
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [9] High-Resolution Polar Network for Object Detection in Remote Sensing Images
    He, Xu
    Ma, Shiping
    He, Linyuan
    Ru, Le
    IEEE Geoscience and Remote Sensing Letters, 2022, 19
  • [10] FAST LEARNABLE OBJECT TRACKING AND DETECTION IN HIGH-RESOLUTION OMNIDIRECTIONAL IMAGES
    Hurych, David
    Zimmermann, Karel
    Svoboda, Tomas
    VISAPP 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, 2011 : 521 - 530