Towards Accurate Oriented Object Detection in Aerial Images with Adaptive Multi-level Feature Fusion

Cited by: 18
Authors
Zhen, Peining [1]
Wang, Shuqi [1]
Zhang, Suming [2]
Yan, Xiaotao [2]
Wang, Wei [2]
Ji, Zhigang [1]
Chen, Hai-Bao [1]
Affiliations
[1] Shanghai Jiao Tong Univ, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
[2] Beijing Inst Astronaut Syst Engn, 1 Donggaodi South St, Beijing 100076, Peoples R China
Keywords
Remote sensing images; aerial images; oriented object detection; convolutional neural network
DOI
10.1145/3513133
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Detecting objects in aerial images is a long-standing and challenging problem since the objects in aerial images vary dramatically in size and orientation. Most existing neural network based methods are not robust enough to provide accurate oriented object detection results in aerial images since they do not consider the correlations between different levels and scales of features. In this paper, we propose a novel two-stage network-based detector with adaptive feature fusion towards highly accurate oriented object detection in aerial images, named AFF-Det. First, a multi-scale feature fusion module (MSFF) is built on the top layer of the extracted feature pyramids to mitigate the semantic information loss in the small-scale features. We also propose a cascaded oriented bounding box regression method to transform the horizontal proposals into oriented ones. Then the transformed proposals are assigned to all feature pyramid network (FPN) levels and aggregated by the weighted RoI feature aggregation (WRFA) module. The above modules can adaptively enhance the feature representations in different stages of the network based on the attention mechanism. Finally, a rotated decoupled-RCNN head is introduced to obtain the classification and localization results. Extensive experiments are conducted on the DOTA and HRSC2016 datasets to demonstrate the advantages of our proposed AFF-Det. The best detection results can achieve 80.73% mAP and 90.48% mAP, respectively, on these two datasets, outperforming recent state-of-the-art methods.
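The abstract says that transformed proposals are assigned to all FPN levels and then aggregated by the weighted RoI feature aggregation (WRFA) module, but it does not say how the weights are computed. The following is a minimal sketch of the general idea only, not the authors' implementation. It assumes that each pyramid level yields one feature vector for a given RoI, that a per-level score is supplied by some attention branch (not modeled here), and that the scores are normalized with a softmax. The function names and the softmax choice are illustrative assumptions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_roi_features(level_features, level_scores):
    """Fuse per-level RoI feature vectors into one vector.

    level_features: one feature vector (list of floats) per pyramid level,
                    all of the same length.
    level_scores:   one raw attention score per level (hypothetical input;
                    in a real detector this would come from a learned branch).
    Returns the fused vector and the normalized per-level weights.
    """
    if len(level_features) != len(level_scores):
        raise ValueError("need exactly one score per pyramid level")
    weights = softmax(level_scores)
    fused = [0.0] * len(level_features[0])
    for weight, feature in zip(weights, level_features):
        for i, value in enumerate(feature):
            fused[i] += weight * value
    return fused, weights


# Two pyramid levels with equal scores: the result is the plain average.
fused, weights = aggregate_roi_features([[1.0, 2.0], [3.0, 4.0]], [0.0, 0.0])
```

With equal scores the weights are uniform and the fused vector is the element-wise mean of the level features; raising one level's score shifts the fused vector towards that level.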
Pages: 22
Related papers
50 in total
  • [1] Multi-level Feature Selection for Oriented Object Detection
    Jiang, Chen
    Jiang, Yefan
    Bian, Zhangxing
    Yang, Fan
    Xia, Siyu
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021 : 36 - 43
  • [2] Multi-level feature fusion pyramid network for object detection
    Guo, Zebin
    Shuai, Hui
    Liu, Guangcan
    Zhu, Yisheng
    Wang, Wenqing
    VISUAL COMPUTER, 2023, 39 (09) : 4267 - 4277
  • [3] Multi-level feature fusion pyramid network for object detection
    Zebin Guo
    Hui Shuai
    Guangcan Liu
    Yisheng Zhu
    Wenqing Wang
    The Visual Computer, 2023, 39 : 4267 - 4277
  • [4] Attention-Based Multi-Level Feature Fusion for Object Detection in Remote Sensing Images
    Dong, Xiaohu
    Qin, Yao
    Gao, Yinghui
    Fu, Ruigang
    Liu, Songlin
    Ye, Yuanxin
    REMOTE SENSING, 2022, 14 (15)
  • [5] A multi-level feature weight fusion model for salient object detection
    Zhang, Shanqing
    Chen, Yujie
    Meng, Yiheng
    Lu, Jianfeng
    Li, Li
    Bai, Rui
    MULTIMEDIA SYSTEMS, 2023, 29 (03) : 887 - 895
  • [6] A multi-level feature weight fusion model for salient object detection
    Zhang Shanqing
    Chen Yujie
    Meng Yiheng
    Lu Jianfeng
    Li Li
    Bai Rui
    Multimedia Systems, 2023, 29 : 887 - 895
  • [7] Multi-level feature enhancement network for object detection in sonar images
    Zhou, Xin
    Zhou, Zihan
    Wang, Manying
    Ning, Bo
    Wang, Yanhao
    Zhu, Pengli
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100
  • [8] Adaptive multi-level feature fusion and attention-based network for arbitrary-oriented object detection in remote sensing imagery
    Chen, Luchang
    Liu, Chunsheng
    Chang, Faliang
    Li, Shuang
    Nie, Zhaoying
    NEUROCOMPUTING, 2021, 451 : 67 - 80
  • [9] Salient Object Detection Based on Multi-scale Feature Extraction and Multi-level Feature Fusion
    Li, Lingli
    Meng, Lingbing
    Li, Jinbao
    Gongcheng Kexue Yu Jishu/Advanced Engineering Sciences, 2021, 53 (01) : 170 - 177
  • [10] MLSA-YOLO: a multi-level feature fusion and scale-adaptive framework for small object detection
    Peng, Jiayu
    Lv, Kai
    Wang, Guoliang
    Xiao, Wendong
    Ran, Teng
    Yuan, Liang
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (04)