mSODANet: A network for multi-scale object detection in aerial images using hierarchical dilated convolutions *

被引:73
|
作者
Chalavadi, Vishnu [1 ]
Jeripothula, Prudviraj [1 ]
Datla, Rajeshreddy [1 ,2 ]
Babu, Sobhan Ch [1 ]
Mohan, Krishna C. [1 ]
机构
[1] Indian Inst Technol Hyderabad, Dept Comp Sci & Engn, Visual Learning & Intelligence Grp VIGIL, Kandi 502285, Sangareddy, India
[2] Adv Data Proc Res Inst ADRIN, Dept Space, Akbar Rd, Manovikas Nagar 500009, Secunderabad, India
关键词
Multi-scale object detection; Contextual features; Dilated convolutions; Aerial images;
D O I
10.1016/j.patcog.2022.108548
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A B S T R A C T The object detection in aerial images is one of the most commonly used tasks in the wide-range of computer vision applications. However, the object detection is more challenging due to the following issues: (a) the pixel occupancy vary among the different scales of objects, (b) the distribution of objects is not uniform in aerial images, (c) the appearance of an object varies with different view-points and illumination conditions, and (d) the number of objects, even though they belong to same type, vary across the images. To address these issues, we propose a novel network for multi-scale object detection in aerial images using hierarchical dilated convolutions, called as mSODANet. In particular, we probe hierarchical dilated network using parallel dilated convolutions to learn the contextual information of different types of objects at multiple scales and multiple field-of-views. The introduced hierarchical dilated network captures the visual information of aerial image more effectively and enhances the detection capability of the model. Further, the extensive experiments conducted on three challenging publicly available datasets, i.e., Visdrone2019, DOTA (OBB & HBB), NWPU VHR-10, demonstrate the effectiveness of the proposed mSODANet and achieve the state-of-the-art performance on all three datasets. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] Attentional single-shot network with multi-scale feature fusion for object detection in aerial images
    Wang, Yusheng
    Wang, Hongzhang
    Tang, Eryong
    Liu, Ye
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4754 - 4758
  • [12] MSNet: Multi-Scale Network for Object Detection in Remote Sensing Images
    Gao, Tao
    Xia, Shilin
    Liu, Mengkun
    Zhang, Jing
    Chen, Ting
    Li, Ziqi
    PATTERN RECOGNITION, 2025, 158
  • [13] Image Deblurring using Multi-Scale Dilated Convolutions in a LSTM-based Neural Network
    Richmond, Greig
    Cole-Rhodes, Arlene
    2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [14] Pediatric Seizure Prediction in Scalp EEG Using a Multi-Scale Neural Network With Dilated Convolutions
    Gao, Yikai
    Chen, Xun
    Liu, Aiping
    Liang, Deng
    Wu, Le
    Qian, Ruobing
    Xie, Hongtao
    Zhang, Yongdong
    IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2022, 10
  • [15] Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
    Yang, Feng
    Li, Wentong
    Hu, Haiwei
    Li, Wanyi
    Wang, Peng
    SENSORS, 2020, 20 (06)
  • [16] Scale Enhancement Network for Object Detection in Aerial Images
    Mao, Shihan
    Wang, Zhi
    He, Qineng
    Zhu, Zhangqing
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (02)
  • [17] SSN: Scale Selection Network for Multi-Scale Object Detection in Remote Sensing Images
    Lin, Zhili
    Leng, Biao
    REMOTE SENSING, 2024, 16 (19)
  • [18] A Scale-Aware Pyramid Network for Multi-Scale Object Detection in SAR Images
    Tang, Linbo
    Tang, Wei
    Qu, Xin
    Han, Yuqi
    Wang, Wenzheng
    Zhao, Baojun
    REMOTE SENSING, 2022, 14 (04)
  • [19] Lightweight Object Detection Combined with Multi-Scale Dilated-Convolution and Multi-Scale Deconvolution
    Yi, Qingming
    Lü, Renyi
    Shi, Min
    Luo, Aiwen
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (12): : 41 - 48
  • [20] A Contextual Deep Neural Network with Dilated Convolutions for Object Detection in Remote Sensing Images
    Wan, Shouhong
    Li, Xingyue
    Jin, Peiquan
    Zou, Chang
    TENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2018), 2018, 10806