Semantic Assistance in SAR Object Detection: A Mask-Guided Approach

被引:0
|
作者
Liu, Wei [1 ]
Zhou, Lifan [1 ]
Zhong, Shan [1 ]
Gong, Shengrong [1 ]
机构
[1] Changshu Inst Technol, Suzhou 215500, Peoples R China
基金
中国国家自然科学基金;
关键词
DEtection TRansformer (DETR); object detection; segment anything model (SAM); synthetic aperture radar (SAR); PYRAMID NETWORK; FOCAL LOSS;
D O I
10.1109/JSTARS.2024.3481368
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The unique challenge in SAR object detection is the strong speckle noise inherent in SAR imagery. Existing learning-based works mainly focus on architectural enhancements, and fail to consider the valuable semantic information that can mitigate the effects of speckle noise. Large pretrained segment anything model (SAM) is a powerful foundational model with general semantic knowledge. However, SAM is not fully exploited for SAR object detection. This study paves the way for applying SAM for SAR object detection. Rather than fine-tuning the SAM network, we propose three mask-guided learning strategies by simply utilizing the semantic masks generated by SAM. Built upon the advanced RealTime DEtection TRansformer (RT-DETR) framework, the Semantic Assisted DETR, deemed as SA-DETR, integrates prior semantics from SAM into the SAR detection task. To be specific, first, we propose the mask-guided feature denoising module in the encoder stage, to enhance the network's discrimination of positives and negatives. Second, we propose the mask-guided query selection for initial query generation, which is beneficial for the decoder refinement. Finally, the mask-guided instance segmentation is proposed to achieve more accurate localization. To validate the superiority of the proposed SA-DETR, extensive experiments are conducted on two benchmark datasets, i.e., the SAR ship detection dataset (SSDD) and the recently published COCO-level large-scale multiclass SAR object detection dataset (SARDet-100K). Experimental results on both datasets outperform previous advanced detectors, achieving a new state-of-the-art with 99.0 $AP_{50}$ and 88.4 $mAP_{50}$ on SSDD and SARDet-100 K, respectively.
引用
收藏
页码:19395 / 19407
页数:13
相关论文
共 50 条
  • [1] Mask-guided SSD for small-object detection
    Sun, Chang
    Ai, Yibo
    Wang, Sheng
    Zhang, Weidong
    APPLIED INTELLIGENCE, 2021, 51 (06) : 3311 - 3322
  • [2] Mask-guided SSD for small-object detection
    Chang Sun
    Yibo Ai
    Sheng Wang
    Weidong Zhang
    Applied Intelligence, 2021, 51 : 3311 - 3322
  • [3] Mask-Guided Transformer for Human-Object Interaction Detection
    Ying, Daocheng
    Yang, Hua
    Sun, Jun
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [4] MEAD: a Mask-guidEd Anchor-free Detector for oriented aerial object detection
    Zewen He
    Zhida Ren
    Xuebing Yang
    Yang Yang
    Wensheng Zhang
    Applied Intelligence, 2022, 52 : 4382 - 4397
  • [5] MEAD: a Mask-guidEd Anchor-free Detector for oriented aerial object detection
    He, Zewen
    Ren, Zhida
    Yang, Xuebing
    Yang, Yang
    Zhang, Wensheng
    APPLIED INTELLIGENCE, 2022, 52 (04) : 4382 - 4397
  • [6] Mask-Guided Attention Network for Occluded Pedestrian Detection
    Pang, Yanwei
    Xie, Jin
    Khan, Muhammad Haris
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    Shao, Ling
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4966 - 4974
  • [7] MagDR: Mask-guided Detection and Reconstruction for Defending Deepfakes
    Chen, Zhikai
    Xie, Lingxi
    Pang, Shanmin
    He, Yong
    Zhang, Bo
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9010 - 9019
  • [8] Mask-guided Matting in the Wild
    Park, Kwanyong
    Woo, Sanghyun
    Oh, Seoung Wug
    Kweon, In So
    Lee, Joon-Young
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1992 - 2001
  • [9] Mask-guided explicit feature modulation for multispectral pedestrian detection
    Shen, Jifeng
    Liu, Yue
    Chen, Yifei
    Zuo, Xin
    Li, Jun
    Yang, Wankou
    Computers and Electrical Engineering, 2022, 103
  • [10] Mask-guided explicit feature modulation for multispectral pedestrian detection
    Shen, Jifeng
    Liu, Yue
    Chen, Yifei
    Zuo, Xin
    Li, Jun
    Yang, Wankou
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103