Semantic Assistance in SAR Object Detection: A Mask-Guided Approach

被引:0
|
作者
Liu, Wei [1 ]
Zhou, Lifan [1 ]
Zhong, Shan [1 ]
Gong, Shengrong [1 ]
机构
[1] Changshu Inst Technol, Suzhou 215500, Peoples R China
基金
中国国家自然科学基金;
关键词
DEtection TRansformer (DETR); object detection; segment anything model (SAM); synthetic aperture radar (SAR); PYRAMID NETWORK; FOCAL LOSS;
D O I
10.1109/JSTARS.2024.3481368
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The unique challenge in SAR object detection is the strong speckle noise inherent in SAR imagery. Existing learning-based works mainly focus on architectural enhancements, and fail to consider the valuable semantic information that can mitigate the effects of speckle noise. Large pretrained segment anything model (SAM) is a powerful foundational model with general semantic knowledge. However, SAM is not fully exploited for SAR object detection. This study paves the way for applying SAM for SAR object detection. Rather than fine-tuning the SAM network, we propose three mask-guided learning strategies by simply utilizing the semantic masks generated by SAM. Built upon the advanced RealTime DEtection TRansformer (RT-DETR) framework, the Semantic Assisted DETR, deemed as SA-DETR, integrates prior semantics from SAM into the SAR detection task. To be specific, first, we propose the mask-guided feature denoising module in the encoder stage, to enhance the network's discrimination of positives and negatives. Second, we propose the mask-guided query selection for initial query generation, which is beneficial for the decoder refinement. Finally, the mask-guided instance segmentation is proposed to achieve more accurate localization. To validate the superiority of the proposed SA-DETR, extensive experiments are conducted on two benchmark datasets, i.e., the SAR ship detection dataset (SSDD) and the recently published COCO-level large-scale multiclass SAR object detection dataset (SARDet-100K). Experimental results on both datasets outperform previous advanced detectors, achieving a new state-of-the-art with 99.0 $AP_{50}$ and 88.4 $mAP_{50}$ on SSDD and SARDet-100 K, respectively.
引用
收藏
页码:19395 / 19407
页数:13
相关论文
共 50 条
  • [21] Segmentation mask-guided person image generation
    Meichen Liu
    Xin Yan
    Chenhui Wang
    Kejun Wang
    Applied Intelligence, 2021, 51 : 1161 - 1176
  • [22] MagConv: Mask-Guided Convolution for Image Inpainting
    Yu, Xuexin
    Xu, Long
    Li, Jia
    Ji, Xiangyang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4716 - 4727
  • [23] Mask-Guided Portrait Editing with Conditional GANs
    Gu, Shuyang
    Bao, Jianmin
    Yang, Hao
    Chen, Dong
    Wen, Fang
    Yuan, Lu
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3431 - 3440
  • [24] Segmentation mask-guided person image generation
    Liu, Meichen
    Yan, Xin
    Wang, Chenhui
    Wang, Kejun
    APPLIED INTELLIGENCE, 2021, 51 (02) : 1161 - 1176
  • [25] Mask-Guided Model for Seismic Data Denoising
    Fang, Ziyi
    Lin, Hongbo
    Xu, Xinyu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [26] Exploiting semantic-level affinities with a mask-guided network for temporal action proposal in videos
    Yu Yang
    Mengmeng Wang
    Jianbiao Mei
    Yong Liu
    Applied Intelligence, 2023, 53 : 15516 - 15536
  • [27] Exploiting semantic-level affinities with a mask-guided network for temporal action proposal in videos
    Yang, Yu
    Wang, Mengmeng
    Mei, Jianbiao
    Liu, Yong
    APPLIED INTELLIGENCE, 2023, 53 (12) : 15516 - 15536
  • [28] Scale-invariant mask-guided vehicle keypoint detection from a monocular image
    Kim, Sunpil
    Yoon, Gang-Joon
    Song, Jinjoo
    Yoon, Sang Min
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 107
  • [29] Mask-Guided Stamp Erasure for Real Document Image
    Yang, Xinye
    Yang, Dongbao
    Zhou, Yu
    Guo, Youhui
    Wang, Weiping
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1631 - 1636
  • [30] Highlight mask-guided adaptive residual network for single image highlight detection and removal
    Wang, Shuaibin
    Li, Li
    Wang, Juan
    Peng, Tao
    Li, Zhenwei
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (03)