Hybrid Multiscale SAR Ship Detector With CNN-Transformer and Adaptive Fusion Loss

被引:0
|
作者
Wang, Fei [1 ]
Chen, Chengcheng [1 ]
Zeng, Weiming [1 ]
机构
[1] Shanghai Maritime Univ, Digital Imaging & Intelligent Comp Lab, Shanghai 201306, Peoples R China
关键词
Marine vehicles; Feature extraction; Detectors; Convolution; Transformers; Computational modeling; Synthetic aperture radar; Deep learning; multiscale feature fusion; ship detection; synthetic aperture radar (SAR);
D O I
10.1109/LGRS.2024.3450716
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Ship detection in remote sensing imagery is crucial for various maritime applications such as surveillance and navigation. Convolutional neural networks (CNNs) and transformers have shown significant potential in object detection within the field of image processing. However, existing models applied directly to ship detection in synthetic aperture radar (SAR) imagery encounter challenges due to the varying sizes of ship targets. This often leads to issues such as low detection accuracy, missed detections, and false alarms. In this letter, we propose a new detection network, HMA-Net, to further address these issues. Initially, we introduce the Cwin module, which enhances interference resistance at a relatively low cost, enabling the model to more accurately capture target information. Subsequently, we design a multiscale ship feature extraction module, which uses a parallel multibranch structure to extract features of ships of various sizes and shapes. Finally, we introduce an adaptive fusion loss function that flexibly allocates loss calculation methods to detected targets, thereby enhancing the robustness of the model and achieving high-quality detection boxes. The proposed HMA-Net achieved improvements of 2.0% and 0.9% in mAP(.50:.95) over the baseline models on the SAR Ship Detection dataset and the High-Resolution SAR Images dataset, using only 3.52 M parameters.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation
    Li, Zihan
    Li, Dihan
    Xu, Cangbai
    Wang, Weice
    Hong, Qingqi
    Li, Qingde
    Tian, Jie
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 781 - 792
  • [22] Correction to: A hybrid CNN-Transformer model for ozone concentration prediction
    Yibin Chen
    Xiaomin Chen
    Ailan Xu
    Qiang Sun
    Xiaoyan Peng
    Air Quality, Atmosphere & Health, 2022, 15 : 1695 - 1697
  • [23] CNN-Transformer Hybrid Architecture for Underwater Sonar Image Segmentation
    Lei, Juan
    Wang, Huigang
    Lei, Zelin
    Li, Jiayuan
    Rong, Shaowei
    REMOTE SENSING, 2025, 17 (04)
  • [24] Harmful Cyanobacterial Blooms forecasting based on improved CNN-Transformer and Temporal Fusion Transformer
    Ahn, Jung Min
    Kim, Jungwook
    Kim, Hongtae
    Kim, Kyunghyun
    ENVIRONMENTAL TECHNOLOGY & INNOVATION, 2023, 32
  • [25] A hybrid CNN-Transformer model for Historical Document Image Binarization
    Rezanezhad, Vahid
    Baierer, Konstantin
    Neudecker, Clemens
    PROCEEDINGS OF THE 2023 INTERNATIONAL WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING, HIP 2023, 2023, : 79 - 84
  • [26] DBCT-Net:A dual branch hybrid CNN-transformer network for remote sensing image fusion
    Wang, Quanli
    Jin, Xin
    Jiang, Qian
    Wu, Liwen
    Zhang, Yunchun
    Zhou, Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233
  • [27] Enhanced Segmentation in Abdominal CT Images: Leveraging Hybrid CNN-Transformer Architectures and Compound Loss Function
    Piri, Fatemeh
    Karimi, Nader
    Samavi, Shadrokh
    2024 IEEE 5TH ANNUAL WORLD AI IOT CONGRESS, AIIOT 2024, 2024, : 0363 - 0369
  • [28] Infrared and Visible Image Fusion Based on Autoencoder Composed of CNN-Transformer
    Wang, Hongmei
    Li, Lin
    Li, Chenkai
    Lu, Xuanyu
    IEEE ACCESS, 2023, 11 : 78956 - 78969
  • [29] A CNN-Transformer Combined Remote Sensing Imagery Spatiotemporal Fusion Model
    Jiang, Mingyu
    Shao, Hua
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 13995 - 14009
  • [30] Block Cipher Algorithm Identification Based on CNN-Transformer Fusion Model
    Xie, Rongna
    Chen, Xiaoyu
    Zhang, Xinru
    Shi, Guozhen
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 97 - 110