Aggregating transformers and CNNs for salient object detection in optical remote sensing images

被引:21
|
作者
Bao, Liuxin [1 ]
Zhou, Xiaofei [1 ]
Zheng, Bolun [1 ]
Yin, Haibing [2 ,3 ]
Zhu, Zunjie [2 ,3 ]
Zhang, Jiyong [1 ]
Yan, Chenggang [1 ,2 ]
机构
[1] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
[2] Hangzhou Dianzi Univ, Lishui Inst, Lishui 323000, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformer; CNNs; Feature fusion; Optical RSIs; Salient object detection; ENCODER-DECODER NETWORK; ATTENTION; FEATURES;
D O I
10.1016/j.neucom.2023.126560
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object detection (SOD) in optical remote sensing images (RSIs) plays a significant role in many areas such as agriculture, environmental protection, and the military. However, since the difference in imaging mode and image complexity between RSIs and natural scene images (NSIs), it is difficult to achieve remarkable results by directly extending the saliency method targeting NSIs to RSIs. Besides, we note that the convolutional neural networks (CNNs) based U-Net cannot effectively acquire the global long-range dependency, and the Transformer doesn't adequately characterize the spatial local details of each patch. Therefore, to conduct salient object detection in RSIs, we propose a novel two-branch architecture based network for Aggregating the Transformers and CNNs, namely ATC-Net, where the local spatial details and the global semantic information are fused into the final high-quality saliency map. Specifically, our saliency model adopts an encoder-decoder architecture including two parallel encoder branches and a decoder. Firstly, the two parallel encoder branches extract global and local features by using Transformer and CNNs, respectively. Then, the decoder employs a series of featureenhanced fusion (FF) modules to aggregate multi-level global and local features by interactive guidance and enhance the fused feature via attention mechanism. Finally, the decoder deploys the read out (RO) module to fuse the aggregated feature of FF module and the low-level CNN feature, steering the feature to focus more on spatial local details. Extensive experiments are performed on two public optical RSIs datasets, and the results show that our saliency model consistently outperforms 30 state-of-the-art methods.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] X-shape Feature Expansion Network for Salient Object Detection in Optical Remote Sensing Images
    Huang, Lisu
    Sun, Minghui
    Liang, Yanhua
    Qin, Guihe
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 246 - 258
  • [42] Transformer guidance dual-stream network for salient object detection in optical remote sensing images
    Yi Zhang
    Jichang Guo
    Huihui Yue
    Xiangjun Yin
    Sida Zheng
    Neural Computing and Applications, 2023, 35 : 17733 - 17747
  • [43] Iterative Saliency Aggregation and Assignment Network for Efficient Salient Object Detection in Optical Remote Sensing Images
    Yao, Zhaojian
    Gao, Wei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
  • [44] A parallel down-up fusion network for salient object detection in optical remote sensing images
    Li, Chongyi
    Cong, Runmin
    Guo, Chunle
    Li, Hua
    Zhang, Chunjie
    Zheng, Feng
    Zhao, Yao
    NEUROCOMPUTING, 2020, 415 : 411 - 420
  • [45] DKETFormer: Salient object detection in optical remote sensing images based on discriminative knowledge extraction and transfer
    Sun, Yuze
    Zhao, Hongwei
    Zhou, Jianhang
    NEUROCOMPUTING, 2025, 625
  • [46] Multilevel Interactive Reverse-Guided Network for Salient Object Detection in Optical Remote Sensing Images
    Zhao, Jie
    Jia, Yun
    Ma, Lin
    Yu, Lidan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 12983 - 12999
  • [47] Semantic-Guided Attention Refinement Network for Salient Object Detection in Optical Remote Sensing Images
    Huang, Zhou
    Chen, Huaixin
    Liu, Biyuan
    Wang, Zhixi
    REMOTE SENSING, 2021, 13 (11)
  • [48] Edge-Guided Recurrent Positioning Network for Salient Object Detection in Optical Remote Sensing Images
    Zhou, Xiaofei
    Shen, Kunye
    Weng, Li
    Cong, Runmin
    Zheng, Bolun
    Zhang, Jiyong
    Yan, Chenggang
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (01) : 539 - 552
  • [49] Salient Object Detection Based on Progressively Supervised Learning for Remote Sensing Images
    Zhang, Libao
    Ma, Jie
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (11): : 9682 - 9696
  • [50] Edge-Aware Multiscale Feature Integration Network for Salient Object Detection in Optical Remote Sensing Images
    Zhou, Xiaofei
    Shen, Kunye
    Liu, Zhi
    Gong, Chen
    Zhang, Jiyong
    Yan, Chenggang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60