Aggregating transformers and CNNs for salient object detection in optical remote sensing images

被引:21
|
作者
Bao, Liuxin [1 ]
Zhou, Xiaofei [1 ]
Zheng, Bolun [1 ]
Yin, Haibing [2 ,3 ]
Zhu, Zunjie [2 ,3 ]
Zhang, Jiyong [1 ]
Yan, Chenggang [1 ,2 ]
机构
[1] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
[2] Hangzhou Dianzi Univ, Lishui Inst, Lishui 323000, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformer; CNNs; Feature fusion; Optical RSIs; Salient object detection; ENCODER-DECODER NETWORK; ATTENTION; FEATURES;
D O I
10.1016/j.neucom.2023.126560
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Salient object detection (SOD) in optical remote sensing images (RSIs) plays a significant role in many areas such as agriculture, environmental protection, and the military. However, since the difference in imaging mode and image complexity between RSIs and natural scene images (NSIs), it is difficult to achieve remarkable results by directly extending the saliency method targeting NSIs to RSIs. Besides, we note that the convolutional neural networks (CNNs) based U-Net cannot effectively acquire the global long-range dependency, and the Transformer doesn't adequately characterize the spatial local details of each patch. Therefore, to conduct salient object detection in RSIs, we propose a novel two-branch architecture based network for Aggregating the Transformers and CNNs, namely ATC-Net, where the local spatial details and the global semantic information are fused into the final high-quality saliency map. Specifically, our saliency model adopts an encoder-decoder architecture including two parallel encoder branches and a decoder. Firstly, the two parallel encoder branches extract global and local features by using Transformer and CNNs, respectively. Then, the decoder employs a series of featureenhanced fusion (FF) modules to aggregate multi-level global and local features by interactive guidance and enhance the fused feature via attention mechanism. Finally, the decoder deploys the read out (RO) module to fuse the aggregated feature of FF module and the low-level CNN feature, steering the feature to focus more on spatial local details. Extensive experiments are performed on two public optical RSIs datasets, and the results show that our saliency model consistently outperforms 30 state-of-the-art methods.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Salient Object Detection in Optical Remote Sensing Images Driven by Transformer
    Li, Gongyang
    Bai, Zhen
    Liu, Zhi
    Zhang, Xinpeng
    Ling, Haibin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5257 - 5269
  • [2] Transformers and CNNs fusion network for salient object detection
    Yao, Cuili
    Feng, Lin
    Kong, Yuqiu
    Xiao, Lin
    Chen, Tao
    NEUROCOMPUTING, 2023, 520 : 342 - 355
  • [3] Dynamic Context Coordination for Salient Object Detection in Optical Remote Sensing Images
    Huang, Jiarong
    Huang, Kan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [4] Adjacent Complementary Network for Salient Object Detection in Optical Remote Sensing Images
    Song, Dawei
    Dong, Yongsheng
    Li, Xuelong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] Attention Guided Network for Salient Object Detection in Optical Remote Sensing Images
    Lin, Yuhan
    Sun, Han
    Liu, Ningzhong
    Bian, Yetong
    Cen, Jun
    Zhou, Huiyu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 25 - 36
  • [6] Multiscale Feature Enhancement Network for Salient Object Detection in Optical Remote Sensing Images
    Wang, Zhen
    Guo, Jianxin
    Zhang, Chuanlei
    Wang, Buhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] MEANet: An effective and lightweight solution for salient object detection in optical remote sensing images
    Liang, Bocheng
    Luo, Huilan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [8] Multiscale Feature Aggregation Network for Salient Object Detection in Optical Remote Sensing Images
    Yan, Longquan
    Geng, Guohua
    Zhang, Qi
    Feng, Long
    Liu, Yangyang
    Ge, Xing
    Jia, Haotian
    IEEE SENSORS JOURNAL, 2023, 23 (16) : 18362 - 18373
  • [9] Speed-Oriented Lightweight Salient Object Detection in Optical Remote Sensing Images
    Li, Zhaoyang
    Miao, Yinxiao
    Li, Xiongwei
    Li, Wenrui
    Cao, Jie
    Hao, Qun
    Li, Dongxing
    Sheng, Yunlong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [10] Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images
    Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai
    200444, China
    不详
    200444, China
    不详
    200444, China
    不详
    639798, Singapore
    不详
    NY
    11794, United States
    arXiv, 1600,