SCAD: A Siamese Cross-Attention Discrimination Network for Bitemporal Building Change Detection

被引:6
|
作者
Xu, Chuan [1 ]
Ye, Zhaoyi [1 ]
Mei, Liye [1 ,2 ]
Shen, Sen [3 ]
Zhang, Qi [1 ]
Sui, Haigang [4 ]
Yang, Wei [5 ]
Sun, Shaohua [6 ]
机构
[1] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China
[2] Wuhan Univ, Inst Technol Sci, Wuhan 430072, Peoples R China
[3] Naval Engn Univ, Sch Weap Engn, Wuhan 430032, Peoples R China
[4] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430072, Peoples R China
[5] Wuchang Shouyi Univ, Sch Informat Sci & Engn, Wuhan 430064, Peoples R China
[6] Air Force Res Acad, Beijing 10085, Peoples R China
基金
中国国家自然科学基金;
关键词
building change detection; deep learning; Siamese cross-attention; feature fusion; differential context; COVER CHANGE DETECTION; IMAGE;
D O I
10.3390/rs14246213
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Building change detection (BCD) is crucial for urban construction and planning. The powerful discriminative ability of deep convolutions in deep learning-based BCD methods has considerably increased the accuracy and efficiency. However, dense and continuously distributed buildings contain a wide range of multi-scale features, which render current deep learning methods incapable of discriminating and incorporating multiple features effectively. In this work, we propose a Siamese cross-attention discrimination network (SCADNet) to identify complex information in bitemporal images and improve the change detection accuracy. Specifically, we first use the Siamese cross-attention (SCA) module to learn unchanged and changed feature information, combining multi-head cross-attention to improve the global validity of high-level semantic information. Second, we adapt a multi-scale feature fusion (MFF) module to integrate embedded tokens with context-rich channel transformer outputs. Then, upsampling is performed to fuse the extracted multi-scale information content to recover the original image information to the maximum extent. For information content with a large difference in contextual semantics, we perform filtering using a differential context discrimination (DCD) module, which can help the network to avoid pseudo-change occurrences. The experimental results show that the present SCADNet is able to achieve a significant change detection performance in terms of three public BCD datasets (LEVIR-CD, SYSU-CD, and WHU-CD). For these three datasets, we obtain F1 scores of 90.32%, 81.79%, and 88.62%, as well as OA values of 97.98%, 91.23%, and 98.88%, respectively.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Skin lesion segmentation network with cross-attention coding
    Li D.
    Yang F.
    Liu Y.
    Tang Y.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (04): : 609 - 621
  • [32] Deep Cross-Attention Network for Crowdfunding Success Prediction
    Tang, Zhe
    Yang, Yi
    Li, Wen
    Lian, Defu
    Duan, Lixin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1306 - 1319
  • [33] Attention-Guided Siamese Fusion Network for Change Detection of Remote Sensing Images
    Chen, Puhua
    Guo, Lei
    Zhang, Xiangrong
    Qin, Kai
    Ma, Wentao
    Jiao, Licheng
    REMOTE SENSING, 2021, 13 (22)
  • [34] CAFIN: cross-attention based face image repair network
    Li, Yaqian
    Li, Kairan
    Li, Haibin
    Zhang, Wenming
    MULTIMEDIA SYSTEMS, 2024, 30 (05)
  • [35] Speech Enhancement with Fullband-Subband Cross-Attention Network
    Chen, Jun
    Rao, Wei
    Wang, Zilin
    Wu, Zhiyong
    Wang, Yannan
    Yu, Tao
    Shang, Shidong
    Meng, Helen
    INTERSPEECH 2022, 2022, : 976 - 980
  • [36] RECA: Relation Extraction Based on Cross-Attention Neural Network
    Huang, Xiaofeng
    Guo, Zhiqiang
    Zhang, Jialiang
    Cao, Hui
    Yang, Jie
    ELECTRONICS, 2022, 11 (14)
  • [37] A Lightweight Siamese Neural Network for Building Change Detection Using Remote Sensing Images
    Yang, Haiping
    Chen, Yuanyuan
    Wu, Wei
    Pu, Shiliang
    Wu, Xiaoyang
    Wan, Qiming
    Dong, Wen
    REMOTE SENSING, 2023, 15 (04)
  • [38] Multi-Level Multi-Modal Cross-Attention Network for Fake News Detection
    Ying, Long
    Yu, Hui
    Wang, Jinguang
    Ji, Yongze
    Qian, Shengsheng
    IEEE ACCESS, 2021, 9 : 132363 - 132373
  • [39] Mention Recommendation for Multimodal Microblog with Cross-attention Memory Network
    Ma, Renfeng
    Zhang, Qi
    Wang, Jiawen
    Cui, Lizhen
    Huang, Xuanjing
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 195 - 204
  • [40] LCAN: Light Cross-Attention Network for Collaborative Filtering Recommendation
    Liu, Lin
    Zhou, Wei
    Wen, Junhao
    Zhang, Yihao
    Wang, Yu
    Zhang, Hanwen
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 81 - 92