SCAD: A Siamese Cross-Attention Discrimination Network for Bitemporal Building Change Detection

被引：6

作者：

Xu, Chuan ^{[1
]}

Ye, Zhaoyi ^{[1
]}

Mei, Liye ^{[1
,2
]}

Shen, Sen ^{[3
]}

Zhang, Qi ^{[1
]}

Sui, Haigang ^{[4
]}

Yang, Wei ^{[5
]}

Sun, Shaohua ^{[6
]}

机构：

[1] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China

[2] Wuhan Univ, Inst Technol Sci, Wuhan 430072, Peoples R China

[3] Naval Engn Univ, Sch Weap Engn, Wuhan 430032, Peoples R China

[4] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430072, Peoples R China

[5] Wuchang Shouyi Univ, Sch Informat Sci & Engn, Wuhan 430064, Peoples R China

[6] Air Force Res Acad, Beijing 10085, Peoples R China

来源：

REMOTE SENSING | 2022年 / 14卷 / 24期

基金：

中国国家自然科学基金;

关键词：

building change detection; deep learning; Siamese cross-attention; feature fusion; differential context; COVER CHANGE DETECTION; IMAGE;

D O I：

10.3390/rs14246213

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Building change detection (BCD) is crucial for urban construction and planning. The powerful discriminative ability of deep convolutions in deep learning-based BCD methods has considerably increased the accuracy and efficiency. However, dense and continuously distributed buildings contain a wide range of multi-scale features, which render current deep learning methods incapable of discriminating and incorporating multiple features effectively. In this work, we propose a Siamese cross-attention discrimination network (SCADNet) to identify complex information in bitemporal images and improve the change detection accuracy. Specifically, we first use the Siamese cross-attention (SCA) module to learn unchanged and changed feature information, combining multi-head cross-attention to improve the global validity of high-level semantic information. Second, we adapt a multi-scale feature fusion (MFF) module to integrate embedded tokens with context-rich channel transformer outputs. Then, upsampling is performed to fuse the extracted multi-scale information content to recover the original image information to the maximum extent. For information content with a large difference in contextual semantics, we perform filtering using a differential context discrimination (DCD) module, which can help the network to avoid pseudo-change occurrences. The experimental results show that the present SCADNet is able to achieve a significant change detection performance in terms of three public BCD datasets (LEVIR-CD, SYSU-CD, and WHU-CD). For these three datasets, we obtain F1 scores of 90.32%, 81.79%, and 88.62%, as well as OA values of 97.98%, 91.23%, and 98.88%, respectively.

引用

页数：21

共 50 条

[31] Skin lesion segmentation network with cross-attention coding
Li D.
Yang F.
Liu Y.
Tang Y.
Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (04): : 609 - 621
[32] Deep Cross-Attention Network for Crowdfunding Success Prediction
Tang, Zhe
Yang, Yi
Li, Wen
Lian, Defu
Duan, Lixin
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1306 - 1319
[33] Attention-Guided Siamese Fusion Network for Change Detection of Remote Sensing Images
Chen, Puhua
Guo, Lei
Zhang, Xiangrong
Qin, Kai
Ma, Wentao
Jiao, Licheng
REMOTE SENSING, 2021, 13 (22)
[34] CAFIN: cross-attention based face image repair network
Li, Yaqian
Li, Kairan
Li, Haibin
Zhang, Wenming
MULTIMEDIA SYSTEMS, 2024, 30 (05)
[35] Speech Enhancement with Fullband-Subband Cross-Attention Network
Chen, Jun
Rao, Wei
Wang, Zilin
Wu, Zhiyong
Wang, Yannan
Yu, Tao
Shang, Shidong
Meng, Helen
INTERSPEECH 2022, 2022, : 976 - 980
[36] RECA: Relation Extraction Based on Cross-Attention Neural Network
Huang, Xiaofeng
Guo, Zhiqiang
Zhang, Jialiang
Cao, Hui
Yang, Jie
ELECTRONICS, 2022, 11 (14)
[37] A Lightweight Siamese Neural Network for Building Change Detection Using Remote Sensing Images
Yang, Haiping
Chen, Yuanyuan
Wu, Wei
Pu, Shiliang
Wu, Xiaoyang
Wan, Qiming
Dong, Wen
REMOTE SENSING, 2023, 15 (04)
[38] Multi-Level Multi-Modal Cross-Attention Network for Fake News Detection
Ying, Long
Yu, Hui
Wang, Jinguang
Ji, Yongze
Qian, Shengsheng
IEEE ACCESS, 2021, 9 : 132363 - 132373
[39] Mention Recommendation for Multimodal Microblog with Cross-attention Memory Network
Ma, Renfeng
Zhang, Qi
Wang, Jiawen
Cui, Lizhen
Huang, Xuanjing
ACM/SIGIR PROCEEDINGS 2018, 2018, : 195 - 204
[40] LCAN: Light Cross-Attention Network for Collaborative Filtering Recommendation
Liu, Lin
Zhou, Wei
Wen, Junhao
Zhang, Yihao
Wang, Yu
Zhang, Hanwen
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 81 - 92

← 1 2 3 4 5 →