Multi-scale Attentive Fusion Network for Remote Sensing Image Change Captioning

被引:0
|
作者
Chen, Cai [1 ]
Wang, Yi [2 ]
Yap, Kim-Hui [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
[2] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Peoples R China
关键词
Image Change Captioning (ICC); Remote Sensing (RS); Multi-scale Change Awareness;
D O I
10.1109/ISCAS58744.2024.10558583
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Remote-sensing Image Change Captioning (RSICC) aims to automatically generate sentences describing the difference of content in remote-sensing bitemporal images. Most of the methods often address shortcomings in model architecture to enhance previous work, overlooking the distinctive characteristics that set remote sensing images apart from natural images, such as recognizing the change of objects with various scales (e.g., small/large-scale objects). By considering the difference, we proposed a Multi-scale Attentive Fusion Network (MAF-Net) to adaptively capture and describe the object change with a wide range of scales. The MAF-Net first extracts multi-scale visual features of bitemporal images from different stages of the CNN backbone, then captures the changes in each pair of the features with the proposed Multi-scale Change Aware Encoders (MCAE). Specifically, the MCAE captures the changeaware discriminative information over the paired multi-scale bitemporal features by Transformer-based different and content cross-attention encoding. Furthermore, a Gated Attentive Fusion (GAF) module is introduced to adaptively aggregate the relevant change-aware features to enhance the change caption performance. We evaluate the effectiveness of our proposed method on two RSICC datasets (e.g., LEVIR-CC and LEVIRCCD), and experimental results demonstrate that our method achieves stateof-the-art performance.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Remote Sensing Image Change Captioning Using Multi-Attentive Network with Diffusion Model
    Yang, Yue
    Liu, Tie
    Pu, Ying
    Liu, Liangchen
    Zhao, Qijun
    Wan, Qun
    REMOTE SENSING, 2024, 16 (21)
  • [2] MIFNet: Multi-Scale Interaction Fusion Network for Remote Sensing Image Change Detection
    Xie, Weiying
    Shao, Wenjie
    Li, Daixun
    Li, Yunsong
    Fang, Leyuan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2725 - 2739
  • [3] MSGFNet: Multi-Scale Gated Fusion Network for Remote Sensing Image Change Detection
    Wang, Yukun
    Wang, Mengmeng
    Hao, Zhonghu
    Wang, Qiang
    Wang, Qianwen
    Ye, Yuanxin
    REMOTE SENSING, 2024, 16 (03)
  • [4] Multi-scale feature progressive fusion network for remote sensing image change detection
    Lu, Di
    Cheng, Shuli
    Wang, Liejun
    Song, Shiji
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] Multi-scale feature progressive fusion network for remote sensing image change detection
    Di Lu
    Shuli Cheng
    Liejun Wang
    Shiji Song
    Scientific Reports, 12
  • [6] MULTI-SCALE CROPPING MECHANISM FOR REMOTE SENSING IMAGE CAPTIONING
    Zhang, Xueting
    Wang, Qi
    Chen, Shangdong
    Li, Xuelong
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 10039 - 10042
  • [7] Remote Sensing Image Change Detection on Multi-Scale MRF Fusion
    Wang, W. X.
    Yu, Tianchao
    REMOTE SENSING AND MODELING OF ECOSYSTEMS FOR SUSTAINABILITY VIII, 2011, 8156
  • [8] MFSFNet: Multi-Scale Feature Subtraction Fusion Network for Remote Sensing Image Change Detection
    Huang, Zhiqi
    You, Hongjian
    REMOTE SENSING, 2023, 15 (15)
  • [9] Remote sensing image change detection network with multi-scale feature information mining and fusion
    Xue, Songdong
    Zhang, Minming
    Qiao, Gangzhu
    Zhang, Chaofan
    Wang, Bin
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (02)
  • [10] Changes to Captions: An Attentive Network for Remote Sensing Change Captioning
    Chang, Shizhen
    Ghamisi, Pedram
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6047 - 6060