Cross-Attention-Driven Adaptive Graph Relational Network for Multilabel Remote Sensing Scene Classification

被引:0
|
作者
Bi, Haixia [1 ]
Chang, Honghao [1 ]
Wang, Xiaotian [2 ]
Hong, Danfeng [3 ,4 ]
机构
[1] Xi An Jiao Tong Univ, Sch Informat & Commun Engn, Xian 710049, Peoples R China
[2] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; Semantics; Feature extraction; Correlation; Scene classification; Long short term memory; Support vector machines; Adaptation models; Visualization; Solid modeling; Cross-attention; graph convolutional networks (GCNs); label dependency; multilabel classification; remote sensing; DEEP LEARNING APPROACH; IMAGE;
D O I
10.1109/TGRS.2024.3476089
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Multilabel remote sensing scene classification (MLRSSC) has garnered growing attention in recent years, owing to its more comprehensive description of land covers compared to its single-label counterpart. However, challenges arise inevitably. First, the relations among multiple scene labels are sophisticated. How to excavate the interclass dependencies is, therefore, a key challenge for the MLRSSC task. Second, extracting discriminative semantic features is essential, yet challenging for scene prediction of remote sensing images. Another issue is that the multilabel dataset usually shows twofold sample imbalances, that is, class imbalance and positive-negative imbalance, which have not been explored in MLRSSC tasks so far. To overcome the above hurdles, we put forward a cross-attention-driven adaptive graph relational network for the MLRSSC task. Different from the chain-like long short-term memory (LSTM) or static label co-occurrence matrices, we propose to use image-specific relational graphs to dynamically model the interclass dependencies. We innovatively devise a cross-attention-driven representation learning approach, which uses learnable label embeddings to query the class-wise semantic features, explicitly establishing the feature-label connections. Moreover, we design a balanced focal loss (BFL) function, where the loss contributions of positive and negative samples are rebalanced based on the respective imbalance degrees of diverse classes. Extensive experiments were performed on UCM, AID, and DFC15 multilabel datasets. Experimental results demonstrated that our proposed method achieves state-of-the-art performance in the studied task.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Attention-Driven Graph Convolution Network for Remote Sensing Image Retrieval
    Chaudhuri, Ushasi
    Banerjee, Biplab
    Bhattacharya, Avik
    Datcu, Mihai
    IEEE Geoscience and Remote Sensing Letters, 2022, 19
  • [32] Remote sensing scene classification based on high-order graph convolutional network
    Gao, Yue
    Shi, Jun
    Li, Jun
    Wang, Ruoyu
    EUROPEAN JOURNAL OF REMOTE SENSING, 2021, 54 (54) : 141 - 155
  • [33] Multisource Compensation Network for Remote Sensing Cross-Domain Scene Classification
    Lu, Xiaoqiang
    Gong, Tengfei
    Zheng, Xiangtao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (04): : 2504 - 2515
  • [34] HIGH-ORDER SELF-ATTENTION NETWORK FOR REMOTE SENSING SCENE CLASSIFICATION
    He, Nanjun
    Fang, Leyuan
    Li, Yi
    Plaza, Antonio
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3013 - 3016
  • [35] Remote Sensing Scene Classification via Multi-Branch Local Attention Network
    Chen, Si-Bao
    Wei, Qing-Song
    Wang, Wen-Zhong
    Tang, Jin
    Luo, Bin
    Wang, Zu-Yuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 99 - 109
  • [36] Wavelet Attention ResNeXt Network for High-resolution Remote Sensing Scene Classification
    Song, Wanying
    Cong, Yifan
    Zhang, Yingying
    Zhang, Shiru
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 330 - 333
  • [37] Gradient-Guided Multiscale Focal Attention Network for Remote Sensing Scene Classification
    Zhao, Yue
    Gong, Maoguo
    Qin, A. K.
    Zhang, Mingyang
    Hu, Zhuping
    Gao, Tianqi
    Pu, Yan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
  • [38] Channel-Attention-Based DenseNet Network for Remote Sensing Image Scene Classification
    Tong, Wei
    Chen, Weitao
    Han, Wei
    Li, Xianju
    Wang, Lizhe
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 4121 - 4132
  • [39] Self-Attention Network With Joint Loss for Remote Sensing Image Scene Classification
    Wu, Honglin
    Zhao, Shuzhen
    Li, Liang
    Lu, Chaoquan
    Chen, Wen
    IEEE ACCESS, 2020, 8 : 210347 - 210359
  • [40] CAW: A Remote-Sensing Scene Classification Network Aided by Local Window Attention
    Wang, Wei
    Wen, Xiaowei
    Wang, Xin
    Tang, Chen
    Deng, Jiwei
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022