Cross-Attention-Driven Adaptive Graph Relational Network for Multilabel Remote Sensing Scene Classification

Cited by: 0
Authors
Bi, Haixia [1]
Chang, Honghao [1]
Wang, Xiaotian [2]
Hong, Danfeng [3, 4]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Informat & Commun Engn, Xian 710049, Peoples R China
[2] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Remote sensing; Semantics; Feature extraction; Correlation; Scene classification; Long short term memory; Support vector machines; Adaptation models; Visualization; Solid modeling; Cross-attention; graph convolutional networks (GCNs); label dependency; multilabel classification; remote sensing; DEEP LEARNING APPROACH; IMAGE;
DOI
10.1109/TGRS.2024.3476089
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification code
0708; 070902
Abstract
Multilabel remote sensing scene classification (MLRSSC) has garnered growing attention in recent years, owing to its more comprehensive description of land covers compared to its single-label counterpart. However, challenges arise inevitably. First, the relations among multiple scene labels are sophisticated. How to excavate the interclass dependencies is, therefore, a key challenge for the MLRSSC task. Second, extracting discriminative semantic features is essential, yet challenging for scene prediction of remote sensing images. Another issue is that the multilabel dataset usually shows twofold sample imbalances, that is, class imbalance and positive-negative imbalance, which have not been explored in MLRSSC tasks so far. To overcome the above hurdles, we put forward a cross-attention-driven adaptive graph relational network for the MLRSSC task. Different from the chain-like long short-term memory (LSTM) or static label co-occurrence matrices, we propose to use image-specific relational graphs to dynamically model the interclass dependencies. We innovatively devise a cross-attention-driven representation learning approach, which uses learnable label embeddings to query the class-wise semantic features, explicitly establishing the feature-label connections. Moreover, we design a balanced focal loss (BFL) function, where the loss contributions of positive and negative samples are rebalanced based on the respective imbalance degrees of diverse classes. Extensive experiments were performed on UCM, AID, and DFC15 multilabel datasets. Experimental results demonstrated that our proposed method achieves state-of-the-art performance in the studied task.
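The abstract describes learnable label embeddings that query class-wise semantic features through cross-attention. The following is a minimal NumPy sketch of that general idea, not the authors' implementation: the function name, the tensor sizes, and the omission of learned query/key/value projections are all assumptions made for illustration, and the random arrays stand in for trained parameters and backbone features.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def label_query_cross_attention(label_emb, feat_map):
    """Hypothetical helper: each label embedding attends over spatial features.

    label_emb: (C, d) array, one embedding per class (learnable in practice).
    feat_map:  (N, d) array, N flattened spatial positions from a backbone.
    Returns class-wise features of shape (C, d) and attention weights (C, N).
    """
    d = label_emb.shape[1]
    # Scaled dot-product scores between every label and every position.
    scores = label_emb @ feat_map.T / np.sqrt(d)
    # Normalize over positions so each label's weights sum to 1.
    attn = softmax(scores, axis=1)
    # Weighted sum of spatial features gives one feature vector per label.
    class_feats = attn @ feat_map
    return class_feats, attn


# Arbitrary illustrative sizes (assumptions, not values from the paper).
rng = np.random.default_rng(0)
num_classes, num_positions, dim = 5, 49, 64
label_emb = rng.standard_normal((num_classes, dim))
feat_map = rng.standard_normal((num_positions, dim))

class_feats, attn = label_query_cross_attention(label_emb, feat_map)
```

In this sketch each row of `class_feats` is a per-class feature vector; per the abstract, such class-wise features would then feed the image-specific relational graph, a step not shown here.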
Pages: 14