Cross-Attention-Driven Adaptive Graph Relational Network for Multilabel Remote Sensing Scene Classification

Cited by: 0
Authors
Bi, Haixia [1]
Chang, Honghao [1]
Wang, Xiaotian [2]
Hong, Danfeng [3, 4]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Informat & Commun Engn, Xian 710049, Peoples R China
[2] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Remote sensing; Semantics; Feature extraction; Correlation; Scene classification; Long short term memory; Support vector machines; Adaptation models; Visualization; Solid modeling; Cross-attention; graph convolutional networks (GCNs); label dependency; multilabel classification; remote sensing; DEEP LEARNING APPROACH; IMAGE;
DOI
10.1109/TGRS.2024.3476089
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification codes
0708; 070902
Abstract
Multilabel remote sensing scene classification (MLRSSC) has garnered growing attention in recent years because it describes land cover more comprehensively than its single-label counterpart. However, the task poses several challenges. First, the relations among multiple scene labels are complex, so mining the interclass dependencies is a key challenge for MLRSSC. Second, extracting discriminative semantic features is essential yet difficult for scene prediction in remote sensing images. Third, multilabel datasets usually exhibit a twofold sample imbalance, namely class imbalance and positive-negative imbalance, which has not been explored in MLRSSC so far. To overcome these hurdles, we put forward a cross-attention-driven adaptive graph relational network for the MLRSSC task. In contrast to chain-like long short-term memory (LSTM) models or static label co-occurrence matrices, we use image-specific relational graphs to dynamically model the interclass dependencies. We devise a cross-attention-driven representation learning approach, which uses learnable label embeddings to query the class-wise semantic features, explicitly establishing the feature-label connections. Moreover, we design a balanced focal loss (BFL) function, in which the loss contributions of positive and negative samples are rebalanced according to the imbalance degree of each class. Extensive experiments were performed on the UCM, AID, and DFC15 multilabel datasets. The results demonstrate that the proposed method achieves state-of-the-art performance on the studied task.
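The two mechanisms named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the record gives no formulas, so the function names, shapes, and the per-class `pos_weight` parameter are hypothetical. The first function is plain scaled dot-product cross-attention with label embeddings as queries and image features as keys and values. The second is a generic binary focal loss with a per-class positive/negative weight, shown only as one plausible way to rebalance contributions; the paper's BFL may be defined differently.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def label_cross_attention(label_emb, feats):
    """Scaled dot-product cross-attention (illustrative only).

    label_emb: (C, d) label embeddings used as queries
               (learnable in a real model; fixed arrays here).
    feats:     (N, d) flattened spatial features used as keys and values.
    Returns (C, d) class-wise features and (C, N) attention weights.
    """
    d = label_emb.shape[1]
    scores = label_emb @ feats.T / np.sqrt(d)   # (C, N) query-key similarity
    attn = softmax(scores, axis=1)              # each label attends over locations
    return attn @ feats, attn


def weighted_focal_loss(probs, targets, pos_weight, gamma=2.0, eps=1e-7):
    """Generic class-weighted binary focal loss (an assumption, not the paper's BFL).

    probs, targets: (B, C) predicted probabilities and 0/1 labels.
    pos_weight:     (C,) hypothetical per-class weight in (0, 1) for positives;
                    negatives receive 1 - pos_weight.
    """
    p = np.clip(probs, eps, 1.0 - eps)
    pos = -pos_weight * targets * (1.0 - p) ** gamma * np.log(p)
    neg = -(1.0 - pos_weight) * (1.0 - targets) * p ** gamma * np.log(1.0 - p)
    return float((pos + neg).mean())


# Toy usage with random data: 3 labels, 16 spatial locations, 8 channels.
rng = np.random.default_rng(0)
label_emb = rng.normal(size=(3, 8))
feats = rng.normal(size=(16, 8))
class_feats, attn = label_cross_attention(label_emb, feats)
```

In this sketch each row of `attn` is a distribution over spatial locations for one label, so `class_feats` holds one feature vector per class. A relational graph over those class-wise vectors would be the next stage in a pipeline of the kind the abstract describes, but it is not sketched here.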
Pages: 14