SwinTFNet: Dual-Stream Transformer With Cross Attention Fusion for Land Cover Classification

Cited by: 5
Authors
Ren, Bo [1]
Liu, Bo [1]
Hou, Biao [1]
Wang, Zhao [1]
Yang, Chen [1]
Jiao, Licheng [1]
Affiliations
[1] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ China, Xian 710071, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Optical imaging; Feature extraction; Optical sensors; Transformers; Adaptive optics; Optical fiber networks; Fuses; Data fusion; land cover classification (LCC); multimodality; synthetic aperture radar (SAR)-optical;
DOI
10.1109/LGRS.2024.3358899
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification code
0708; 070902
Abstract
Land cover classification (LCC) is an important application in remote sensing data interpretation. Synthetic aperture radar (SAR) and optical images are two common data sources, and SAR images can be regarded as an effective complement to optical images, reducing the limitations of relying on single-modal data. However, common LCC methods focus on designing advanced network architectures to process single-modal remote sensing data, and few works aim to improve segmentation performance by fusing multimodal data. To deeply integrate SAR and optical features, we propose SwinTFNet, a dual-stream deep fusion network. Using the global context modeling capability of the Transformer structure, SwinTFNet models teleconnections between pixels in cloud regions and pixels in other regions to improve prediction in cloud regions. In addition, a cross-attention fusion module (CAFM) is proposed to fuse features from optical and SAR data. Experimental results show that our method substantially improves the classification of clouded images compared with other strong segmentation methods and achieves the best performance on multimodal data. The source code of SwinTFNet is publicly available at https://github.com/XD-MG/SwinTFNet.
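The abstract does not describe the internals of the CAFM, so the following is only a generic illustration of cross-attention between two feature streams, not the authors' module. It assumes both streams yield the same number of tokens with the same feature dimension, omits the learned query/key/value projections (identity projections), and uses a hypothetical `fuse` step (residual add, then sum of the two streams) that is not taken from the paper. All function and variable names are illustrative.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, context):
    """Scaled dot-product attention: each query token attends over all context tokens.

    Assumption: learned Q/K/V projections are omitted (identity), so the
    context tokens act as both keys and values.
    """
    dim = len(queries[0])
    scale = 1.0 / math.sqrt(dim)
    attended = []
    for q in queries:
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in context]
        weights = softmax(scores)
        attended.append(
            [sum(w * v[d] for w, v in zip(weights, context)) for d in range(dim)]
        )
    return attended


def fuse(optical, sar):
    """Hypothetical fusion rule (not from the paper).

    Each stream queries the other, the result is added back to the querying
    stream (residual), and the two updated streams are summed token by token.
    """
    optical_ctx = cross_attention(optical, sar)   # optical queries, SAR keys/values
    sar_ctx = cross_attention(sar, optical)       # SAR queries, optical keys/values
    fused = []
    for o, o_ctx, s, s_ctx in zip(optical, optical_ctx, sar, sar_ctx):
        fused.append([a + b + c + d for a, b, c, d in zip(o, o_ctx, s, s_ctx)])
    return fused


# Toy inputs: 3 tokens per stream, 2 features per token (made-up values).
optical_tokens = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
sar_tokens = [[0.2, 0.8], [0.9, 0.1], [0.4, 0.6]]
fused_tokens = fuse(optical_tokens, sar_tokens)
```

Because every token in one stream attends over all tokens in the other, a token whose optical features are unreliable (for example, under cloud) can in principle draw on SAR features from any location. A trained model would add learned projections, multiple heads, and normalization on top of this.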
Pages: 1-5
Page count: 5