CSTFNet: A CNN and Dual Swin-Transformer Fusion Network for Remote Sensing Hyperspectral Data Fusion and Classification of Coastal Areas

被引:0
|
作者
Li, Dekai [1 ]
Neira-Molina, Harold [2 ]
Huang, Mengxing [1 ]
Syam, M. S. [3 ,4 ,5 ]
Yu, Zhang [1 ,6 ]
Zhang, Junfeng [1 ]
Bhatti, Uzair Aslam [1 ]
Asif, Muhammad [7 ]
Sarhan, Nadia [8 ]
Awwad, Emad Mahrous [9 ]
机构
[1] Hainan Univ, Sch Informat & Commun Engn, Haikou 570228, Peoples R China
[2] Univ Costa, Dept Comp Sci & Elect, Barranquilla 080002, Colombia
[3] Jingchu Univ Technol, Sch Artificial Intelligence, Jingmen 448000, Peoples R China
[4] Jingchu Univ Technol, Jingmen Cryptometry Applicat Technol Res Ctr, Jingmen 448000, Peoples R China
[5] Jingchu Univ Technol, Internet Intelligences Applicat Innovat Res Ctr, Jingmen 448000, Peoples R China
[6] Hainan Univ, Sch Comp Sci & Technol, Haikou 570228, Peoples R China
[7] Hunan Univ Sci & Engn, Sch Media, Yongzhou 425199, Peoples R China
[8] King Saud Univ, Coll Business Adm, Dept Quantitat Anal, Riyadh 11421, Saudi Arabia
[9] King Saud Univ, Coll Engn, Dept Elect Engn, Riyadh 11421, Saudi Arabia
基金
美国国家科学基金会;
关键词
Feature extraction; Computational modeling; Transformers; Convolutional neural networks; Accuracy; Hyperspectral imaging; Vectors; Three-dimensional displays; Sea measurements; Faces; Convolutional neural network (CNN); hyperspectral image; remote sensing; Swin transformer; IMAGE CLASSIFICATION;
D O I
10.1109/JSTARS.2025.3530935
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hyperspectral imaging (HSI) can capture a large amount of spectral information at various wavelengths, enabling detailed material classification and identification, making it a key tool in remote sensing, particularly for coastal area monitoring. In recent years, the convolutional neural network (CNN) framework and transformer models have demonstrated strong performance in HSI classification, especially in applications requiring precise change detection and analysis. However, due to the high dimensionality of HSI data and the complexity of spectral-spatial feature extraction, achieving accurate results in coastal areas remains challenging. This article introduces a new hybrid model, CSTFNet, which combines an improved CNN module and dual-layer Swin transformer (DLST) to tackle these challenges. CSTFNet integrates spectral and spatial processing capabilities, significantly reducing computational complexity while maintaining high classification accuracy. The improved CNN module employs one-dimensional convolutions to handle high-dimensional data, while the DLST module uses window-based multihead attention to capture both local and global dependencies. Experiments conducted on four standard HSI datasets (Houston-2013, Samson, KSC, and Botswana) demonstrate that CSTFNet outperforms traditional and state-of-the-art algorithms, achieving overall classification accuracy exceeding 99% . In particular, on the Houston-2013 dataset, the results for OA and AA are 1.00 and the kappa coefficient is 0. 976. The results highlight the robustness and efficiency of the proposed model in coastal area applications, where accurate and reliable spectral-spatial classification is crucial for monitoring and environmental management.
引用
收藏
页码:5853 / 5865
页数:13
相关论文
共 50 条
  • [1] Interactive transformer and CNN network for fusion classification of hyperspectral and LiDAR data
    Wang, Leiquan
    Liu, Wenwen
    Lyu, Dong
    Zhang, Peiying
    Guo, Fangming
    Hu, Yabin
    Xu, Mingming
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024,
  • [2] Fusion of hyperspectral and LIDAR remote sensing data for classification of complex forest areas
    Dalponte, Michele
    Bruzzone, Lorenzo
    Gianelle, Damiano
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2008, 46 (05): : 1416 - 1427
  • [3] Remote Sensing Image Classification Method Based on Fusion of CNN and Transformer
    Jin Chuan
    Tong Changqing
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (20)
  • [4] Dual selective fusion transformer network for hyperspectral image classification
    Xu, Yichu
    Wang, Di
    Zhang, Lefei
    Zhang, Liangpei
    NEURAL NETWORKS, 2025, 187
  • [5] Inductive Biased Swin-Transformer With Cyclic Regressor for Remote Sensing Scene Classification
    Hao, Siyuan
    Li, Nan
    Ye, Yuanxin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 6265 - 6278
  • [6] STMSF: Swin Transformer with Multi-Scale Fusion for Remote Sensing Scene Classification
    Duan, Yingtao
    Song, Chao
    Zhang, Yifan
    Cheng, Puyu
    Mei, Shaohui
    REMOTE SENSING, 2025, 17 (04)
  • [7] SwinSTFM: Remote Sensing Spatiotemporal Fusion Using Swin Transformer
    Chen, Guanyu
    Jiao, Peng
    Hu, Qing
    Xiao, Linjie
    Ye, Zijian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [8] An end-to-end medical image fusion network based on Swin-transformer
    Yu, Kaixin
    Yang, Xiaoming
    Jeon, Seunggil
    Dou, Qingyu
    MICROPROCESSORS AND MICROSYSTEMS, 2023, 98
  • [9] MULTISOURCE REMOTE SENSING DATA CLASSIFICATION BASED ON A DUAL ATTENTION FUSION NETWORK
    Wang, Junjie
    Li, Wei
    Zhang, Mengmeng
    Gao, Yunhao
    2022 12TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2022,
  • [10] Relating CNN-Transformer Fusion Network for Remote Sensing Change Detection
    Gao, Yuhao
    Pei, Gensheng
    Sheng, Mengmeng
    Sun, Zeren
    Chen, Tao
    Yao, Yazhou
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,