A lightweight CNN-Transformer network for pixel-based crop mapping using time-series Sentinel-2 imagery

Cited by: 5
Authors
Wang, Yumiao [1, 2]
Feng, Luwei [3]
Sun, Weiwei [1]
Wang, Lihua [1]
Yang, Gang [1]
Chen, Binjie [1]
Affiliations
[1] Ningbo Univ, Dept Geog & Spatial Informat Tech, Ningbo 315211, Peoples R China
[2] Ningbo Univ, Inst East China Sea, Ningbo 315211, Zhejiang, Peoples R China
[3] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China;
Keywords
Crop mapping; Convolutional neural network; Transformer; Pixel-based classification; Temporal Sentinel-2 data;
DOI
10.1016/j.compag.2024.109370
Chinese Library Classification (CLC) number
S [Agricultural Sciences];
Discipline classification code
09;
Abstract
Deep learning approaches have provided state-of-the-art performance in crop mapping. Recently, several studies have combined the strengths of two dominant deep learning architectures, Convolutional Neural Networks (CNNs) and Transformers, to classify crops using remote sensing images. Despite their success, many of these models use patch-based methods that require extensive data labeling, because each sample contains multiple pixels with corresponding labels. This raises the cost of data preparation and processing. Moreover, previous methods rarely considered the impact of missing values caused by clouds and a lack of observations in remote sensing data. Therefore, this study proposes a lightweight multi-stage CNN-Transformer network (MCTNet) for pixel-based crop mapping using time-series Sentinel-2 imagery. MCTNet consists of several successive modules, each containing a CNN sub-module and a Transformer sub-module that extract important features from the images. An attention-based learnable positional encoding (ALPE) module is designed in the Transformer sub-module to capture the complex temporal relations in time-series data with different missing rates. Arkansas and California in the U.S. are selected to evaluate the model. Experimental results show that MCTNet is lightweight, with the fewest parameters and the lowest memory usage, while achieving superior performance compared to eight advanced models. Specifically, MCTNet obtained an overall accuracy (OA) of 0.968, a kappa coefficient (Kappa) of 0.951, and a macro-averaged F1 score (F1) of 0.933 in Arkansas, and an OA of 0.852, a Kappa of 0.806, and an F1 score of 0.829 in California. The results highlight the importance of each component of the model, particularly the ALPE module, which enhanced the Kappa of MCTNet by 4.2% in Arkansas and improved the model's robustness to missing values in remote sensing data. Additionally, visualization results demonstrated that the features extracted from the CNN and Transformer sub-modules are complementary, explaining the effectiveness of MCTNet.
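The abstract reports three metrics: overall accuracy (OA), the kappa coefficient, and the macro-averaged F1 score. The following is a minimal sketch of how these are conventionally computed from a confusion matrix. It is not the authors' evaluation code; the function names and the toy labels are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence

def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     n_classes: int) -> List[List[int]]:
    """Rows are true classes, columns are predicted classes."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm

def overall_accuracy(cm: List[List[int]]) -> float:
    """Fraction of samples on the diagonal (correctly classified)."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total

def cohen_kappa(cm: List[List[int]]) -> float:
    """Agreement corrected for chance: (p_o - p_e) / (1 - p_e)."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    row_sums = [sum(cm[i]) for i in range(n)]
    col_sums = [sum(cm[i][j] for i in range(n)) for j in range(n)]
    p_o = sum(cm[i][i] for i in range(n)) / total
    p_e = sum(row_sums[i] * col_sums[i] for i in range(n)) / (total * total)
    return (p_o - p_e) / (1 - p_e)

def macro_f1(cm: List[List[int]]) -> float:
    """Unweighted mean of the per-class F1 scores."""
    n = len(cm)
    scores = []
    for i in range(n):
        tp = cm[i][i]
        fp = sum(cm[r][i] for r in range(n)) - tp  # predicted i, true class differs
        fn = sum(cm[i]) - tp                       # true i, predicted class differs
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / n

# Toy labels for three hypothetical crop classes (not data from the paper).
y_true = [0, 0, 0, 1, 1, 2, 2, 2]
y_pred = [0, 0, 1, 1, 1, 2, 2, 0]
cm = confusion_matrix(y_true, y_pred, n_classes=3)
oa = overall_accuracy(cm)
kappa = cohen_kappa(cm)
f1 = macro_f1(cm)
```

Because macro-averaging weights every class equally, the F1 score can diverge from OA when classes are imbalanced, which is one reason all three metrics are usually reported together.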
Pages: 17
Related papers
50 items in total
  • [21] Synergy of Sentinel-1 and Sentinel-2 Imagery for Crop Classification Based on DC-CNN
    Zhang, Kaixin
    Yuan, Da
    Yang, Huijin
    Zhao, Jianhui
    Li, Ning
    REMOTE SENSING, 2023, 15 (11)
  • [22] A SEMI-SUPERVISED APPROACH TOWARDS LAND COVER MAPPING WITH SENTINEL-2 DENSE TIME-SERIES IMAGERY
    Hu, Ting
    Huang, Xin
    Li, Jiayi
    Benediktsson, Jon Atli
    Yang, Jiansi
    Gong, Jianya
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 2423 - 2426
  • [23] Contrastive-Learning-Based Time-Series Feature Representation for Parcel-Based Crop Mapping Using Incomplete Sentinel-2 Image Sequences
    Zhou, Ya'nan
    Wang, Yan
    Yan, Na'na
    Feng, Li
    Chen, Yuehong
    Wu, Tianjun
    Gao, Jianwei
    Zhang, Xiwang
    Zhu, Weiwei
    REMOTE SENSING, 2023, 15 (20)
  • [24] Scalable pixel-based crop classification combining Sentinel-2 and Landsat-8 data time series: Case study of the Duero river basin
    Piedelobo, Laura
    Hernandez-Lopez, David
    Ballesteros, Rock
    Chakhar, Amal
    Del Pozo, Susana
    Gonzalez-Aguilera, Diego
    Moreno, Miguel A.
    AGRICULTURAL SYSTEMS, 2019, 171 : 36 - 50
  • [25] DETAILED MAPPING OF RESIDENTIAL LAND USE IN QUEZON CITY USING SENTINEL-2 IMAGERY: AN ANALYSIS OF PIXEL-BASED IMAGE CLASSIFICATION USING SUPPORT VECTOR MACHINE
    Mabalot, M. I. D.
    Sumera, L. S.
    Blanco, A. C.
    Carcellar, B. G.
    GEOINFORMATION WEEK 2022, VOL. 48-4, 2023, : 201 - 209
  • [26] A new phenology-based method for mapping wheat and barley using time-series of Sentinel-2 images
    Ashourloo, Davoud
    Nematollahi, Hamed
    Huete, Alfredo
    Aghighi, Hossein
    Azadbakht, Mohsen
    Shahrabi, Hamid Salehi
    Goodarzdashti, Salman
    REMOTE SENSING OF ENVIRONMENT, 2022, 280
  • [27] Evaluation of Sentinel-2 time-series for mapping floodplain grassland plant communities
    Rapinel, Sebastien
    Mony, Cendrine
    Lecoq, Lucie
    Clement, Bernard
    Thomas, Alban
    Hubert-Moy, Laurence
    REMOTE SENSING OF ENVIRONMENT, 2019, 223 : 115 - 129
  • [28] Pre-harvest classification of crop types using a Sentinel-2 time-series and machine learning
    Maponya, Mmamokoma Grace
    van Niekerk, Adriaan
    Mashimbye, Zama Eric
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 169 (169)
  • [29] Short-time-series grassland mapping using Sentinel-2 imagery and deep learning-based architecture
    Abdollahi, Abolfazl
    Liu, Yuxia
    Pradhan, Biswajeet
    Huete, Alfredo
    Dikshit, Abhirup
    Ngoc Nguyen Tran
    EGYPTIAN JOURNAL OF REMOTE SENSING AND SPACE SCIENCES, 2022, 25 (03): 673 - 685
  • [30] Paddy Rice Mapping Using a Dual-Path Spatio-Temporal Network Based on Annual Time-Series Sentinel-2 Images
    Wang, Hui
    Zhao, Bo
    Tang, Panpan
    Wang, Yuxiang
    Wan, Haoming
    Bai, Shi
    Wei, Ronghao
    IEEE ACCESS, 2022, 10 : 132584 - 132595