A lightweight CNN-Transformer network for pixel-based crop mapping using time-series Sentinel-2 imagery

Cited by: 5

Authors:

Wang, Yumiao ^{[1,2]}

Feng, Luwei ^{[3]}

Sun, Weiwei ^{[1]}

Wang, Lihua ^{[1]}

Yang, Gang ^{[1]}

Chen, Binjie ^{[1]}

Affiliations:

[1] Ningbo Univ, Dept Geog & Spatial Informat Tech, Ningbo 315211, Peoples R China

[2] Ningbo Univ, Inst East China Sea, Ningbo 315211, Zhejiang, Peoples R China

[3] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China

Source:

COMPUTERS AND ELECTRONICS IN AGRICULTURE | 2024 / Vol. 226

Funding:

China Postdoctoral Science Foundation; National Natural Science Foundation of China;

Keywords:

Crop mapping; Convolutional neural network; Transformer; Pixel-based classification; Temporal Sentinel-2 data;

DOI:

10.1016/j.compag.2024.109370

Chinese Library Classification number:

S [Agricultural Sciences];

Discipline classification code:

09;

Abstract:

Deep learning approaches have provided state-of-the-art performance in crop mapping. Recently, several studies have combined the strengths of two dominant deep learning architectures, Convolutional Neural Networks (CNNs) and Transformers, to classify crops using remote sensing images. Despite their success, many of these models use patch-based methods that require extensive data labeling, as each sample contains multiple pixels with corresponding labels. This leads to higher costs in data preparation and processing. Moreover, previous methods rarely considered the impact of missing values caused by clouds and missing observations in remote sensing data. Therefore, this study proposes a lightweight multi-stage CNN-Transformer network (MCTNet) for pixel-based crop mapping using time-series Sentinel-2 imagery. MCTNet consists of several successive modules, each containing a CNN sub-module and a Transformer sub-module that extract important features from the images. An attention-based learnable positional encoding (ALPE) module is designed in the Transformer sub-module to capture the complex temporal relations in time-series data with different missing rates. Arkansas and California in the U.S. are selected to evaluate the model. Experimental results show that MCTNet is lightweight, with the fewest parameters and the lowest memory usage, while achieving superior performance compared to eight advanced models. Specifically, MCTNet obtained an overall accuracy (OA) of 0.968, a kappa coefficient (Kappa) of 0.951, and a macro-averaged F1 score (F1) of 0.933 in Arkansas, and an OA of 0.852, a Kappa of 0.806, and an F1 score of 0.829 in California. The results highlight the importance of each component of the model, particularly the ALPE module, which enhanced the Kappa of MCTNet by 4.2% in Arkansas and improved the model's robustness to missing values in remote sensing data. Additionally, visualization results demonstrated that the features extracted from the CNN and Transformer sub-modules are complementary, explaining the effectiveness of MCTNet.
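
The three metrics reported in the abstract (OA, Kappa, macro-averaged F1) can all be derived from a confusion matrix. The following is a minimal sketch using their standard definitions, not the authors' evaluation code; the function names and the toy labels are hypothetical and chosen only for illustration.

```python
import numpy as np

def confusion_matrix(y_true, y_pred, n_classes):
    """Rows are reference classes, columns are predicted classes."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    cm = np.zeros((n_classes, n_classes), dtype=np.int64)
    np.add.at(cm, (y_true, y_pred), 1)  # accumulate one count per pixel
    return cm

def overall_accuracy(cm):
    """Fraction of pixels on the diagonal (correctly classified)."""
    return np.trace(cm) / cm.sum()

def cohen_kappa(cm):
    """Agreement corrected for chance: (po - pe) / (1 - pe)."""
    n = cm.sum()
    po = np.trace(cm) / n
    pe = (cm.sum(axis=0) * cm.sum(axis=1)).sum() / n**2
    return (po - pe) / (1 - pe)

def macro_f1(cm):
    """Unweighted mean of per-class F1 = 2*TP / (2*TP + FP + FN)."""
    tp = np.diag(cm).astype(float)
    denom = cm.sum(axis=0) + cm.sum(axis=1)  # equals 2*TP + FP + FN
    f1 = np.where(denom > 0, 2 * tp / np.maximum(denom, 1), 0.0)
    return f1.mean()

# Toy example with three hypothetical crop classes (not data from the paper).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 1, 2, 2, 2]
cm = confusion_matrix(y_true, y_pred, n_classes=3)
print(overall_accuracy(cm), cohen_kappa(cm), macro_f1(cm))
```

Because macro-averaged F1 weights every class equally, it is more sensitive than OA to errors on rare crop types, which is one reason the three scores are usually reported together.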

Pages: 17
