A novel multitask transformer deep learning architecture for joint classification and segmentation of horticulture plantations using very High-Resolution satellite imagery

Cited by: 3

Authors:

Vinod, P. V. ^{[1,2]}

Behera, M. D. ^{[2]}

Prakash, A. Jaya ^{[2]}

Hebbar, R. ^{[1]}

Srivastav, S. K. ^{[3]}

Affiliations:

[1] Reg Remote Sensing Ctr RRSC NRSC, Bangalore, India

[2] Indian Inst Technol Kharagpur, Ctr Ocean River Atmosphere & Land Sci, Kharagpur, West Bengal, India

[3] ISRO, Reg Ctr, NRSC, New Delhi, India

Source:

COMPUTERS AND ELECTRONICS IN AGRICULTURE | 2024, Vol. 227

Keywords:

DeiT; U-Net; Transformer; Joint loss; Intersection of Union (IoU); Classification and segmentation

DOI:

10.1016/j.compag.2024.109540

Chinese Library Classification (CLC) number:

S [Agricultural Sciences]

Discipline classification code:

09

Abstract:

This study introduces MultiTaskDeiTUNet, a novel multitask deep learning architecture designed to tackle the dual challenges of classifying tree plantation densities and segmenting tree crowns in high-resolution (0.7 m) satellite imagery. The core challenge lies in the overlapping spatial patterns of various tree species and densities, complicating the accurate extraction and classification of individual tree crowns. MultiTaskDeiTUNet integrates the Data-efficient Image Transformer (DeiT) with the U-Net model, harnessing DeiT's strength in contextual detail recognition and spatial dependency capture for density classification, alongside U-Net's proficiency in capturing low-level features for precise crown segmentation. By addressing the complexities of high-resolution data handling and the simultaneous execution of classification and segmentation tasks, MultiTaskDeiTUNet achieves an average F1 score of 0.91 (± 0.03) and a precise tree crown segmentation with mIoU of 0.73 (± 0.01). The DeiT backbone adeptly learns shared features such as canopy shapes and spatial arrangements, which are crucial for both tasks and enhance overall model performance. Ablation studies underscore the specialized roles of each component: freezing DeiT's weights results in reduced classification accuracy with an average F1 score of 0.48 (± 0.08), while freezing U-Net's weights yields a reduced mIoU of 0.29 (± 0.12). This differentiation highlights DeiT's excellence in classification tasks and U-Net's superiority in segmentation. Substituting the DeiT model with a standard ViT model further highlights the effectiveness of DeiT, as the ViT model demonstrated lower accuracy, with an average F1 score of 0.87 (± 0.05) compared to DeiT's F1 score of 0.91 (± 0.03). Statistical analysis revealed right-skewed distributions in tree crown areas across density categories.
The efficacy of MultiTaskDeiTUNet in tree plantation analysis indicates its potential applicability to a wide range of horticultural plants. Customizing the architecture to species-specific characteristics and varying image resolutions could provide valuable insights for improving management and conservation practices across diverse agricultural and forest ecosystems.
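The abstract reports results as an average F1 score for density classification and mIoU for crown segmentation, and the keywords mention a joint loss. The following is a minimal pure-Python sketch of how such quantities are commonly computed. It is not the authors' implementation: the function names, the per-image averaging used for mIoU, the macro averaging used for F1, and the weighted-sum form of the joint loss with weight `alpha` are all assumptions made for illustration, since the abstract does not define them.

```python
from typing import Sequence


def iou(pred: Sequence[int], target: Sequence[int]) -> float:
    """Intersection over Union of two flattened binary masks (1 = tree crown)."""
    inter = sum(1 for p, t in zip(pred, target) if p == 1 and t == 1)
    union = sum(1 for p, t in zip(pred, target) if p == 1 or t == 1)
    # Assumption: two empty masks count as a perfect match.
    return inter / union if union else 1.0


def mean_iou(preds: Sequence[Sequence[int]], targets: Sequence[Sequence[int]]) -> float:
    """Mean IoU, averaged over images (assumed; the paper may average differently)."""
    scores = [iou(p, t) for p, t in zip(preds, targets)]
    return sum(scores) / len(scores)


def macro_f1(y_true: Sequence[int], y_pred: Sequence[int], num_classes: int) -> float:
    """Unweighted mean of per-class F1 scores over density classes."""
    f1_scores = []
    for c in range(num_classes):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        denom = 2 * tp + fp + fn
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / num_classes


def joint_loss(cls_loss: float, seg_loss: float, alpha: float = 0.5) -> float:
    """Hypothetical joint loss: weighted sum of the two task losses."""
    return alpha * cls_loss + (1.0 - alpha) * seg_loss
```

In a multitask setup of this kind, a single scalar such as `joint_loss` is typically what gets backpropagated through the shared backbone, which is one way both tasks can shape the shared features the abstract describes.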


Pages: 16
