A novel multitask transformer deep learning architecture for joint classification and segmentation of horticulture plantations using very High-Resolution satellite imagery

Cited by: 3
Authors
Vinod, P. V. [1, 2]
Behera, M. D. [2]
Prakash, A. Jaya [2]
Hebbar, R. [1]
Srivastav, S. K. [3]
Affiliations
[1] Reg Remote Sensing Ctr RRSC NRSC, Bangalore, India
[2] Indian Inst Technol Kharagpur, Ctr Ocean River Atmosphere & Land Sci, Kharagpur, West Bengal, India
[3] ISRO, Reg Ctr, NRSC, New Delhi, India
Keywords
DeiT; U-Net; Transformer; Joint loss; Intersection over Union (IoU); Classification and segmentation
DOI
10.1016/j.compag.2024.109540
Chinese Library Classification
S [Agricultural Sciences]
Discipline classification code
09
Abstract
This study introduces MultiTaskDeiTUNet, a novel multitask deep learning architecture designed to tackle the dual challenges of classifying tree plantation densities and segmenting tree crowns in high-resolution (0.7 m) satellite imagery. The core challenge lies in the overlapping spatial patterns of various tree species and densities, which complicate the accurate extraction and classification of individual tree crowns. MultiTaskDeiTUNet integrates the Data-efficient Image Transformer (DeiT) with the U-Net model, harnessing DeiT's strength in contextual detail recognition and spatial dependency capture for density classification, alongside U-Net's proficiency in capturing low-level features for precise crown segmentation. By addressing the complexities of high-resolution data handling and the simultaneous execution of classification and segmentation tasks, MultiTaskDeiTUNet achieves an average F1 score of 0.91 (± 0.03) and precise tree crown segmentation with an mIoU of 0.73 (± 0.01). The DeiT backbone learns shared features such as canopy shapes and spatial arrangements, which are crucial for both tasks and enhance overall model performance. Ablation studies underscore the specialized roles of each component: freezing DeiT's weights reduces classification accuracy to an average F1 score of 0.48 (± 0.08), while freezing U-Net's weights reduces mIoU to 0.29 (± 0.12). This differentiation highlights DeiT's excellence in classification tasks and U-Net's superiority in segmentation. Substituting the DeiT model with a standard ViT model further highlights the effectiveness of DeiT: the ViT variant achieved a lower average F1 score of 0.87 (± 0.05), compared with 0.91 (± 0.03) for DeiT. Statistical analysis revealed right-skewed distributions in tree crown areas across density categories.
The efficacy of MultiTaskDeiTUNet in tree plantation analysis indicates its potential applicability to a wide range of horticultural plants. Customizing the architecture to species-specific characteristics and varying image resolutions could provide valuable insights for improving management and conservation practices across diverse agricultural and forest ecosystems.
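The abstract names a joint loss and reports mIoU but does not give either formula. The following is a minimal sketch of how a joint classification-plus-segmentation objective and a mean IoU metric are commonly computed. The weighting factor `alpha`, the soft-IoU segmentation term, and all function names are illustrative assumptions, not the authors' formulation.

```python
import math

def cross_entropy(class_probs, label):
    """Negative log-likelihood of the true class for one sample."""
    return -math.log(max(class_probs[label], 1e-12))

def soft_iou_loss(pred_mask, true_mask, eps=1e-6):
    """1 - soft IoU between predicted probabilities and a binary mask (flat lists)."""
    inter = sum(p * t for p, t in zip(pred_mask, true_mask))
    union = sum(p + t - p * t for p, t in zip(pred_mask, true_mask))
    return 1.0 - (inter + eps) / (union + eps)

def joint_loss(class_probs, label, pred_mask, true_mask, alpha=0.5):
    """Weighted sum of the two task losses; alpha is an assumed hyperparameter."""
    cls_term = cross_entropy(class_probs, label)
    seg_term = soft_iou_loss(pred_mask, true_mask)
    return alpha * cls_term + (1.0 - alpha) * seg_term

def mean_iou(pred, true, num_classes):
    """Mean IoU over classes that occur in the prediction or the ground truth."""
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, true) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, true) if p == c or t == c)
        if union:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0

# Toy usage: a perfect prediction gives zero joint loss.
perfect = joint_loss([0.0, 1.0], 1, [1.0, 0.0], [1, 0])
# Toy usage: per-pixel labels for a 4-pixel image with classes {0, 1}.
score = mean_iou([0, 1, 1, 0], [0, 1, 0, 0], num_classes=2)
```

In a trained network both terms would be computed on tensors and backpropagated through the shared backbone; the plain-list version here only illustrates the arithmetic.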
Pages: 16