A novel multitask transformer deep learning architecture for joint classification and segmentation of horticulture plantations using very High-Resolution satellite imagery

Cited by: 3
Authors
Vinod, P. V. [1, 2]
Behera, M. D. [2]
Prakash, A. Jaya [2]
Hebbar, R. [1]
Srivastav, S. K. [3]
Affiliations
[1] Reg Remote Sensing Ctr RRSC NRSC, Bangalore, India
[2] Indian Inst Technol Kharagpur, Ctr Ocean River Atmosphere & Land Sci, Kharagpur, West Bengal, India
[3] ISRO, Reg Ctr, NRSC, New Delhi, India
Keywords
DeiT; U-Net; Transformer; Joint loss; Intersection over Union (IoU); Classification and segmentation
DOI
10.1016/j.compag.2024.109540
Chinese Library Classification
S [Agricultural Sciences]
Discipline classification code
09
Abstract
This study introduces MultiTaskDeiTUNet, a novel multitask deep learning architecture designed to tackle the dual challenges of classifying tree plantation densities and segmenting tree crowns in high-resolution (0.7 m) satellite imagery. The core challenge lies in the overlapping spatial patterns of various tree species and densities, which complicate the accurate extraction and classification of individual tree crowns. MultiTaskDeiTUNet integrates the Data-efficient Image Transformer (DeiT) with the U-Net model, harnessing DeiT's strength in contextual detail recognition and spatial dependency capture for density classification, alongside U-Net's proficiency in capturing low-level features for precise crown segmentation. By addressing the complexities of high-resolution data handling and the simultaneous execution of classification and segmentation tasks, MultiTaskDeiTUNet achieves an average F1 score of 0.91 (± 0.03) and precise tree crown segmentation with an mIoU of 0.73 (± 0.01). The DeiT backbone adeptly learns shared features such as canopy shapes and spatial arrangements, which are crucial for both tasks and enhance overall model performance. Ablation studies underscore the specialized roles of each component: freezing DeiT's weights results in reduced classification accuracy with an average F1 score of 0.48 (± 0.08), while freezing U-Net's weights yields a reduced mIoU of 0.29 (± 0.12). This differentiation highlights DeiT's excellence in classification tasks and U-Net's superiority in segmentation. Substituting the DeiT model with a standard ViT model further highlights the effectiveness of DeiT, as the ViT model demonstrated lower accuracy, with an average F1 score of 0.87 (± 0.05) compared to DeiT's F1 score of 0.91 (± 0.03). Statistical analysis revealed right-skewed distributions in tree crown areas across density categories.
The efficacy of MultiTaskDeiTUNet in tree plantation analysis indicates its potential applicability to a wide range of horticultural plants. Customizing the architecture to species-specific characteristics and varying image resolutions could provide valuable insights for improving management and conservation practices across diverse agricultural and forest ecosystems.
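The abstract names a joint loss and reports segmentation quality as mIoU but gives no formulas. The sketch below is a minimal pure-Python illustration of how such quantities are commonly computed. It is not the authors' implementation: the choice of cross-entropy plus binary cross-entropy, the weights `w_cls` and `w_seg`, the 0.5 threshold, and all function names are assumptions made for illustration.

```python
import math

EPS = 1e-12  # numerical floor to avoid log(0)


def cross_entropy(class_probs, label):
    """Negative log-likelihood of the true density class for one image."""
    return -math.log(max(class_probs[label], EPS))


def binary_cross_entropy(pred_mask, true_mask):
    """Mean per-pixel binary cross-entropy.

    pred_mask holds crown probabilities in [0, 1]; true_mask holds 0/1 labels.
    """
    total, count = 0.0, 0
    for pred_row, true_row in zip(pred_mask, true_mask):
        for p, t in zip(pred_row, true_row):
            p = min(max(p, EPS), 1.0 - EPS)
            total -= t * math.log(p) + (1 - t) * math.log(1.0 - p)
            count += 1
    return total / count


def joint_loss(class_probs, label, pred_mask, true_mask, w_cls=1.0, w_seg=1.0):
    """Weighted sum of the two task losses (weights are hypothetical)."""
    return (w_cls * cross_entropy(class_probs, label)
            + w_seg * binary_cross_entropy(pred_mask, true_mask))


def class_iou(pred_mask, true_mask, cls, threshold=0.5):
    """Intersection over union for one class (1 = crown, 0 = background)."""
    inter = union = 0
    for pred_row, true_row in zip(pred_mask, true_mask):
        for p, t in zip(pred_row, true_row):
            pred_is_cls = (1 if p >= threshold else 0) == cls
            true_is_cls = t == cls
            inter += pred_is_cls and true_is_cls
            union += pred_is_cls or true_is_cls
    # Convention assumed here: a class absent from both masks scores 1.0.
    return inter / union if union else 1.0


def mean_iou(pred_mask, true_mask, threshold=0.5):
    """Mean of the crown and background IoU values."""
    return (class_iou(pred_mask, true_mask, 1, threshold)
            + class_iou(pred_mask, true_mask, 0, threshold)) / 2.0
```

With a single shared loss of this kind, gradients from both tasks reach the shared backbone, which is one plausible reading of why the abstract credits shared features with helping both classification and segmentation.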
Pages: 16