A novel multitask transformer deep learning architecture for joint classification and segmentation of horticulture plantations using very High-Resolution satellite imagery

Cited by: 3
Authors
Vinod, P. V. [1, 2]
Behera, M. D. [2]
Prakash, A. Jaya [2]
Hebbar, R. [1]
Srivastav, S. K. [3]
Affiliations
[1] Reg Remote Sensing Ctr RRSC NRSC, Bangalore, India
[2] Indian Inst Technol Kharagpur, Ctr Ocean River Atmosphere & Land Sci, Kharagpur, West Bengal, India
[3] ISRO, Reg Ctr, NRSC, New Delhi, India
Keywords
DeiT; U-Net; Transformer; Joint loss; Intersection over Union (IoU); Classification and segmentation
DOI
10.1016/j.compag.2024.109540
Chinese Library Classification (CLC) number
S [Agricultural Sciences]
Discipline classification code
09
Abstract
This study introduces MultiTaskDeiTUNet, a novel multitask deep learning architecture designed to tackle the dual challenges of classifying tree plantation densities and segmenting tree crowns in high-resolution (0.7 m) satellite imagery. The core challenge lies in the overlapping spatial patterns of various tree species and densities, which complicate the accurate extraction and classification of individual tree crowns. MultiTaskDeiTUNet integrates the Data-efficient Image Transformer (DeiT) with the U-Net model, harnessing DeiT's strength in contextual detail recognition and spatial dependency capture for density classification, alongside U-Net's proficiency in capturing low-level features for precise crown segmentation. By addressing the complexities of high-resolution data handling and the simultaneous execution of classification and segmentation tasks, MultiTaskDeiTUNet achieves an average F1 score of 0.91 (± 0.03) and precise tree crown segmentation with an mIoU of 0.73 (± 0.01). The DeiT backbone learns shared features such as canopy shapes and spatial arrangements, which are crucial for both tasks and enhance overall model performance. Ablation studies underscore the specialized roles of each component: freezing DeiT's weights reduces classification accuracy to an average F1 score of 0.48 (± 0.08), while freezing U-Net's weights reduces the mIoU to 0.29 (± 0.12). This differentiation highlights DeiT's excellence in classification tasks and U-Net's superiority in segmentation. Substituting the DeiT model with a standard ViT model further highlights the effectiveness of DeiT: the ViT model demonstrated lower accuracy, with an average F1 score of 0.87 (± 0.05) compared to DeiT's F1 score of 0.91 (± 0.03). Statistical analysis revealed right-skewed distributions in tree crown areas across density categories.
The efficacy of MultiTaskDeiTUNet in tree plantation analysis indicates its potential applicability to a wide range of horticultural plants. Customizing the architecture to species-specific characteristics and varying image resolutions could provide valuable insights for improving management and conservation practices across diverse agricultural and forest ecosystems.
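To make the reported quantities concrete, the following is a minimal sketch of the two metrics named in the abstract (average F1 for density classification, mIoU for crown segmentation) and of a weighted joint loss of the kind the keywords refer to. It is an illustration under stated assumptions, not the authors' implementation: the function names, the binary crown/background mask, the choice of cross-entropy terms, and the weight `alpha` are all hypothetical, since the abstract does not specify how the joint loss is composed or weighted.

```python
import math


def macro_f1(y_true, y_pred, num_classes):
    """Unweighted mean of per-class F1 scores (classification task)."""
    scores = []
    for c in range(num_classes):
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        denom = 2 * tp + fp + fn
        # A class with no true or predicted samples contributes 0.0 here.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / num_classes


def mean_iou(mask_true, mask_pred, num_classes=2):
    """Mean Intersection over Union across classes for flattened masks."""
    ious = []
    for c in range(num_classes):
        pairs = list(zip(mask_true, mask_pred))
        inter = sum(1 for t, p in pairs if t == c and p == c)
        union = sum(1 for t, p in pairs if t == c or p == c)
        if union:  # skip classes absent from both masks
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


def cross_entropy(class_probs, label):
    """Negative log-likelihood of the true class for one image."""
    return -math.log(max(class_probs[label], 1e-12))


def binary_cross_entropy(pixel_probs, mask):
    """Mean per-pixel binary cross-entropy for a crown/background mask."""
    eps = 1e-12
    total = 0.0
    for p, m in zip(pixel_probs, mask):
        p = min(max(p, eps), 1 - eps)
        total -= m * math.log(p) + (1 - m) * math.log(1 - p)
    return total / len(mask)


def joint_loss(class_probs, label, pixel_probs, mask, alpha=0.5):
    """Assumed joint loss: weighted sum of the two task losses.

    alpha is a hypothetical weighting; the abstract does not state one.
    """
    cls = cross_entropy(class_probs, label)
    seg = binary_cross_entropy(pixel_probs, mask)
    return alpha * cls + (1 - alpha) * seg


if __name__ == "__main__":
    # Three density classes, four image tiles.
    print(macro_f1([0, 1, 2, 1], [0, 1, 1, 1], num_classes=3))
    # Four-pixel crown (1) / background (0) mask.
    print(mean_iou([1, 1, 0, 0], [1, 0, 0, 0]))
    print(joint_loss([0.5, 0.5], 0, [0.5, 0.5], [1, 0]))
```

In a setup like this, both loss terms would backpropagate through a shared backbone, which is one plausible reading of why freezing either component in the ablation degrades the corresponding task.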
Pages: 16