A novel multitask transformer deep learning architecture for joint classification and segmentation of horticulture plantations using very High-Resolution satellite imagery

Cited by: 3
Authors
Vinod, P. V. [1 ,2 ]
Behera, M. D. [2 ]
Prakash, A. Jaya [2 ]
Hebbar, R. [1 ]
Srivastav, S. K. [3 ]
Affiliations
[1] Reg Remote Sensing Ctr RRSC NRSC, Bangalore, India
[2] Indian Inst Technol Kharagpur, Ctr Ocean River Atmosphere & Land Sci, Kharagpur, West Bengal, India
[3] ISRO, Reg Ctr, NRSC, New Delhi, India
Keywords
DeiT; U-Net; Transformer; Joint loss; Intersection of Union (IoU); Classification and segmentation
DOI
10.1016/j.compag.2024.109540
Chinese Library Classification number
S [Agricultural Sciences]
Discipline classification code
09
Abstract
This study introduces MultiTaskDeiTUNet, a novel multitask deep learning architecture designed to tackle the dual challenges of classifying tree plantation densities and segmenting tree crowns in high-resolution (0.7 m) satellite imagery. The core challenge lies in the overlapping spatial patterns of various tree species and densities, which complicate the accurate extraction and classification of individual tree crowns. MultiTaskDeiTUNet integrates the Data-efficient Image Transformer (DeiT) with the U-Net model, harnessing DeiT's strength in contextual detail recognition and spatial dependency capture for density classification, alongside U-Net's proficiency in capturing low-level features for precise crown segmentation. By addressing the complexities of high-resolution data handling and the simultaneous execution of classification and segmentation tasks, MultiTaskDeiTUNet achieves an average F1 score of 0.91 (± 0.03) and precise tree crown segmentation with an mIoU of 0.73 (± 0.01). The DeiT backbone adeptly learns shared features such as canopy shapes and spatial arrangements, which are crucial for both tasks and enhance overall model performance. Ablation studies underscore the specialized roles of each component: freezing DeiT's weights results in reduced classification accuracy with an average F1 score of 0.48 (± 0.08), while freezing U-Net's weights yields a reduced mIoU of 0.29 (± 0.12). This differentiation highlights DeiT's excellence in classification tasks and U-Net's superiority in segmentation. Substituting the DeiT model with a standard ViT model further highlights the effectiveness of DeiT, as the ViT model demonstrated lower accuracy, with an average F1 score of 0.87 (± 0.05) compared to DeiT's F1 score of 0.91 (± 0.03). Statistical analysis revealed right-skewed distributions in tree crown areas across density categories.
The efficacy of MultiTaskDeiTUNet in tree plantation analysis indicates its potential applicability to a wide range of horticultural plants. Customizing the architecture to species-specific characteristics and varying image resolutions could provide valuable insights for improving management and conservation practices across diverse agricultural and forest ecosystems.
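The abstract names a joint loss and reports mIoU but does not give either formula. The following is a minimal NumPy sketch of how such quantities are commonly computed, not the authors' implementation. The function names, the `seg_weight` parameter, the use of binary cross-entropy for the segmentation term, and the averaging of crown and background IoU are all assumptions made for illustration.

```python
import numpy as np

def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample; logits has shape (num_classes,)."""
    shifted = logits - logits.max()  # stabilise the exponentials
    log_probs = shifted - np.log(np.exp(shifted).sum())
    return -log_probs[label]

def binary_iou(pred_mask, true_mask):
    """IoU of two boolean masks; treated as 1.0 when both masks are empty."""
    union = np.logical_or(pred_mask, true_mask).sum()
    if union == 0:
        return 1.0
    return np.logical_and(pred_mask, true_mask).sum() / union

def mean_iou(pred_mask, true_mask):
    """Assumed mIoU: mean of the crown (foreground) and background IoU."""
    return 0.5 * (binary_iou(pred_mask, true_mask)
                  + binary_iou(~pred_mask, ~true_mask))

def joint_loss(class_logits, class_label, crown_probs, crown_mask, seg_weight=1.0):
    """Hypothetical joint loss: density cross-entropy + weighted pixel-wise BCE."""
    eps = 1e-7
    p = np.clip(crown_probs, eps, 1 - eps)  # avoid log(0)
    mask = crown_mask.astype(float)
    bce = -np.mean(mask * np.log(p) + (1 - mask) * np.log(1 - p))
    return cross_entropy(class_logits, class_label) + seg_weight * bce
```

In a multitask setting of this kind, a single scalar such as `joint_loss` lets gradients from both the classification head and the segmentation head update the shared backbone; how the two terms are actually weighted in the paper is not stated in the abstract.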
Pages: 16