Deep convolutional neural networks and Swin transformer-based frameworks for individual date palm tree detection and mapping from large-scale UAV images

被引:11
|
作者
Gibril, Mohamed Barakat A. [1 ,2 ]
Shafri, Helmi Zulhaidi Mohd [1 ,2 ]
Shanableh, Abdallah [3 ,4 ]
Al-Ruzouq, Rami [3 ,4 ]
Wayayok, Aimrun [5 ]
bin Hashim, Shaiful Jahari [6 ]
Sachit, Mourtadha Sarhan [1 ,2 ]
机构
[1] Univ Putra Malaysia UPM, Dept Civil Engn, Fac Engn, Serdang, Selangor, Malaysia
[2] Univ Putra Malaysia UPM, Geospatial Informat Sci Res Ctr GISRC, Fac Engn, Serdang, Selangor, Malaysia
[3] Univ Sharjah, Fac Engn, Dept Civil & Environm Engn, Sharjah, U Arab Emirates
[4] Univ Sharjah, GIS & Remote Sensing Ctr, Res Inst Sci & Engn, Sharjah, U Arab Emirates
[5] Univ Putra Malaysia UPM, Fac Engn, Dept Biol & Agr Engn, Serdang, Selangor, Malaysia
[6] Univ Putra Malaysia UPM, Fac Engn, Dept Comp & Commun Syst Engn, Serdang, Selangor, Malaysia
关键词
Instance segmentation; mask R-CNN; Swin transformer; mask scoring R-CNN; SOLOv2; YOLACT; PointRend; individual tree crown delineation; PHOENIX-DACTYLIFERA L; CROWN;
D O I
10.1080/10106049.2022.2142966
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Timely and reliable mapping of individual date palm trees is essential for their monitoring, health and risk assessment, pest control, and sustainable management of the date palm industry. This study presents an instance segmentation framework for large-scale detection and mapping of date palm trees using unmanned aerial vehicle (UAV)-based images. First, a data conversion framework is created to convert UAV image tiles and ground-truth vector data into annotation format of Common Objects in Context. Second, this study examines the efficacy of various instance segmentation models, namely, mask region convolutional neural network (Mask R-CNN), Mask Scoring R-CNN, You Only Look At CoefficientTs, Point-based Rendering, Segmenting Objects by Locations (SOLO), and SOLOv2) with varying residual learning networks (ResNets) in detecting and delineating individual date palm trees. Furthermore, the performance of two variants of Swin Transformer networks with a feature pyramid network (FPN) (Swin-small-FPN and Swin-tiny-FPN) as Mask R-CNN network backbones was also evaluated. Third, we assess the generalizability of the evaluated instance segmentation models and backbones on different testing datasets with varying spatial resolutions. Results show that Mask R-CNN models based on Swin Transformers backbones outperform those with ResNets in the detection and segmentation of date palm trees with mAP(50) of 92% and 91% and F-measures of 94% and 93%. Moreover, the Mask scoring R-CNN-based ResNet-50 and Mask R-CNN with a Swin-small-FPN backbone outperform the evaluated models and demonstrate great generalizability in different datasets with diverse spatial resolutions. The proposed instance segmentation framework provides an efficient tool for date palm tree mapping from multi-scale UAV-based images and is valuable and suitable for individual tree crown delineations and other earth-related applications.
引用
收藏
页码:18569 / 18599
页数:31
相关论文
共 26 条
  • [1] Deep Convolutional Neural Network for Large-Scale Date Palm Tree Mapping from UAV-Based Images
    Gibril, Mohamed Barakat A.
    Shafri, Helmi Zulhaidi Mohd
    Shanableh, Abdallah
    Al-Ruzouq, Rami
    Wayayok, Aimrun
    Hashim, Shaiful Jahari
    REMOTE SENSING, 2021, 13 (14)
  • [2] Large-Scale Date Palm Tree Segmentation from Multiscale UAV-Based and Aerial Images Using Deep Vision Transformers
    Gibril, Mohamed Barakat A.
    Shafri, Helmi Zulhaidi Mohd
    Al-Ruzouq, Rami
    Shanableh, Abdallah
    Nahas, Faten
    Al Mansoori, Saeed
    DRONES, 2023, 7 (02)
  • [3] Deep convolutional neural network based large-scale oil palm tree detection for high-resolution remote sensing images
    Li, Weijia
    Fu, Haohuan
    Yu, Le
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 846 - 849
  • [4] Large-Scale Mapping of Small Roads in Lidar Images Using Deep Convolutional Neural Networks
    Salberg, Arnt-Borre
    Trier, Oivind Due
    Kampffmeyer, Michael
    IMAGE ANALYSIS, SCIA 2017, PT II, 2017, 10270 : 193 - 204
  • [5] Large-Scale Oil Palm Tree Detection from High-Resolution Satellite Images Using Two-Stage Convolutional Neural Networks
    Li, Weijia
    Dong, Runmin
    Fu, Haohuan
    Yu, Le
    REMOTE SENSING, 2019, 11 (01)
  • [6] Large-Scale Solar Panel Mapping from Aerial Images Using Deep Convolutional Networks
    Yuan, Jiangye
    Yang, Hsiu-Han Lexie
    Omitaomu, Olufemi A.
    Bhaduri, Budhendra L.
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2703 - 2708
  • [7] Swin Transformer-Based Multiscale Attention Model for Landslide Extraction From Large-Scale Area
    Gao, Mengjie
    Chen, Fang
    Wang, Lei
    Zhao, Huichen
    Yu, Bo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [8] Large-scale assessment of date palm plantations based on UAV remote sensing and multiscale vision transformer
    Gibril, Mohamed Barakat A.
    Shafri, Helmi Zulhaidi Mohd
    Shanableh, Abdallah
    Al-Ruzouq, Rami
    Hashim, Shaiful Jahari bin
    Wayayok, Aimrun
    Sachit, Mourtadha Sarhan
    REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2024, 34
  • [9] Spectral-Spatial transformer-based semantic segmentation for large-scale mapping of individual date palm trees using very high-resolution satellite data
    Al-Ruzouq, Rami
    Gibril, Mohamed Barakat A.
    Shanableh, Abdallah
    Bolcek, Jan
    Lamghari, Fouad
    Hammour, Nezar Atalla
    El-Keblawy, Ali
    Jena, Ratiranjan
    ECOLOGICAL INDICATORS, 2024, 163
  • [10] Automatic Detection of Oil Palm Tree from UAV Images Based on the Deep Learning Method
    Liu, Xinni
    Ghazali, Kamarul Hawari
    Han, Fengrong
    Mohamed, Izzeldin Ibrahim
    APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (01) : 13 - 24