Deep convolutional neural networks and Swin transformer-based frameworks for individual date palm tree detection and mapping from large-scale UAV images

被引:11
|
作者
Gibril, Mohamed Barakat A. [1 ,2 ]
Shafri, Helmi Zulhaidi Mohd [1 ,2 ]
Shanableh, Abdallah [3 ,4 ]
Al-Ruzouq, Rami [3 ,4 ]
Wayayok, Aimrun [5 ]
bin Hashim, Shaiful Jahari [6 ]
Sachit, Mourtadha Sarhan [1 ,2 ]
机构
[1] Univ Putra Malaysia UPM, Dept Civil Engn, Fac Engn, Serdang, Selangor, Malaysia
[2] Univ Putra Malaysia UPM, Geospatial Informat Sci Res Ctr GISRC, Fac Engn, Serdang, Selangor, Malaysia
[3] Univ Sharjah, Fac Engn, Dept Civil & Environm Engn, Sharjah, U Arab Emirates
[4] Univ Sharjah, GIS & Remote Sensing Ctr, Res Inst Sci & Engn, Sharjah, U Arab Emirates
[5] Univ Putra Malaysia UPM, Fac Engn, Dept Biol & Agr Engn, Serdang, Selangor, Malaysia
[6] Univ Putra Malaysia UPM, Fac Engn, Dept Comp & Commun Syst Engn, Serdang, Selangor, Malaysia
关键词
Instance segmentation; mask R-CNN; Swin transformer; mask scoring R-CNN; SOLOv2; YOLACT; PointRend; individual tree crown delineation; PHOENIX-DACTYLIFERA L; CROWN;
D O I
10.1080/10106049.2022.2142966
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Timely and reliable mapping of individual date palm trees is essential for their monitoring, health and risk assessment, pest control, and sustainable management of the date palm industry. This study presents an instance segmentation framework for large-scale detection and mapping of date palm trees using unmanned aerial vehicle (UAV)-based images. First, a data conversion framework is created to convert UAV image tiles and ground-truth vector data into annotation format of Common Objects in Context. Second, this study examines the efficacy of various instance segmentation models, namely, mask region convolutional neural network (Mask R-CNN), Mask Scoring R-CNN, You Only Look At CoefficientTs, Point-based Rendering, Segmenting Objects by Locations (SOLO), and SOLOv2) with varying residual learning networks (ResNets) in detecting and delineating individual date palm trees. Furthermore, the performance of two variants of Swin Transformer networks with a feature pyramid network (FPN) (Swin-small-FPN and Swin-tiny-FPN) as Mask R-CNN network backbones was also evaluated. Third, we assess the generalizability of the evaluated instance segmentation models and backbones on different testing datasets with varying spatial resolutions. Results show that Mask R-CNN models based on Swin Transformers backbones outperform those with ResNets in the detection and segmentation of date palm trees with mAP(50) of 92% and 91% and F-measures of 94% and 93%. Moreover, the Mask scoring R-CNN-based ResNet-50 and Mask R-CNN with a Swin-small-FPN backbone outperform the evaluated models and demonstrate great generalizability in different datasets with diverse spatial resolutions. The proposed instance segmentation framework provides an efficient tool for date palm tree mapping from multi-scale UAV-based images and is valuable and suitable for individual tree crown delineations and other earth-related applications.
引用
收藏
页码:18569 / 18599
页数:31
相关论文
共 26 条
  • [21] Object Extraction From Very High-Resolution Images Using a Convolutional Neural Network Based on a Noisy Large-Scale Dataset
    Li, Panle
    He, Xiaohui
    Cheng, Xijie
    Gao, Xu
    Li, Runchuan
    Qia, Mengjia
    Li, Daidong
    Qiu, Fangbing
    Li, Zhiqiang
    IEEE ACCESS, 2019, 7 : 122784 - 122795
  • [22] Vehicle detection from high-resolution aerial images using spatial pyramid pooling-based deep convolutional neural networks
    Qu, Tao
    Zhang, Quanyuan
    Sun, Shilei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (20) : 21651 - 21663
  • [23] Oil Spill Detection Based on Deep Convolutional Neural Networks Using Polarimetric Scattering Information From Sentinel-1 SAR Images
    Ma, Xiaoshuang
    Xu, Jiangong
    Wu, Penghai
    Kong, Peng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [24] Vehicle detection from high-resolution aerial images using spatial pyramid pooling-based deep convolutional neural networks
    Tao Qu
    Quanyuan Zhang
    Shilei Sun
    Multimedia Tools and Applications, 2017, 76 : 21651 - 21663
  • [25] Comparing fully convolutional networks, random forest, support vector machine, and patch-based deep convolutional neural networks for object-based wetland mapping using images from small unmanned aircraft system
    Liu, Tao
    Abd-Elrahman, Amr
    Morton, Jon
    Wilhelm, Victor L.
    GISCIENCE & REMOTE SENSING, 2018, 55 (02) : 243 - 264
  • [26] COVID-19 Detection Using Deep Convolutional Neural Networks and Binary Differential Algorithm-Based Feature Selection from X-Ray Images
    Iraji, Mohammad Saber
    Feizi-Derakhshi, Mohammad-Reza
    Tanha, Jafar
    COMPLEXITY, 2021, 2021