A Comprehensive Evaluation of Monocular Depth Estimation Methods in Low-Altitude Forest Environment

被引:0
|
作者
Jia, Jiwen [1 ]
Kang, Junhua [1 ]
Chen, Lin [2 ]
Gao, Xiang [1 ]
Zhang, Borui [1 ]
Yang, Guijun [1 ,3 ]
机构
[1] Changan Univ, Coll Geol Engn & Geomat, Xian 710054, Peoples R China
[2] VISCODA GmbH, Schneiderberg 32, D-30167 Hannover, Germany
[3] Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Key Lab Quantitat Remote Sensing Agr, Minist Agr & Rural Affairs, Beijing 100097, Peoples R China
关键词
monocular depth estimation; CNN; vision transformer; forest environment; comparative study;
D O I
10.3390/rs17040717
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Monocular depth estimation (MDE) is a critical computer vision task that enhances environmental perception in fields such as autonomous driving and robot navigation. In recent years, deep learning-based MDE methods have achieved notable progress in these fields. However, achieving robust monocular depth estimation in low-altitude forest environments remains challenging, particularly in scenes with dense and cluttered foliage, which complicates applications in environmental monitoring, agriculture, and search and rescue operations. This paper presents a comprehensive evaluation of state-of-the-art deep learning-based MDE methods on low-altitude forest datasets. The evaluated models include both self-supervised and supervised approaches, employing different network structures such as convolutional neural networks (CNNs) and Vision Transformers (ViTs). We assessed the generalization of these approaches across diverse low-altitude scenarios, specifically focusing on forested environments. A systematic set of evaluation criteria is employed, comprising traditional image-based global statistical metrics as well as geometry-aware metrics, to provide a more comprehensive evaluation of depth estimation performance. The results indicate that most Transformer-based models, such as DepthAnything and Metric3D, outperform traditional CNN-based models in complex forest environments by capturing detailed tree structures and depth discontinuities. Conversely, CNN-based models like MiDas and Adabins struggle with handling depth discontinuities and complex occlusions, yielding less detailed predictions. On the Mid-Air dataset, the Transformer-based DepthAnything demonstrates a 54.2% improvement in RMSE for the global error metric compared to the CNN-based Adabins. On the LOBDM dataset, the CNN-based MiDas has the depth edge completeness error of 93.361, while the Transformer-based Metric3D demonstrates the significantly lower error of only 5.494. These findings highlight the potential of Transformer-based approaches for monocular depth estimation in low-altitude forest environments, with implications for high-throughput plant phenotyping, environmental monitoring, and other forest-specific applications.
引用
收藏
页数:30
相关论文
共 50 条
  • [41] Scene-aware refinement network for unsupervised monocular depth estimation in ultra-low altitude oblique photography of UAV
    Yu, Kailong
    Li, Hao
    Xing, Linjie
    Wen, Tao
    Fu, Die
    Yang, Yang
    Zhou, Chengjiang
    Chang, Rong
    Zhao, Shan
    Xing, Lin
    Bai, Haicheng
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 205 : 284 - 300
  • [42] Research on Defense-in-Depth Zone of Low-Altitude Security Area in Nuclear Power Plant
    Ye, Lin
    Zhang, Jie
    Meng, Guang
    NUCLEAR POWER PLANTS: INNOVATIVE TECHNOLOGIES FOR INSTRUMENTATION AND CONTROL SYSTEMS (ISNPP 2019), 2020, 595 : 30 - 37
  • [43] Low-altitude Wind Shear Wind Speed Estimation Method Based onGAMP-STAP in Complex Terrain Environment br
    Li, Hai
    Xie, Ruijie
    Xie, Lingli
    Meng, Fanwang
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (02) : 576 - 584
  • [44] Wind Speed Estimation Method of Low-altitude Wind Shear Based on CL-KA-STAP in Complex Terrain Environment
    Li Hai
    Xie Lingli
    Zhang Zhining
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (12) : 3647 - 3655
  • [45] Sensor cooperative scheduling method for low-altitude maneuvering target tracking in complex environment
    Zhang, Yunpu
    Xu, Gongguo
    Shan, Ganlin
    SENSOR REVIEW, 2022, 42 (01) : 133 - 144
  • [46] Experimental study on low-altitude UAV-to-ground propagation characteristics in campus environment
    Zhang, Yuandi
    Lu, Jiawangnan
    Zhang, Hongtao
    Huang, Ziyi
    Briso-Rodriguez, Cesar
    Zhang, Lei
    COMPUTER NETWORKS, 2023, 237
  • [47] Statistical Channel Modeling for Low-Altitude UAVs in an Urban Tree-Covered Environment
    Moraitis, Nektarios
    Psychogios, Konstantinos
    Panagopoulos, Athanasios D.
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (12) : 2198 - 2202
  • [48] Visual Attention Accelerated Vehicle Detection in Low-Altitude Airborne Video of Urban Environment
    Cao, Xianbin
    Lin, Renjun
    Yan, Pingkun
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (03) : 366 - 378
  • [49] Vehicle Detection and Motion Analysis in Low-Altitude Airborne Video Under Urban Environment
    Cao, Xianbin
    Wu, Changxia
    Lan, Jinhe
    Yan, Pingkun
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (10) : 1522 - 1533
  • [50] Estimation-based missile threat detection and evasion maneuver for a low-altitude aircraft
    Tian, Zijiao
    Danino, Meir
    Bar-Shalom, Yaakov
    Milgrom, Benny
    SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXXII, 2023, 12547