High-precision target ranging in complex orchard scenes by utilizing semantic segmentation results and binocular vision

Cited by: 8
Authors
Wen, Yu [1]
Xue, Jinlin [1]
Sun, Han [1]
Song, Yue [1]
Lv, Pengfei [1]
Liu, Shaohua [1]
Chu, Yangyang [1]
Zhang, Tianyu [1]
Affiliations
[1] Nanjing Agr Univ, Coll Engn, Nanjing 210031, Peoples R China
Keywords
Orchard; Deep learning; Semantic segmentation; Binocular vision; Attention mechanism; Feature fusion; Agriculture
DOI
10.1016/j.compag.2023.108440
Chinese Library Classification (CLC) number
S [Agricultural Sciences]
Discipline classification code
09
Abstract
Orchard automation increasingly relies on robotics, driven by advances in artificial intelligence. However, accurately understanding semantic information and precisely locating targets in orchard environments remain challenging. Current research often relies either on expensive multi-sensor fusion or on vision-only approaches whose segmentation quality is inadequate for perceiving orchard surroundings. To address these issues, this article proposes an approach to target ranging in complex orchard scenes that builds on semantic segmentation results. The article introduces the MsFF-Segformer model, which uses multi-scale feature fusion to generate high-precision semantic segmentation images. The model combines the MiT-B0 encoder, which uses a pure attention mechanism, with the MsFF decoder, designed specifically for multi-scale feature fusion. The MsFF decoder includes the AFAM module to align features of adjacent scales. In addition, a channel attention module and a depthwise separable convolution module are introduced to reduce the model's parameter count and obtain semantically rich feature vectors, improving the segmentation of multi-scale targets in orchards. Building on these segmentation results, the study introduces a method named TPDMR that integrates binocular vision to estimate the distances of objects within orchards. First, the semantic category matrix is matched with the depth information matrix. Next, the array of depth values belonging to the target category is extracted, and invalid depth values are filtered out. Finally, the average depth of the target is calculated.
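The three ranging steps described above (match the category matrix with the depth matrix, extract and filter the target's depth values, average them) can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the authors' TPDMR implementation: the function name `mean_target_depth`, the `min_depth`/`max_depth` validity bounds, and the toy arrays are hypothetical, and "invalid" is assumed here to mean non-finite, non-positive, or out-of-range values.

```python
import numpy as np

def mean_target_depth(labels, depth, target_class, min_depth=0.0, max_depth=np.inf):
    """Average the valid depth values of all pixels labelled target_class.

    labels: 2-D integer array of per-pixel semantic classes (hypothetical layout).
    depth:  2-D float array of per-pixel depth, same shape as labels.
    Returns None when the class has no valid depth values.
    """
    if labels.shape != depth.shape:
        raise ValueError("labels and depth must have the same shape")
    # Step 1: match the category matrix with the depth matrix pixel by pixel.
    target_depths = depth[labels == target_class]
    # Step 2: filter out invalid depth values (assumed definition of "invalid").
    valid = (
        np.isfinite(target_depths)
        & (target_depths > min_depth)
        & (target_depths < max_depth)
    )
    target_depths = target_depths[valid]
    if target_depths.size == 0:
        return None
    # Step 3: the mean of the remaining values is the target distance estimate.
    return float(target_depths.mean())

# Toy 3x3 scene: class 1 has one zero (invalid) depth, class 2 has one NaN.
labels = np.array([[0, 1, 1],
                   [0, 1, 2],
                   [2, 2, 2]])
depth = np.array([[5.0, 2.0, 2.2],
                  [5.1, 0.0, 7.0],
                  [np.nan, 7.2, 6.8]])

for cls in (1, 2):
    print(cls, mean_target_depth(labels, depth, cls))
```

In this toy scene the zero and NaN readings are dropped before averaging, so a single failed stereo match does not distort the estimate for its class.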
Evaluated on a self-built orchard dataset, the MsFF-Segformer model outperforms U-net and other models, achieving a Mean Intersection over Union (MIoU) of 86.52% and a Mean Pixel Accuracy (MPA) of 94.05%. Its parameter count and single-frame prediction time are 15.1 M and 0.019 s, respectively. Relative to the U-net, Deeplabv3+, and Hrnet models, the parameter count is lower by 84.1%, 32.5%, and 5.9%, and the prediction time is lower by 69.4%, 59.7%, and 64.2%, respectively. The TPDMR method is accurate and stable in target ranging, with a ranging error below 6% for all targets. The overall algorithm runtime is approximately 0.8 s, indicating efficient performance.
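For readers unfamiliar with the two segmentation metrics quoted above, the sketch below computes MIoU and MPA from a confusion matrix using their commonly used definitions (per-class IoU = TP / (TP + FP + FN), per-class pixel accuracy = TP / (TP + FN), each averaged over classes). The abstract does not spell out the formulas, so treating these as the paper's exact definitions is an assumption; the function names and the toy label maps are hypothetical.

```python
import numpy as np

def confusion_matrix(y_true, y_pred, num_classes):
    """Rows are ground-truth classes, columns are predicted classes."""
    idx = y_true.ravel() * num_classes + y_pred.ravel()
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)

def miou_and_mpa(cm):
    """Return (MIoU, MPA) from a confusion matrix, ignoring absent classes."""
    tp = np.diag(cm).astype(float)
    gt_pixels = cm.sum(axis=1)        # TP + FN per class
    pred_pixels = cm.sum(axis=0)      # TP + FP per class
    union = gt_pixels + pred_pixels - tp
    with np.errstate(divide="ignore", invalid="ignore"):
        iou = tp / union              # NaN where a class never appears
        pixel_acc = tp / gt_pixels
    return float(np.nanmean(iou)), float(np.nanmean(pixel_acc))

# Toy 2x3 label maps with three classes.
y_true = np.array([[0, 0, 1],
                   [1, 2, 2]])
y_pred = np.array([[0, 1, 1],
                   [1, 2, 0]])

cm = confusion_matrix(y_true, y_pred, num_classes=3)
miou, mpa = miou_and_mpa(cm)
print(f"MIoU={miou:.4f}, MPA={mpa:.4f}")
```

Here the per-class IoUs are 1/3, 2/3, and 1/2, and the per-class pixel accuracies are 1/2, 1, and 1/2, which the function averages over the three classes.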
Pages: 10