Bunet: An effective and efficient segmentation method based on bilateral encoder-decoder structure for rapid detection of apple tree branches

被引:0
|
作者
Zhang, Shanshan [1 ]
Wan, Hao [1 ]
Fan, Zeming [1 ]
Zeng, Xilei [1 ]
Zhang, Ke [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710100, Shaanxi, Peoples R China
关键词
Fruit harvesting robots; Semantic segmentation; Branch segmentation; U-shape structure; Two-pathway backbone; AGRICULTURAL ROBOTS; OPERATIONS;
D O I
10.1007/s10489-023-04742-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic apple harvesting robots have received much research attention in recent years to lower harvesting costs. A fundamental problem for harvesting robots is how to quickly and accurately detect branches to avoid collisions with limited hardware resources. In this paper, we propose a lightweight, high-accurate and real-time semantic segmentation network, Bilateral U-shape Network (BUNet), to segment apple tree branches. The BUNet consists mainly of a U-shaped detail branch and a U-shaped semantic branch, the former for capturing spatial details and the latter for supplementing semantic information. These two U-shape branches complement each other, keeping the high accuracy of the Encoder-decoder Backbone while maintaining the efficiency and effectiveness of the Two-pathway Backbone. In addition, a Simplified Attention Fusion Module (SAFM) is proposed to effectively fuse different levels of information from two branches for pixel-by-pixel prediction. Experimental results show (on our own constructed dataset) that BUNet achieves the highest Intersection over Union (IoU) and F1-score of 75.96% and 86.34%, respectively, with minimum parameters of 0.93M and 11.94G Floating-point of Operations (FLOPs) in branch segmentation. Meanwhile, BUNet achieves a speed of 110.32 Frames Per Second (FPS) with input image size of 1280x720 pixels. These results confirm that the proposed method can effectively detect the branches and can, therefore, be used to plan an obstacle avoidance path for harvesting robots.
引用
收藏
页码:23336 / 23348
页数:13
相关论文
共 34 条
  • [11] DeepLab-Rail: semantic segmentation network for railway scenes based on encoder-decoder structure
    Zeng, Qingsong
    Zhang, Linxuan
    Wang, Yuan
    Luo, Xiaolong
    Chen, Yannan
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (04)
  • [12] A Semantic Segmentation Method for High-resolution Remote Sensing Images Based on Encoder-Decoder
    Yang, Jingyu
    Zhao, Liang
    Dang, Jianwu
    Wang, Yangping
    Yue, Biao
    Gu, Zongliang
    2022 TENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, CBD, 2022, : 98 - 103
  • [13] Real-time semantic segmentation of microvascular decompression images based on encoder-decoder structure
    Bai Rui-feng
    Jiang Shan
    Sun Hai-jiang
    Liu Xin-rui
    CHINESE OPTICS, 2022, 15 (05) : 1055 - 1065
  • [14] Accurate and Efficient Event-Based Semantic Segmentation Using Adaptive Spiking Encoder-Decoder Network
    Zhang, Rui
    Leng, Luziwei
    Che, Kaiwei
    Zhang, Hu
    Cheng, Jie
    Guo, Qinghai
    Liao, Jianxing
    Cheng, Ran
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [15] An Effective Classification Method for Hyperspectral Image With Very High Resolution Based on Encoder-Decoder Architecture
    Zhang, Zhen
    Jiang, Tao
    Liu, Chenxi
    Zhang, Linjing
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 1509 - 1519
  • [16] An Effective Lightweight Crowd Counting Method Based on an Encoder-Decoder Network for Internet of Video Things
    Yi, Jun
    Chen, Fan
    Shen, Zhilong
    Xiang, Yi
    Xiao, Shan
    Zhou, Wei
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02) : 3082 - 3094
  • [17] A grid-level segmentation model based on encoder-decoder structure with multi-source features for crop lodging detection
    Wang, Lihui
    Xiao, Huidi
    APPLIED SOFT COMPUTING, 2024, 151
  • [18] An Improved Encoder-Decoder Network Based on Strip Pool Method Applied to Segmentation of Farmland Vacancy Field
    Zhang, Xixin
    Yang, Yuhang
    Li, Zhiyong
    Ning, Xin
    Qin, Yilang
    Cai, Weiwei
    ENTROPY, 2021, 23 (04)
  • [19] Cloud Detection Method Based on Spatial-Spectral Features and Encoder-Decoder Feature Fusion
    Zhang, Jing
    Shi, Xinlong
    Wu, Jun
    Song, Liangnong
    Li, Yunsong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 15
  • [20] Breathing Sound Segmentation and Detection Using Transfer Learning Techniques on an Attention-Based Encoder-Decoder Architecture
    Hsiao, Chiu-Han
    Lin, Ting-Wei
    Lin, Chii-Wann
    Hsu, Fu-Shun
    Lin, Frank Yeong-Sung
    Chen, Chung-Wei
    Chung, Chi-Ming
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 754 - 759