Spatial-channel transformer network based on mask-RCNN for efficient mushroom instance segmentation

被引:0
|
作者
Wang, Jiaoling [1 ,2 ,4 ]
Song, Weidong [2 ]
Zheng, Wengang [3 ]
Feng, Qingchun [3 ]
Wang, Mingfei [3 ]
Zhao, Chunjiang [1 ,3 ]
机构
[1] Northwest Agr & Forestry Univ, Xian 712199, Peoples R China
[2] Minist Agr & Rural Affairs, Nanjing Inst Agr Mechanizat, Nanjing 210014, Peoples R China
[3] Beijing Acad Agr & Forestry Sci, Intelligent Equipment Technol Res Ctr, Beijing 100097, Peoples R China
[4] Zhejiang Univ, Coll Biosyst Engn & Food Sci, Zhejiang Prov Key Lab Agr Intelligent Equipment &, Hangzhou 310058, Peoples R China
关键词
edible mushrooms; picking; instance segmentation; deep learning; algorithm; WHEAT FIELDS; RECOGNITION;
D O I
10.25165/j.ijabe.20241704.8987
中图分类号
S2 [农业工程];
学科分类号
0828 ;
摘要
Edible mushrooms are rich in nutrients; however, harvesting mainly relies on manual labor. Coarse localization of each mushroom is necessary to enable a robotic arm to accurately pick edible mushrooms. Previous studies used detection algorithms that did not consider mushroom pixel-level information. When these algorithms are combined with a depth map, the information is lost. Moreover, in instance segmentation algorithms, convolutional neural network (CNN)-based methods are lightweight, and the extracted features are not correlated. To guarantee real-time location detection and improve the accuracy of mushroom segmentation, this study proposed a new spatial-channel transformer network model based on Mask-CNN (SCTMask-RCNN). The fusion of Mask-RCNN with the self-attention mechanism extracts the global correlation outcomes of image features from the channel and spatial dimensions. Subsequently, Mask-RCNN was used to maintain a lightweight structure and extract local features using a spatial pooling pyramidal structure to achieve multiscale local feature fusion and improve detection accuracy. The results showed that the SCT-Mask-RCNN method achieved a segmentation accuracy of 0.750 on segm_Precision_mAP and detection accuracy of 0.638 on Bbox_Precision_mAP. Compared to existing methods, the proposed method improved the accuracy of the evaluation metrics Bbox_Precision_mAP and segm_Precision_mAP by over 2% and 5%, respectively.
引用
收藏
页码:227 / 235
页数:9
相关论文
共 50 条
  • [1] Instance Segmentation of Concrete Defects Based on Improved Mask-RCNN
    Huang C.
    Xie X.
    Zhou Y.
    Li G.
    Bridge Construction, 2023, 53 (06) : 63 - 70
  • [2] Infrared Object Image Instance Segmentation based on Improved Mask-RCNN
    Nan Jing
    Bo Lei
    OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY VI, 2019, 11187
  • [3] Instance-Based Segmentation for Boundary Detection of Neuropathic Ulcers Through Mask-RCNN
    Gamage, H. V. L. C.
    Wijesinghe, W. O. K. I. S.
    Perera, Indika
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 511 - 522
  • [4] Intervertebral disc instance segmentation using a multistage optimization mask-RCNN (MOM-RCNN)
    Vania, Malinda
    Lee, Deukhee
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2021, 8 (04) : 1023 - 1036
  • [5] Instance Segmentation for Large, Multi-Channel Remote Sensing Imagery Using Mask-RCNN and a Mosaicking Approach
    Carvalho, Osmar Luiz Ferreira de
    de Carvalho Junior, Osmar Abilio
    Albuquerque, Anesmar Olino de
    Bem, Pablo Pozzobon de
    Silva, Cristiano Rosa
    Ferreira, Pedro Henrique Guimaraes
    Moura, Rebeca dos Santos de
    Gomes, Roberto Arnaldo Trancoso
    Guimaraes, Renato Fontes
    Borges, Dibio Leandro
    REMOTE SENSING, 2021, 13 (01) : 1 - 24
  • [6] Mask-RCNN with spatial attention for pedestrian segmentation in cyber–physical systems
    Yuan, Lin
    Qiu, Zhao
    Computer Communications, 2021, 180 : 109 - 114
  • [7] Crop disease identification segmentation algorithm based on Mask-RCNN
    Bondre, Shweta
    Patil, Dipti
    AGRONOMY JOURNAL, 2024, 116 (03) : 1088 - 1098
  • [8] Mask-RCNN with spatial attention for pedestrian segmentation in cyber-physical systems
    Yuan, Lin
    Qiu, Zhao
    COMPUTER COMMUNICATIONS, 2021, 180 : 109 - 114
  • [9] TSCA-Net: Transformer based spatial-channel attention segmentation network for medical images
    Fu, Yinghua
    Liu, Junfeng
    Shi, Jun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 170
  • [10] TSCA-Net: Transformer based spatial-channel attention segmentation network for medical images
    Fu, Yinghua
    Liu, Junfeng
    Shi, Jun
    Computers in Biology and Medicine, 2024, 170