Spatial-channel transformer network based on mask-RCNN for efficient mushroom instance segmentation

Cited by: 0
Authors
Wang, Jiaoling [1 ,2 ,4 ]
Song, Weidong [2 ]
Zheng, Wengang [3 ]
Feng, Qingchun [3 ]
Wang, Mingfei [3 ]
Zhao, Chunjiang [1 ,3 ]
Affiliations
[1] Northwest Agr & Forestry Univ, Xian 712199, Peoples R China
[2] Minist Agr & Rural Affairs, Nanjing Inst Agr Mechanizat, Nanjing 210014, Peoples R China
[3] Beijing Acad Agr & Forestry Sci, Intelligent Equipment Technol Res Ctr, Beijing 100097, Peoples R China
[4] Zhejiang Univ, Coll Biosyst Engn & Food Sci, Zhejiang Prov Key Lab Agr Intelligent Equipment &, Hangzhou 310058, Peoples R China
Keywords
edible mushrooms; picking; instance segmentation; deep learning; algorithm; WHEAT FIELDS; RECOGNITION;
DOI
10.25165/j.ijabe.20241704.8987
Chinese Library Classification number
S2 [Agricultural Engineering];
Discipline classification code
0828;
Abstract
Edible mushrooms are rich in nutrients; however, harvesting mainly relies on manual labor. Coarse localization of each mushroom is necessary to enable a robotic arm to accurately pick edible mushrooms. Previous studies used detection algorithms that did not consider mushroom pixel-level information. When these algorithms are combined with a depth map, the information is lost. Moreover, in instance segmentation algorithms, convolutional neural network (CNN)-based methods are lightweight, and the extracted features are not correlated. To guarantee real-time location detection and improve the accuracy of mushroom segmentation, this study proposed a new spatial-channel transformer network model based on Mask-RCNN (SCT-Mask-RCNN). The fusion of Mask-RCNN with the self-attention mechanism extracts the global correlation outcomes of image features from the channel and spatial dimensions. Subsequently, Mask-RCNN was used to maintain a lightweight structure and extract local features using a spatial pooling pyramidal structure to achieve multiscale local feature fusion and improve detection accuracy. The results showed that the SCT-Mask-RCNN method achieved a segmentation accuracy of 0.750 on segm_Precision_mAP and detection accuracy of 0.638 on Bbox_Precision_mAP. Compared to existing methods, the proposed method improved the accuracy of the evaluation metrics Bbox_Precision_mAP and segm_Precision_mAP by over 2% and 5%, respectively.
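The abstract describes self-attention applied along both the spatial and the channel dimensions of a feature map. The following is a minimal NumPy sketch of that general idea only, not the authors' SCT-Mask-RCNN implementation. The function names, the absence of learned query/key/value projections, and the residual sum used to combine the two branches are all assumptions made for illustration; the record does not specify how the branches are fused.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def spatial_self_attention(feat):
    """Attend across positions: each of the H*W pixels is a C-dim token.

    feat has shape (C, H, W). Identity projections are used here
    (hypothetical simplification; a real model would learn Q, K, V).
    """
    c, h, w = feat.shape
    tokens = feat.reshape(c, h * w).T            # (HW, C)
    scores = tokens @ tokens.T / np.sqrt(c)      # (HW, HW) position affinities
    out = softmax(scores, axis=-1) @ tokens      # (HW, C)
    return out.T.reshape(c, h, w)


def channel_self_attention(feat):
    """Attend across channels: each of the C channels is an (H*W)-dim token."""
    c, h, w = feat.shape
    tokens = feat.reshape(c, h * w)              # (C, HW)
    scores = tokens @ tokens.T / np.sqrt(h * w)  # (C, C) channel affinities
    out = softmax(scores, axis=-1) @ tokens      # (C, HW)
    return out.reshape(c, h, w)


def spatial_channel_block(feat):
    """Combine both branches with a residual sum (an assumed fusion rule)."""
    return feat + spatial_self_attention(feat) + channel_self_attention(feat)


rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 4, 4))            # toy feature map: C=8, H=W=4
out = spatial_channel_block(feat)
print(out.shape)
```

The output keeps the input shape, so a block like this could in principle sit between a backbone and the detection and mask heads. Note that the spatial branch builds an (HW x HW) affinity matrix, which grows quadratically with the feature-map resolution.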
Pages: 227-235
Number of pages: 9