Classification and segmentation of defect carrots using deep multi-branch models fusion network

被引:2
|
作者
Xie W. [1 ]
Wei S. [1 ]
Zheng Z. [1 ]
Yang G. [1 ]
Ding X. [1 ]
Yang D. [1 ]
机构
[1] College of Engineering, China Agricultural University, Beijing
关键词
Carrot; Classification; Deep learning; External quality; Fusion model; Image segmentation;
D O I
10.11975/j.issn.1002-6819.2021.2.021
中图分类号
学科分类号
摘要
Detection of carrot defects plays an important role in the sale of carrots. The segmentation and extraction of carrot crack regions have become necessary to automatically evaluate the crack degree of carrots, and further trim the area of the crack. In the traditional detection of carrot external quality, different image processing was designed using the features of different defects, showing high complexity while low robustness. In this study, a deep multi-branch models fusion network (CS-net) was proposed to integrate the recognition of carrot defects and segmentation of crack regions. The network contained two parts: the classification of carrot defects (C-Net), and segmentation extraction of carrot crack regions (S-Net). In C-Net, the ResNet-50 pre-trained on the ImageNet dataset was taken as an image feature extractor of carrot. The output features in the 1st, 10th, 22nd, 40th and 49th layers of ResNet-50 were processed by different pooling methods, including Average Pooling (AVP), Global Average Pooling (GAP), and Spatial Pyramid Pooling (SPP), as well as dimension reduction (principal component analysis, ReliefF). The extracted features were then used as input of Support Vector Machines (SVM) to obtain five classification models. Besides, the five classification models were ensemble with different fusion strategies (hard voting, soft voting and stacking) to obtain the final classification model. In S-Net, the pre-trained ResNet-50 was served as the encoder of segmentation network, and then the network decoder was designed to build the segmentation network of carrot crack regions. The results showed that the output features in the 49th layer of the ResNet-50 with SVM model performed best with the test accuracy of 94.71% among the single model. The fusion model with the stacking ensemble performed best with the accuracy of 98.40%, indicating a better performance in the fusion model than the single model. Different pooling methods had different effects on the performance of the model. In the low-level feature maps, the order of performance for different pooling methods was SPP > AVP > GAP. However, the pooling methods had little impact on the model performance with the high-level semantic features. It was found that dimensionality reduction reduced the number of features and then improved the performance of the model. In the segmentation part, the constructed segmentation network with the U-net construction ideas (Res-U-net) performed best with the Pixel Accuracy (PA), Mean Pixel Accuracy (MPA) and mean intersection over union (MIoU) of 98.31%, 96.05% and 92.81%, respectively. The performance of Res-U-net was not affected by the cracking area and different positions of crack. Comparing with Deeplabv3+, the PA and the MIoU in the Res-U-net were similar to those of Deeplabv3+, while the MPA was better than that of Deeplabv3+, and the model size was only half of that of Deeplabv3+. In addition, the segmentation speed of single image was faster than that of Deeplabv3+. The Res-U-net reached an advanced level in the segmentation task of carrot crack defects. The defect recognition and segmentation network have a positive significance on the quantitative evaluation of carrot external quality and the automatic trim of carrot crack. © 2021, Editorial Department of the Transactions of the Chinese Society of Agricultural Engineering. All right reserved.
引用
收藏
页码:177 / 186
页数:9
相关论文
共 32 条
  • [1] 联合国粮食及农业组织, (2020)
  • [2] Hameed K, Chai D, Rassau A., A comprehensive review of fruit and vegetable classification techniques, Image and Vision Computing, 80, pp. 24-44, (2018)
  • [3] Batchelor M M, Searcy S W., Computer vision determination of the stem/root joint on processing carrots, Journal of Agricultural Engineering Research, 43, pp. 259-269, (1989)
  • [4] Howarth M S., Reflectance characteristics of fresh-market carrots, Transactions of the ASAE, 33, 3, pp. 961-964, (1990)
  • [5] Howarth M S, Searcy S W., Fresh market carrot inspection by machine vision, Optics in Agriculture International Society for Optics and Photonics, 1379, pp. 141-150, (1991)
  • [6] Howarth M S, Brandon J R, Searcy S W, Et al., Estimation of tip shape for carrot classification by machine vision, Journal of Agricultural Engineering Research, 53, pp. 123-139, (1992)
  • [7] Hahn F, Sanchez S., Carrot volume evaluation using imaging algorithms, Journal of Agricultural Engineering Research, 75, 3, pp. 243-249, (2000)
  • [8] Han Zhongzhi, Deng Limiao, Xu Yan, Et al., Image processing method for detection of carrot green-shoulder, fibrous roots and surface cracks, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 29, 9, pp. 156-161, (2013)
  • [9] Deng L, Du H, Han Z., A carrot sorting system using machine vision technique, Applied Engineering in Agriculture, 33, 2, pp. 149-156, (2017)
  • [10] Xie W, Wang F, Yang D., Research on carrot surface defect detection methods based on machine vision, IFAC-PapersOnLine, 52, 30, pp. 24-29, (2019)