Recognition of apple targets before fruits thinning by robot based on R-FCN deep convolution neural network

被引:0
|
作者
Wang D. [1 ,2 ,3 ]
He D. [1 ,2 ,3 ]
机构
[1] College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling
[2] Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture and Rural Affairs, Yangling
[3] Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Service, Yangling
关键词
Algorithms; Deep learning; Image processing; Image recognition; R-FCN; Small apple; Target recognition;
D O I
10.11975/j.issn.1002-6819.2019.03.020
中图分类号
学科分类号
摘要
Before fruit thinning, factors such as complex background, various illumination conditions, foliage occlusion, fruit clustering, especially the extreme similarities between apples and background, made the recognition of small apple targets very difficult. To solve these problems, a recognition method based on region-based fully convolutional network (R-FCN) was proposed. Firstly, deep convolution neural network including ResNet-50 based R-FCN and ResNet-101 based R-FCN were studied and analyzed. After analyzing the framework of the 2 networks, it was obviously that the difference between these 2 networks was the 'conv4' block. The 'conv4' block of ResNet-101 based R-FCN was 51 more layers than that of ResNet-50 based R-FCN, but the recognition accuracy of the 2 networks was almost the same. By comparing the framework and recognition result of ResNet-50 based R-FCN and ResNet-101 based R-FCN, A R-FCN based on ResNet-44 was designed to improve the recognition accuracy and simplify the network. The main operation to simplify the network was to simplify the 'conv4' block, and the 'conv4' block of ResNet-44 based R-FCN was 6 layers less than that of ResNet-50 based R-FCN. The ResNet-44 based R-FCN consisted of ResNet-44 fully convolutional network, region proposal network (RPN) and region of interest (RoI) sub-network. ResNet-44 fully convolutional network, the backbone network of R-FCN, was used to extract features of image. The features were then used by RPN to generate RoIs. After that, the features extracted by ResNet-44 fully convolutional network and RoIs generated by RPN were used by RoI sub-network to recognize and locate small apple targets. A total of 3 165 images were captured in an experimental apple orchard in College of Horticulture, Northwest A&F University, in City of Yangling, China. After image resizing and manual annotation, 332 images, including 85 images captured under sunny direct sunlight condition, 88 images captured under sunny backlight condition, 86 images captured under cloudy direct sunlight condition, 74 images captured under cloudy backlight condition, were selected as test set, and the other 2 833 images were used to train and optimize the network. To enrich image training set, data augment, including brightness enhancement and reduction, chroma enhancement and reduction, contrast enhancement and reduction, sharpness enhancement and reduction, and adding Gaussian noise, was performed, then a total of 28 330 images were obtained with 23 591 images randomly selected as training set, and the other 4 739 images as validation set. After training, the simplified ResNet-44 based R-FCN was tested on the test set, and the experimental results indicated that the method could effectively apply to images captured under different illumination conditions. The method could recognize clustering apples, occluded apples, vague apples and apples with shadows, strong illumination and weak illumination on the surface. In addition, apples divided into parts by branched or petiole cloud also be recognized effectively. Overall, the recognition recall rate could achieve 85.7%. The recognition accuracy and false recognition rate were 95.2% and 4.9%, respectively. The average recognition time was 0.187 s per image. To further test the performance of the proposed method, the other 3 methods were compared, including Faster R-CNN, ResNet-50 based R-FCN and ResNet-101 based R-FCN. The F1 of the proposed method was increased by 16.4, 0.7 and 0.7 percentage points, respectively. The average running time of the proposed method improved by 0.010 and 0.041 s compared with that of ResNet-50 based R-FCN and ResNet-101 based R-FCN, respectively. The proposed method could achieve the recognition of small apple targets before fruits thinning which could not be realized by traditional methods. It could also be widely applied to the recognition of other small targets whose features are similar to background. © 2019, Editorial Department of the Transactions of the Chinese Society of Agricultural Engineering. All right reserved.
引用
收藏
页码:156 / 163
页数:7
相关论文
共 38 条
  • [1] Gongal A., Amatya S., Karkee M., Et al., Sensors and systems for fruit detection and localization: A review, Computers & Electronics in Agriculture, 116, pp. 8-19, (2015)
  • [2] Jiang G.Q., Zhao C.J., Apple recognition based on machine vision, International Conference on Machine Learning and Cybernetics, pp. 1148-1151, (2012)
  • [3] Lu J., Sang N., Detecting citrus fruits and occlusion recovery under natural illumination conditions, Computers & Electronics in Agriculture, 110, pp. 121-130, (2015)
  • [4] Ji W., Zhao D., Cheng F., Et al., Automatic recognition vision system guided for apple harvesting robot, Computers & Electrical Engineering, 38, 5, pp. 1186-1195, (2012)
  • [5] Rizon M., Yusri N.A.N., Kadir M.F.A., Et al., Determination of mango fruit from binary image using randomized Hough transform, Eighth International Conference on Machine Vision, 9875, 3, pp. 1-5, (2015)
  • [6] Silwal A., Gongal A., Karkee M., Identification of red apples in field environment with over the row machine vision system, Agricultural Engineering International: The CIGR Journal, 16, 4, pp. 66-75, (2014)
  • [7] Rakun J., Stajnko D., Zazula D., Detecting fruits in natural scenes by using spatial-frequency based texture analysis and multiview geometry, Computers & Electronics in Agriculture, 76, 1, pp. 80-88, (2011)
  • [8] Chaivivatrakul S., Dailey M.N., Texture-based fruit detection, Precision Agriculture, 15, 6, pp. 662-683, (2014)
  • [9] Feng J., Wang S., Liu G., Et al., A separating method of adjacent apples based on machine vision and chain code information, International Conference on Computer and Computing Technologies in Agriculture, pp. 258-267, (2012)
  • [10] Arefi A., Motlagh A.M., Mollazade K., Et al., Recognition and localization of ripen tomato based on machine vision, Australian Journal of Crop Science, 5, 10, pp. 1144-1149, (2011)