Multi-layers context convolutional neural network for object detection

被引:0
|
作者
Wang H. [1 ]
Shan W. [1 ]
Fang B. [1 ]
机构
[1] School of Computer Science and Information Engineering, He-fei University of Technology, Hefei
基金
中国国家自然科学基金;
关键词
Feature Fusion; Multi-layers Context Information(MLC); Object Detection; Region Proposal Network(RPN);
D O I
10.16451/j.cnki.issn1003-6059.202002003
中图分类号
学科分类号
摘要
Insufficient feature information in object detection results in low accuracy of small targets and occluded targets detection. Therefore, multi-layers context convolutional neural network(MLC-CNN) is proposed, and contextual information of multiple layers is extracted to combine local features of objects in object detection. MLC-CNN consists of region proposal network(RPN) sub-network and multi-layers context(MLC) sub-network. RPN sub-network is employed to capture feature vectors with the fixed length as object features, and MLC is employed to obtain the corresponding contextual information of the different feature maps. Finally, two kinds of information are fused. In addition, hard example training is employed to solve the problem of imbalance data. Experiments on PASCAL VOC2007 and PASCAL VOC2012 datasets indicate that mean average precision(mAP) value is improved. © 2020, Science Press. All right reserved.
引用
收藏
页码:113 / 120
页数:7
相关论文
共 26 条
  • [1] Fang B.F., Fang L., Concise Feature Pyramid Region Proposal Network for Multi-scale Object Detection, The Journal of Supercomputing, (2018)
  • [2] Li Q.Z., Li Y.B., Niu J., Real-Time Detection of Underwater Fish Based on Improved YOLO and Transfer Learning, Pattern Recognition and Artificial Intelligence, 32, 3, pp. 193-203, (2019)
  • [3] Girshick R., Donahue J., Darrell T., Et al., Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
  • [4] Girshick R., Fast R-CNN, Proc of the IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)
  • [5] Ren S.Q., He K.M., Girshick R., Et al., Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)
  • [6] Dai J.F., Li Y., He K.M., Et al., R-FCN: Object Detection via Region-Based Fully Convolutional Networks, Proc of the 30th International Conference on Neural Information Processing Systems, pp. 379-387, (2016)
  • [7] Redmon J., Divvala S., Girshick R., Et al., You Only Look Once: Unified, Real-Time Object Detection, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)
  • [8] Redmon J., Farhadi A., YOLO9000: Better, Faster, Stronger, Proc of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517-6525, (2017)
  • [9] Liu W., Anguelov D., Erhan D., Et al., SSD: Single Shot Multibox Detector, Proc of the European Conference on Computer Vision, pp. 21-37, (2016)
  • [10] Liu L., Ouyang W.L., Wang X.G., Et al., Deep Learning for Generic Object Detection: A Survey[J/OL]