Weakly supervised semantic segmentation and optimization algorithm based on multi-scale feature model

被引:0
|
作者
Xiong C. [1 ]
Zhi H. [1 ]
机构
[1] Beijing Key Laboratory of Urban Intelligent Control, North China University of Technology, Beijing
来源
关键词
Deep learning; Model integration; Model optimization; Multi-scale feature; Weakly-supervised learning;
D O I
10.11959/j.issn.1000-436x.2019004
中图分类号
学科分类号
摘要
In order to improve the accuracy of weakly-supervised semantic segmentation method, a segmentation and optimization algorithm that combines multi-scale feature was proposed. The new algorithm firstly constructs a multi-scale feature model based on transfer learning algorithm. In addition, a new classifier was introduced for category prediction to reduce the failure of segmentation due to the prediction of target class information errors. Then the designed multi-scale model was fused with the original transfer learning model by different weights to enhance the generalization performance of the model. Finally, the predictions class credibility was added to adjust the credibility of the corresponding class of pixels in the segmentation map, avoiding false positive segmentation regions. The proposed algorithm was tested on the challenging VOC 2012 dataset, the mean intersection-over-union is 58.8% on validation dataset and 57.5% on test dataset. It outperforms the original transfer-learning algorithm by 12.9% and 12.3%. And it performs favorably against other segmentation methods using weakly-supervised information based on category labels as well. © 2019, Editorial Board of Journal on Communications. All right reserved.
引用
收藏
页码:163 / 171
页数:8
相关论文
共 30 条
  • [1] Guan T., Zhou D.X., Liu Y.H., Color optical microscopic cell image segmentation based on color difference vector field, ACTA Optica Sinica, 34, 1, (2014)
  • [2] Sun Y.K., Medical image processing techniques based on optical coherence tomography and their applications, Optics and Precision Engineering, 22, 4, pp. 1086-1104, (2014)
  • [3] Ess A., Mueller T., Grabner H., Et al., Segmentation-based urban traffic scene understanding, British Machine Vision Conference, 84, pp. 1-11, (2009)
  • [4] Wan J., Wang D.Y., Hoi S.C.H., Et al., Deep learning for content-based image retrieval: a comprehensive study, The 22nd ACM international conference on Multimedia, 978, pp. 157-166, (2014)
  • [5] Oberweger M., Wohlhart P., Lepetit V., Hands deep in deep learning for hand pose estimation, Computer Vision Winter Workshop, pp. 21-30, (2015)
  • [6] Xiang S.B., Shu G.D., Ren X.L., Et al., Embedded implementation of real-time finger interaction system,, Optics and Precision Engineering, 19, 8, pp. 1911-1920, (2011)
  • [7] He K., Gkioxari G., Dollar P., Et al., Mask R-CNN, 2017 IEEE International Conference on Computer Vision, 2380, pp. 2980-2988, (2017)
  • [8] Pathak D., Shelhamer E., Long J., Et al., Fully convolutional multi-class multiple instance learning, International Conference on Learning Representations, pp. 1-4, (2015)
  • [9] Pathak D., Krahenbuhl P., Darrell T., Constrained convolutional neural networks for weakly supervised segmentation, IEEE International Conference on Computer Vision, 1550, pp. 1796-1804, (2015)
  • [10] Kwak S., Hong S., Han B., Weakly supervised semantic segmentation using superpixel pooling network, AAAI Conference on Artificial Intelligence, pp. 4111-4117, (2017)