Multi-model Integrated Weakly Supervised Semantic Segmentation Method

被引:0
|
作者
Xiong C. [1 ]
Zhi H. [1 ]
机构
[1] Beijing Key Laboratory of Urban Intelligent Control Technology, North China University of Technology, Beijing
关键词
Model integration; Semantic segmentation; Transfer learning; Weakly-supervised learning;
D O I
10.3724/SP.J.1089.2019.17379
中图分类号
学科分类号
摘要
In order to reduce the impact of loss of spatial information generated by pooling operator and improve the performance of transfer learning for weakly-supervised semantic segmentation algorithm with deep convolutional neural network, this paper designs a weakly-supervised image semantic segmentation algorithm based on multi-model ensemble. Based on transfer learning algorithm, the method firstly utilizes the semantic features from last convolutional layer of a multi-scale image and the convolutional features from the middle and deep layers of a single-scale image to respectively train two different homogeneous segmentation models. And then these models are weighted integrating with the original transfer-learning model to get the final segmentation model. In addition, the algorithm combines the confidence of categories to adjust the pixels' confidence expecting to suppress the false positive regions in the segmented image to improve the accuracy. Finally, the proposed algorithm is tested in challenging VOC2012 dataset. The results show that the mean intersection-over-union of the proposed algorithm is 55.3% on validation dataset and 56.9% on test set, outperforming the original transfer-learning algorithm by 6.1% and 11.1%, respectively. And the method performs favorably against other segmentation methods using weakly-supervised information based on class labels as well. © 2019, Beijing China Science Journal Publishing Co. Ltd. All right reserved.
引用
收藏
页码:800 / 807
页数:7
相关论文
共 23 条
  • [1] Chen L.C., Papandreou G., Kokkinos I., Et al., DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs, IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 4, pp. 834-848, (2018)
  • [2] Everingham M., Van Gool L., Williams C.K.I., Et al., The pascal visual object classes (VOC) challenge, International Journal of Computer Vision, 88, 2, pp. 303-338, (2010)
  • [3] Cordts M., Omran M., Ramos S., Et al., The cityscapes dataset for semantic urban scene understanding, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3213-3223, (2016)
  • [4] Lin T.Y., Maire M., Belongie S., Et al., Microsoft COCO: common objects in context, Proceedings of European Conference on Computer Vision, pp. 740-755, (2014)
  • [5] Papandreou G., Chen L.C., Murphy K.P., Et al., Weakly-and semi-supervised learning of a deep convolutional network for semantic image segmentation, Proceedings of the IEEE International Conference on Computer Vision, pp. 1742-1750, (2015)
  • [6] Dai J.F., He K.M., Sun J., BoxSup: exploiting bounding boxes to supervise convolutional networks for semantic segmentation, Proceedings of the IEEE International Conference on Computer Vision, pp. 1635-1643, (2015)
  • [7] Lin D., Dai J.F., Jia J.Y., Et al., ScribbleSup: scribble-supervised convolutional networks for semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3159-3167, (2016)
  • [8] Bearman A., Russakovsky O., Ferrari V., Et al., What's the point: semantic segmentation with point supervision, Proceedings of European Conference on Computer Vision, pp. 549-565, (2016)
  • [9] Souly N., Spampinato C., Shah M., Semi supervised semantic segmentation using generative adversarial network, Proceedings of the IEEE International Conference on Computer Vision, pp. 5689-5697, (2017)
  • [10] Hung W.C., Tsai Y.H., Liou Y.T., Et al., Adversarial learning for semi-supervised semantic segmentation