Weakly-supervised scene parsing with multiple contextual cues

被引:3
|
作者
Li, Teng [1 ]
Wu, Xinyu [2 ]
Ni, Bingbing [3 ]
Lu, Ke [4 ]
Yan, Shuicheng [5 ]
机构
[1] Anhui Univ, Coll Elect Engn & Automat, Hefei, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Beijing 100864, Peoples R China
[3] Adv Digital Sci Ctr, Singapore 138632, Singapore
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[5] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117548, Singapore
关键词
Scene parsing; Weakly-supervised; Multiple context; IMAGE; CLASSIFICATION; KERNELS;
D O I
10.1016/j.ins.2015.06.024
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene parsing, fully labeling an image with each region corresponding to a label, is one of the core problems of computer vision. Previous methods to this problem usually rely on patch-level models trained from well labeled data. In this paper, we propose a weakly-supervised scene parsing algorithm that semantically parses a collection of images with multi-label, which is guided by the top-down category models and bottom-up local patch contexts across images that closely related segments usually have similar labels. Images are segmented to patches on multi-level and the contextual relations of patches are discovered via sparse representation by l(1) minimization, based on which a graph is constructed. The multi-level spatial context of patches is also embedded in the graph, based on which image-level labels can be propagated to segments optimally. The contextual patch labeling process is formulated in an optimization framework and solved by a convergent iterative method. The category models are learned from the decomposed label representations of the image set and applied to the segments. Final labeling is obtained by combining all the information on pixel level. The effectiveness of the proposed method is demonstrated in experiments on two benchmark datasets and comparisons are taken. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:59 / 72
页数:14
相关论文
共 50 条
  • [1] Weakly-Supervised Video Scene Co-parsing
    Zhong, Guangyu
    Tsai, Yi-Hsuan
    Yang, Ming-Hsuan
    COMPUTER VISION - ACCV 2016, PT I, 2017, 10111 : 20 - 36
  • [2] Semantic Graph Construction for Weakly-Supervised Image Parsing
    Xie, Wenxuan
    Peng, Yuxin
    Xiao, Jianguo
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2853 - 2859
  • [3] Weakly-Supervised Semantic Segmentation Using Motion Cues
    Tokmakov, Pavel
    Alahari, Karteek
    Schmid, Cordelia
    COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 388 - 404
  • [4] WEAKLY-SUPERVISED CARICATURE FACE PARSING THROUGH DOMAIN ADAPTATION
    Chu, Wenqing
    Hung, Wei-Chih
    Tsai, Yi-Hsuan
    Cai, Deng
    Yang, Ming-Hsuan
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3282 - 3286
  • [5] Saliency Guided Dictionary Learning for Weakly-Supervised Image Parsing
    Lai, Baisheng
    Gong, Xiaojin
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3630 - 3639
  • [6] A Simple Baseline for Weakly-Supervised Scene Graph Generation
    Shi, Jing
    Zhong, Yiwu
    Xu, Ning
    Li, Yin
    Xu, Chenliang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 16373 - 16382
  • [7] Weakly-supervised region annotation for understanding scene images
    Wang, Hao
    Lu, Tong
    Wang, Yiming
    Shivakumara, Palaiahnakote
    Tan, Chew Lim
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (06) : 3027 - 3051
  • [8] Weakly-supervised region annotation for understanding scene images
    Hao Wang
    Tong Lu
    Yiming Wang
    Palaiahnakote Shivakumara
    Chew Lim Tan
    Multimedia Tools and Applications, 2016, 75 : 3027 - 3051
  • [9] Weakly-supervised image captioning based on rich contextual information
    Hai-Tao Zheng
    Zhe Wang
    Ningning Ma
    Jinyuan Chen
    Xi Xiao
    Arun Kumar Sangaiah
    Multimedia Tools and Applications, 2018, 77 : 18583 - 18599
  • [10] Weakly-supervised image captioning based on rich contextual information
    Zheng, Hai-Tao
    Wang, Zhe
    Ma, Ningning
    Chen, Jinyuan
    Xiao, Xi
    Sangaiah, Arun Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (14) : 18583 - 18599