Weakly-supervised scene parsing with multiple contextual cues

Cited by: 3
Authors
Li, Teng [1]
Wu, Xinyu [2]
Ni, Bingbing [3]
Lu, Ke [4]
Yan, Shuicheng [5]
Affiliations
[1] Anhui Univ, Coll Elect Engn & Automat, Hefei, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Beijing 100864, Peoples R China
[3] Adv Digital Sci Ctr, Singapore 138632, Singapore
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[5] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117548, Singapore
Keywords
Scene parsing; Weakly-supervised; Multiple context; IMAGE; CLASSIFICATION; KERNELS;
DOI
10.1016/j.ins.2015.06.024
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Scene parsing, the task of fully labeling an image so that each region is assigned a label, is one of the core problems of computer vision. Previous methods usually rely on patch-level models trained on well-labeled data. In this paper, we propose a weakly-supervised scene parsing algorithm that semantically parses a collection of images with image-level multi-labels. The algorithm is guided by top-down category models and by bottom-up local patch contexts across images, exploiting the observation that closely related segments usually have similar labels. Images are segmented into patches at multiple levels, and the contextual relations among patches are discovered via sparse representation by ℓ1 minimization, from which a graph is constructed. The multi-level spatial context of patches is also embedded in the graph, so that image-level labels can be propagated to segments optimally. The contextual patch labeling process is formulated in an optimization framework and solved by a convergent iterative method. The category models are learned from the decomposed label representations of the image set and applied to the segments. The final labeling is obtained by combining all the information at the pixel level. The effectiveness of the proposed method is demonstrated in experiments on two benchmark datasets, and comparisons are reported. (C) 2015 Elsevier Inc. All rights reserved.
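The graph-based part of the abstract (sparse representation of patches, graph construction, iterative propagation of image-level labels) can be illustrated with a minimal sketch. This is not the authors' implementation: the ISTA solver, the symmetrized affinity, the normalized propagation rule, the toy 2-D features, and all names (`sparse_code`, `build_graph`, `propagate`, `lam`, `alpha`) are assumptions chosen for illustration, and the multi-level spatial context and category models are omitted.

```python
import numpy as np

def sparse_code(x, D, lam=0.1, n_iter=500):
    """ISTA for min_a 0.5*||x - D a||^2 + lam*||a||_1 (assumed solver)."""
    step = 1.0 / np.linalg.norm(D, 2) ** 2   # 1 / Lipschitz constant of the gradient
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        z = a - step * (D.T @ (D @ a - x))   # gradient step on the quadratic term
        a = np.sign(z) * np.maximum(np.abs(z) - step * lam, 0.0)  # soft threshold
    return a

def build_graph(X, lam=0.1):
    """Code each patch over all other patches; |coefficients| become edge weights."""
    n = X.shape[0]
    W = np.zeros((n, n))
    for i in range(n):
        others = [j for j in range(n) if j != i]
        W[i, others] = np.abs(sparse_code(X[i], X[others].T, lam))
    return 0.5 * (W + W.T)                   # symmetrize (an assumption)

def propagate(W, Y, alpha=0.9, n_iter=200):
    """Iterate F <- alpha*S*F + (1-alpha)*Y with S the normalized affinity."""
    d = np.maximum(W.sum(axis=1), 1e-12)     # guard against isolated patches
    S = W / np.sqrt(np.outer(d, d))
    F = Y.copy()
    for _ in range(n_iter):
        F = alpha * (S @ F) + (1.0 - alpha) * Y
    return F

# Hypothetical toy data: 6 patches from 3 images, label 0 = "sky", 1 = "grass".
# Image A (patches 0, 1) is tagged {sky, grass}; B (2, 3) {sky}; C (4, 5) {grass}.
X = np.array([[1.0, 0.1], [0.1, 1.0],        # image A: one sky-like, one grass-like
              [1.0, 0.0], [0.9, 0.2],        # image B
              [0.0, 1.0], [0.2, 0.9]])       # image C
X /= np.linalg.norm(X, axis=1, keepdims=True)

# Each patch starts with its image's labels, spread uniformly.
Y = np.array([[0.5, 0.5], [0.5, 0.5],
              [1.0, 0.0], [1.0, 0.0],
              [0.0, 1.0], [0.0, 1.0]])

W = build_graph(X)
F = propagate(W, Y)
labels = F.argmax(axis=1)                    # per-patch label after propagation
print(labels)
```

In this toy setting the two ambiguous patches of image A are resolved by their sparse-coding neighbors in the singly-labeled images B and C, which is the intuition behind propagating image-level labels to segments.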
Pages: 59-72
Number of pages: 14
Related papers
50 in total
  • [41] Weakly-Supervised Neural Text Classification
    Meng, Yu
    Shen, Jiaming
    Zhang, Chao
    Han, Jiawei
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 983 - 992
  • [42] Weakly-Supervised Hashing in Kernel Space
    Mu, Yadong
    Shen, Jialie
    Yan, Shuicheng
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3344 - 3351
  • [43] Weakly-Supervised Evidence Pinpointing and Description
    Zhang, Qiang
    Bhalerao, Abhir
    Hutchinson, Charles
    INFORMATION PROCESSING IN MEDICAL IMAGING (IPMI 2017), 2017, 10265 : 210 - 222
  • [44] SENet for Weakly-Supervised Relation Extraction
    Liu, Jiashu
    Chen, Guang
    Guo, Jun
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 511 - 515
  • [45] Local Boosting for Weakly-Supervised Learning
    Zhang, Rongzhi
    Yu, Yue
    Shen, Jiaming
    Cui, Xiquan
    Zhang, Chao
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3364 - 3375
  • [46] Weakly-Supervised Text Instance Segmentation
    Zu, Xinyan
    Yu, Haiyang
    Li, Bin
    Xue, Xiangyang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1915 - 1923
  • [47] Weakly-Supervised Alignment of Video With Text
    Bojanowski, P.
    Lajugie, R.
    Grave, E.
    Bach, F.
    Laptev, I.
    Ponce, J.
    Schmid, C.
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4462 - 4470
  • [48] A Weakly-Supervised Approach for Semantic Segmentation
    Feng, Yanqing
    Wang, Lunwen
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 2311 - 2314
  • [49] Multimodal Imbalance-Aware Gradient Modulation for Weakly-Supervised Audio-Visual Video Parsing
    Fu, Jie
    Gao, Junyu
    Bao, Bing-Kun
    Xu, Changsheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4843 - 4856
  • [50] Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Losses
    Shi, Jing
    Xu, Jia
    Gong, Boqing
    Xu, Chenliang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10436 - 10444