Deep unsupervised part-whole relational visual saliency

被引:30
|
作者
Liu, Yi [1 ,2 ]
Dong, Xiaohui [1 ,2 ]
Zhang, Dingwen [3 ]
Xu, Shoukun [1 ,2 ]
机构
[1] Changzhou Univ, Aliyun Sch Big Data, Sch Comp Sci & Artificial Intelligence, Changzhou 213000, Jiangsu, Peoples R China
[2] Changzhou Univ, Sch Software, Changzhou 213000, Jiangsu, Peoples R China
[3] Northwestern Polytech Univ, Sch Automat, Xian 710129, Shaanxi, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Unsupervised salient object detection; Part-object relationship; Consistency-aware fusion strategy; OBJECT DETECTION;
D O I
10.1016/j.neucom.2023.126916
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Supervised Salient Object Detection (SSOD) excessively relies on large-scale annotated pixel-level labels which consume intensive labour acquiring high quality labels. In such precondition, deep Unsupervised Salient Object Detection (USOD) draws public attention. Under the framework of the existing deep USOD methods, they mostly generate pseudo labels by fusing several hand-crafted detectors' results. On top of that, a Fully Convolutional Network (FCN) will be trained to detect salient regions separately. While the existing USOD methods have achieved some progress, there are still challenges for them towards satisfactory performance on the complex scene, including (1) poor object wholeness owing to neglecting the hierarchy of those salient regions; (2) unsatisfactory pseudo labels causing by unprimitive fusion of hand-crafted results. To address these issues, in this paper, we introduce the property of part-whole relations endowed by a Belief Capsule Network (BCNet) for deep USOD, which is achieved by a multi-stream capsule routing strategy with a belief score for each stream within the CapsNets architecture. To train BCNet well, we generate high-quality pseudo labels from multiple hand-crafted detectors by developing a consistency-aware fusion strategy. Concretely, a weeding out criterion is first defined to filter out unreliable training samples based on the inter-method consistency among four hand-crafted saliency maps. In the following, a dynamic fusion mechanism is designed to generate high-quality pseudo labels from the remaining samples for BCNet training. Experiments on five public datasets illustrate the superiority of the proposed method. Codes have been released on: https://github.com/Mirlongue/Deep-Unsupervised-Part-Whole-Relational-Visual-Saliency.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Part-whole physicalism and mental causation
    Ehring, D
    SYNTHESE, 2003, 136 (03) : 359 - 388
  • [32] Part-whole practice of movement sequences
    Park, JH
    Wilde, H
    Shea, CH
    JOURNAL OF MOTOR BEHAVIOR, 2004, 36 (01) : 51 - 61
  • [33] Part-Whole Physicalism and Mental Causation
    Douglas Ehring
    Synthese, 2003, 136 : 359 - 388
  • [34] Neural models for part-whole hierarchies
    Riesenhuber, M
    Dayan, P
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 17 - 23
  • [35] Representation of part-whole similarity in geology
    Sandro Rama Fiorini
    Mara Abel
    Joel Luis Carbonera
    Earth Science Informatics, 2015, 8 : 77 - 94
  • [36] A PART-WHOLE STRATEGY FOR THE STUDY OF OPINIONS
    LEVER, H
    SMOOHA, S
    PUBLIC OPINION QUARTERLY, 1981, 45 (04) : 560 - 570
  • [37] DETERMINANTS OF PART-WHOLE PERCEPTION IN CHILDREN
    ELKIND, D
    ANAGNOSTOPOULOU, R
    MALONE, S
    CHILD DEVELOPMENT, 1970, 41 (02) : 391 - +
  • [38] HANDEDNESS AND PART-WHOLE RELATIONSHIPS - REPLICATION
    HARDYCK, C
    CORTEX, 1977, 13 (02) : 177 - 183
  • [39] Method for learning part-whole relations
    van Hage, Willem Robert
    Kolb, Hap
    Schreiber, Guus
    SEMANTIC WEB - ISEC 2006, PROCEEDINGS, 2006, 4273 : 723 - 735
  • [40] Separating Inference from Feature Learning in Deep Unsupervised Visual Saliency Estimation
    Taille, Bruno
    Ortiz, Michael Garcia
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1195 - 1201