Deep unsupervised part-whole relational visual saliency

被引：30

作者：

Liu, Yi ^{[1
,2
]}

Dong, Xiaohui ^{[1
,2
]}

Zhang, Dingwen ^{[3
]}

Xu, Shoukun ^{[1
,2
]}

机构：

[1] Changzhou Univ, Aliyun Sch Big Data, Sch Comp Sci & Artificial Intelligence, Changzhou 213000, Jiangsu, Peoples R China

[2] Changzhou Univ, Sch Software, Changzhou 213000, Jiangsu, Peoples R China

[3] Northwestern Polytech Univ, Sch Automat, Xian 710129, Shaanxi, Peoples R China

来源：

NEUROCOMPUTING | 2024年 / 563卷

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Unsupervised salient object detection; Part-object relationship; Consistency-aware fusion strategy; OBJECT DETECTION;

D O I：

10.1016/j.neucom.2023.126916

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep Supervised Salient Object Detection (SSOD) excessively relies on large-scale annotated pixel-level labels which consume intensive labour acquiring high quality labels. In such precondition, deep Unsupervised Salient Object Detection (USOD) draws public attention. Under the framework of the existing deep USOD methods, they mostly generate pseudo labels by fusing several hand-crafted detectors' results. On top of that, a Fully Convolutional Network (FCN) will be trained to detect salient regions separately. While the existing USOD methods have achieved some progress, there are still challenges for them towards satisfactory performance on the complex scene, including (1) poor object wholeness owing to neglecting the hierarchy of those salient regions; (2) unsatisfactory pseudo labels causing by unprimitive fusion of hand-crafted results. To address these issues, in this paper, we introduce the property of part-whole relations endowed by a Belief Capsule Network (BCNet) for deep USOD, which is achieved by a multi-stream capsule routing strategy with a belief score for each stream within the CapsNets architecture. To train BCNet well, we generate high-quality pseudo labels from multiple hand-crafted detectors by developing a consistency-aware fusion strategy. Concretely, a weeding out criterion is first defined to filter out unreliable training samples based on the inter-method consistency among four hand-crafted saliency maps. In the following, a dynamic fusion mechanism is designed to generate high-quality pseudo labels from the remaining samples for BCNet training. Experiments on five public datasets illustrate the superiority of the proposed method. Codes have been released on: https://github.com/Mirlongue/Deep-Unsupervised-Part-Whole-Relational-Visual-Saliency.

引用

页数：13

共 50 条

[31] Part-whole physicalism and mental causation
Ehring, D
SYNTHESE, 2003, 136 (03) : 359 - 388
[32] Part-whole practice of movement sequences
Park, JH
Wilde, H
Shea, CH
JOURNAL OF MOTOR BEHAVIOR, 2004, 36 (01) : 51 - 61
[33] Part-Whole Physicalism and Mental Causation
Douglas Ehring
Synthese, 2003, 136 : 359 - 388
[34] Neural models for part-whole hierarchies
Riesenhuber, M
Dayan, P
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 17 - 23
[35] Representation of part-whole similarity in geology
Sandro Rama Fiorini
Mara Abel
Joel Luis Carbonera
Earth Science Informatics, 2015, 8 : 77 - 94
[36] A PART-WHOLE STRATEGY FOR THE STUDY OF OPINIONS
LEVER, H
SMOOHA, S
PUBLIC OPINION QUARTERLY, 1981, 45 (04) : 560 - 570
[37] DETERMINANTS OF PART-WHOLE PERCEPTION IN CHILDREN
ELKIND, D
ANAGNOSTOPOULOU, R
MALONE, S
CHILD DEVELOPMENT, 1970, 41 (02) : 391 - +
[38] HANDEDNESS AND PART-WHOLE RELATIONSHIPS - REPLICATION
HARDYCK, C
CORTEX, 1977, 13 (02) : 177 - 183
[39] Method for learning part-whole relations
van Hage, Willem Robert
Kolb, Hap
Schreiber, Guus
SEMANTIC WEB - ISEC 2006, PROCEEDINGS, 2006, 4273 : 723 - 735
[40] Separating Inference from Feature Learning in Deep Unsupervised Visual Saliency Estimation
Taille, Bruno
Ortiz, Michael Garcia
2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1195 - 1201

← 1 2 3 4 5 →