Constrained clustering with weak label prior

被引:3
|
作者
Zhang, Jing [1 ]
Fan, Ruidong [1 ]
Tao, Hong [1 ]
Jiang, Jiacheng [1 ]
Hou, Chenping [1 ]
机构
[1] Natl Univ Def Technol, Coll Sci, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
clustering; weak label prior; cluster ratio; pairwise constraints; ALGORITHM; GRAPH;
D O I
10.1007/s11704-023-3355-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is widely exploited in data mining. It has been proved that embedding weak label prior into clustering is effective to promote its performance. Previous researches mainly focus on only one type of prior. However, in many real scenarios, two kinds of weak label prior information, e.g., pairwise constraints and cluster ratio, are easily obtained or already available. How to incorporate them to improve clustering performance is important but rarely studied. We propose a novel constrained Clustering with Weak Label Prior method (CWLP), which is an integrated framework. Within the unified spectral clustering model, the pairwise constraints are employed as a regularizer in spectral embedding and label proportion is added as a constraint in spectral rotation. To approximate a variant of the embedding matrix more precisely, we replace a cluster indicator matrix with its scaled version. Instead of fixing an initial similarity matrix, we propose a new similarity matrix that is more suitable for deriving clustering results. Except for the theoretical convergence and computational complexity analyses, we validate the effectiveness of CWLP through several benchmark datasets, together with its ability to discriminate suspected breast cancer patients from healthy controls. The experimental evaluation illustrates the superiority of our proposed approach.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] A new model for constrained clustering
    Un nouveau modèle pour la classification non supervisée sous contraintes
    1600, Lavoisier (28):
  • [42] Active Sampling for Constrained Clustering
    Okabe, Masayuki
    Yamada, Seiji
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2014, 18 (02) : 232 - 238
  • [43] Doubly Constrained Fair Clustering
    Dickerson, John
    Esmaeili, Seyed A.
    Morgenstern, Jamie
    Zhang, Claire Jie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [44] Constrained Mean Shift Clustering
    Schier, Maximilian
    Reinders, Christoph
    Rosenhahn, Bodo
    PROCEEDINGS OF THE 2022 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2022, : 235 - 243
  • [45] Partition Level Constrained Clustering
    Liu, Hongfu
    Tao, Zhiqiang
    Fu, Yun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (10) : 2469 - 2483
  • [46] Clustering Analysis of Unlabeled Data and Weak-Label Detection Analysis Method Integrating Soft Computing Technology
    Liang, Chunhua
    IEEE ACCESS, 2024, 12 : 6852 - 6863
  • [47] Constrained Clustering and Its Application to Face Clustering in Videos
    Wu, Baoyuan
    Zhang, Yifan
    Hu, Bao-Gang
    Ji, Qiang
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3507 - 3514
  • [48] Label Enhancement for Label Distribution Learning via Prior Knowledge
    Gao, Yongbiao
    Zhang, Yu
    Geng, Xin
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3223 - 3229
  • [49] Label Image Constrained Multiatlas Selection
    Yan, Pingkun
    Cao, Yihui
    Yuan, Yuan
    Turkbey, Baris
    Choyke, Peter L.
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (06) : 1158 - 1168
  • [50] Label Constrained Shortest Path Estimation
    Likhyani, Ankita
    Bedathur, Srikanta
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1177 - 1180