Constrained clustering with weak label prior

被引:3
|
作者
Zhang, Jing [1 ]
Fan, Ruidong [1 ]
Tao, Hong [1 ]
Jiang, Jiacheng [1 ]
Hou, Chenping [1 ]
机构
[1] Natl Univ Def Technol, Coll Sci, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
clustering; weak label prior; cluster ratio; pairwise constraints; ALGORITHM; GRAPH;
D O I
10.1007/s11704-023-3355-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is widely exploited in data mining. It has been proved that embedding weak label prior into clustering is effective to promote its performance. Previous researches mainly focus on only one type of prior. However, in many real scenarios, two kinds of weak label prior information, e.g., pairwise constraints and cluster ratio, are easily obtained or already available. How to incorporate them to improve clustering performance is important but rarely studied. We propose a novel constrained Clustering with Weak Label Prior method (CWLP), which is an integrated framework. Within the unified spectral clustering model, the pairwise constraints are employed as a regularizer in spectral embedding and label proportion is added as a constraint in spectral rotation. To approximate a variant of the embedding matrix more precisely, we replace a cluster indicator matrix with its scaled version. Instead of fixing an initial similarity matrix, we propose a new similarity matrix that is more suitable for deriving clustering results. Except for the theoretical convergence and computational complexity analyses, we validate the effectiveness of CWLP through several benchmark datasets, together with its ability to discriminate suspected breast cancer patients from healthy controls. The experimental evaluation illustrates the superiority of our proposed approach.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] A Markov random field-regulated Pitman-Yor process prior for spatially constrained data clustering
    Chatzis, Sotirios P.
    PATTERN RECOGNITION, 2013, 46 (06) : 1595 - 1603
  • [22] CONSTRAINED CLUSTERING AS AN OPTIMIZATION METHOD
    ROSE, K
    GUREWITZ, E
    FOX, GC
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (08) : 785 - 794
  • [23] Constrained clustering by constraint programming
    Thi-Bich-Hanh Dao
    Khanh-Chuong Duong
    Vrain, Christel
    ARTIFICIAL INTELLIGENCE, 2017, 244 : 70 - 94
  • [24] Constrained Clustering With Imperfect Oracles
    Zhu, Xiatian
    Loy, Chen Change
    Gong, Shaogang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (06) : 1345 - 1357
  • [25] Bayesian contiguity constrained clustering
    Come, Etienne
    STATISTICS AND COMPUTING, 2024, 34 (02)
  • [26] Clustering constrained symbolic data
    de Carvalho, Francisco de A. T.
    Csernel, Marc
    Lechevallier, Yves
    PATTERN RECOGNITION LETTERS, 2009, 30 (11) : 1037 - 1045
  • [27] Clustering with Constrained Similarity Learning
    Okabe, Masayuki
    Yamada, Seiji
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 30 - +
  • [28] A framework for deep constrained clustering
    Zhang, Hongjing
    Zhan, Tianyang
    Basu, Sugato
    Davidson, Ian
    DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 35 (02) : 593 - 620
  • [29] A framework for deep constrained clustering
    Hongjing Zhang
    Tianyang Zhan
    Sugato Basu
    Ian Davidson
    Data Mining and Knowledge Discovery, 2021, 35 : 593 - 620
  • [30] Constrained Locally Weighted Clustering
    Cheng, Hao
    Hua, Kien A.
    Khanh Vu
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 90 - 101