Constrained clustering with weak label prior

Cited by: 3
Authors
Zhang, Jing [1]
Fan, Ruidong [1]
Tao, Hong [1]
Jiang, Jiacheng [1]
Hou, Chenping [1]
Affiliations
[1] Natl Univ Def Technol, Coll Sci, Changsha 410073, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
clustering; weak label prior; cluster ratio; pairwise constraints; ALGORITHM; GRAPH;
DOI
10.1007/s11704-023-3355-7
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Clustering is widely used in data mining. Embedding weak label priors into clustering has been shown to improve its performance. Previous research has mainly focused on a single type of prior. However, in many real scenarios, two kinds of weak label prior information, e.g., pairwise constraints and cluster ratio, are easily obtained or already available. How to incorporate both to improve clustering performance is important but rarely studied. We propose a novel constrained Clustering with Weak Label Prior method (CWLP), which is an integrated framework. Within a unified spectral clustering model, the pairwise constraints are employed as a regularizer in spectral embedding, and the label proportion is added as a constraint in spectral rotation. To approximate a variant of the embedding matrix more precisely, we replace the cluster indicator matrix with its scaled version. Instead of fixing an initial similarity matrix, we propose a new similarity matrix that is more suitable for deriving clustering results. In addition to theoretical convergence and computational complexity analyses, we validate the effectiveness of CWLP on several benchmark datasets and demonstrate its ability to discriminate suspected breast cancer patients from healthy controls. The experimental evaluation illustrates the superiority of the proposed approach.
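To make the two kinds of weak prior concrete, the following is a minimal two-cluster sketch, not the CWLP algorithm itself. It assumes a fixed Gaussian similarity graph (the abstract instead learns the similarity matrix), treats must-link pairs as an added penalty on the spectral embedding, removes the edges of cannot-link pairs, and replaces spectral rotation with a simple ratio-based cut of the Fiedler vector. The function name `constrained_two_way_split` and the parameters `sigma` and `lam` are hypothetical, chosen only for illustration.

```python
import numpy as np

def constrained_two_way_split(X, must_link, cannot_link, ratio, sigma=1.0, lam=1.0):
    """Split the rows of X into clusters 0 and 1.

    ratio is the assumed fraction of points in cluster 0 (the cluster-ratio prior).
    must_link / cannot_link are lists of index pairs (the pairwise-constraint prior).
    """
    n = X.shape[0]
    # Gaussian similarity graph (an assumed choice, not taken from the paper).
    sq_dist = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    W = np.exp(-sq_dist / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    # Must-link as a regularizer: raising W_ij by lam adds lam * (f_i - f_j)^2
    # to the embedding objective f^T L f.
    for i, j in must_link:
        W[i, j] += lam
        W[j, i] += lam
    # Cannot-link: drop the edge so the embedding no longer pulls the pair together.
    for i, j in cannot_link:
        W[i, j] = 0.0
        W[j, i] = 0.0
    # Unnormalized graph Laplacian; its second-smallest eigenvector is the Fiedler vector.
    L = np.diag(W.sum(axis=1)) - W
    _, vecs = np.linalg.eigh(L)
    fiedler = vecs[:, 1]
    # Ratio prior: cluster 0 receives round(ratio * n) points from one end of the
    # sorted Fiedler vector. The eigenvector sign is arbitrary, so both ends are
    # tried and the split with the smaller graph cut is kept.
    n0 = int(round(ratio * n))
    order = np.argsort(fiedler)
    best_labels, best_cut = None, np.inf
    for idx in (order, order[::-1]):
        labels = np.ones(n, dtype=int)
        labels[idx[:n0]] = 0
        cut = W[labels == 0][:, labels == 1].sum()
        if cut < best_cut:
            best_labels, best_cut = labels, cut
    return best_labels
```

Calling `constrained_two_way_split(X, [(0, 1)], [(0, 19)], ratio=0.3)` on a 20-point data matrix `X` would request a 6/14 split in which points 0 and 1 are encouraged to share a cluster. Neither constraint is enforced as a hard condition: must-link pairs are only penalized for separating, and cannot-link pairs merely lose their direct edge.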
Pages: 16