Constraint-based clustering by fast search and find of density peaks

被引:29
|
作者
Liu, Ruhui [1 ]
Huang, Weiping [1 ]
Fei, Zhengshun [1 ]
Wang, Kai [1 ]
Liang, Jun [1 ]
机构
[1] Zhejiang Univ, State Key Lab Ind Control Technol, Coll Control Sci & Engn, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised learning; Constraint-based clustering; Density-based clustering; Density peak; Clustering center selection; ALGORITHM;
D O I
10.1016/j.neucom.2018.06.058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering by fast search and find of density peaks (CFDP) algorithm first proposed on Science is based on assumptions that the cluster center has the highest density among its neighbors and keeps a distance from other cluster centers. In CFDP algorithm, a local density metric and a minimal distance vector are first calculated for constructing a decision graph to select cluster centers. However, CFDP's performance is quite sensitive to parameter selection and relies on other prior knowledge. To solve the problem, this paper proposed a new clustering algorithm named constraint-based clustering by fast search and find of density peaks (CCFDP). In the proposed algorithm, several potential cluster centers are automatically formed and the structural information from constraints could be made full use of. CCFDP adopts a new method to obtain the density metric and the decision graph. After that, the decision graph is analyzed from different perspectives to help complete the final clustering. CCFDP is a semi-supervised robust clustering algorithm, combining semi-supervised constraints, density clustering and hierarchical clustering. Three synthetic and seven open datasets are used for testing its performance and robustness. The final results show that CCFDP outperforms other well-known constraint-based clustering algorithms. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:223 / 237
页数:15
相关论文
共 50 条
  • [41] Combined Density-based and Constraint-based Algorithm for Clustering
    陈同孝
    陈荣昌
    林志强
    邱永兴
    Journal of DongHua University, 2006, (06) : 36 - 38
  • [42] Grouping Method of Semiconductor Bonding Equipment Based on Clustering by Fast Search and Find of Density Peaks for Dynamic Matching According to Processing Tasks
    Gao, Zhijun
    Si, Wen
    Han, Zhonghua
    Peng, Jiayu
    Qiao, Feng
    PROCESSES, 2019, 7 (09)
  • [43] Constraint-based clustering selection
    Van Craenendonck, Toon
    Blockeel, Hendrik
    MACHINE LEARNING, 2017, 106 (9-10) : 1497 - 1521
  • [44] Constraint-based clustering selection
    Toon Van Craenendonck
    Hendrik Blockeel
    Machine Learning, 2017, 106 : 1497 - 1521
  • [45] Constraint-based query clustering
    Ruiz, Carlos
    Menasalvas, Ernestina
    Spiliopoulou, Myra
    ADVANCES IN INTELLIGENT WEB MASTERING, 2007, 43 : 304 - +
  • [46] Fast Clustering by Affinity Propagation Based on Density Peaks
    Li, Yang
    Guo, Chonghui
    Sun, Leilei
    IEEE ACCESS, 2020, 8 : 138884 - 138897
  • [47] Fast and general density peaks clustering
    Sieranoja, Sami
    Franti, Pasi
    PATTERN RECOGNITION LETTERS, 2019, 128 : 551 - 558
  • [48] Semi-Supervised Density Peaks Clustering Based on Constraint Projection
    Yan, Shan
    Wang, Hongjun
    Li, Tianrui
    Chu, Jielei
    Guo, Jin
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 140 - 147
  • [49] An improved density peaks clustering based on sparrow search algorithm
    Chen, Yaru
    Zhou, Jie
    He, Xingshi
    Luo, Xinglong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08): : 11017 - 11037
  • [50] Constraint-based clustering in large databases
    Tung, AKH
    Han, JW
    Lakshmanan, LVS
    Ng, RT
    DATABASE THEORY - ICDT 2001, PROCEEDINGS, 2001, 1973 : 405 - 419