A Fast and More Accurate Seed-and-Extension Density-Based Clustering Algorithm

被引:0
|
作者
Tung, Ming-Hao [1 ]
Chen, Yi-Ping Phoebe [2 ]
Liu, Chen-Yu [3 ]
Liao, Chung-Shou [4 ]
机构
[1] Micron Technol Inc, Res & Dev, Hsinchu, Taiwan
[2] La Trobe Univ, Dept Comp Sci & Informat Technol, Melbourne, Australia
[3] Natl Tsing Hua Univ, Dept Ind Engn & Engn Management, Hsinchu, Taiwan
[4] Natl Tsing Hua Univ, Ind Engn & Engn Management, Hsinchu, Taiwan
关键词
Clustering algorithms; Heuristic algorithms; Partitioning algorithms; Forestry; Machine learning algorithms; Shape; Numerical models; Center selection; density peaks; seed-and-extension; spanning tree; clustering;
D O I
10.1109/TKDE.2022.3161117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering algorithms have been widely studied in many scientific areas, such as data mining, knowledge discovery, bioinformatics and machine learning. A density-based clustering algorithm, called density peaks (DP), which was proposed by Rodriguez and Laio, outperforms almost all other approaches. Although the DP algorithm performs well in many cases, there is still room for improvement in the precision of its output clusters as well as the quality of the selected centers. In this study, we propose a more accurate clustering algorithm, seed-and-extension-based density peaks (SDP). SDP selects the centers that hold the features of their clusters while building a spanning forest, and meanwhile, constructs the output clusters in a seed-and-extension manner. Experiment results demonstrate the effectiveness of SDP, especially when dealing with clusters with relatively high densities. Precisely, we show that SDP is more accurate than the DP algorithm as well as other state-of-the-art clustering approaches concerning the quality of both output clusters and cluster centers while maintaining similar running time of the DP algorithm, particularly for a variety of time-series data. Moreover, SDP outperforms DP in the dynamic model in which data point insertion and deletion are allowed. From a practical perspective, the proposed SDP algorithm is obviously helpful to many application problems.
引用
收藏
页码:5458 / 5471
页数:14
相关论文
共 50 条
  • [21] A Density-based clustering algorithm suitable to various density dataset
    School of Software, Dalian University of Technology, Dalian 116621, China
    J. Comput. Inf. Syst., 2008, 6 (2473-2481):
  • [22] Video abstraction using density-based clustering algorithm
    Fereshteh Falah Chamasemani
    Lilly Suriani Affendey
    Norwati Mustapha
    Fatimah Khalid
    The Visual Computer, 2018, 34 : 1299 - 1314
  • [23] Video abstraction using density-based clustering algorithm
    Chamasemani, Fereshteh Falah
    Affendey, Lilly Suriani
    Mustapha, Norwati
    Khalid, Fatimah
    VISUAL COMPUTER, 2018, 34 (10): : 1299 - 1314
  • [24] An Improved BAT Algorithm Using Density-Based Clustering
    Al-Asadi, Samraa Adnan
    Al-Mamory, Safaa O.
    INTELIGENCIA ARTIFICIAL-IBEROAMERICAL JOURNAL OF ARTIFICIAL INTELLIGENCE, 2023, 26 (72): : 102 - 123
  • [25] A GPU-Accelerated Density-Based Clustering Algorithm
    Loh, Woong-Kee
    Kim, Young-Kuk
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 775 - 776
  • [26] A density-based clustering algorithm for the CYGNO data analysis
    Baracchini, E.
    Benussi, L.
    Bianco, S.
    Capoccia, C.
    Caponero, M.
    Cavoto, G.
    Cortez, A.
    Costa, I. A.
    Di Marco, E.
    D'Imperio, G.
    Dho, G.
    Lacoangeli, F.
    Maccarrone, G.
    Marafini, M.
    Mazzitelli, G.
    Messina, A.
    Nobrega, R. A.
    Orlandi, A.
    Paoletti, E.
    Passamonti, L.
    Petrucci, F.
    Piccolo, D.
    Pierluigi, D.
    Pinci, D.
    Renga, F.
    Rosatelli, F.
    Russo, A.
    Saviano, G.
    Tesauroc, R.
    Tomassini, S.
    JOURNAL OF INSTRUMENTATION, 2020, 15 (12)
  • [27] Optimal choice of parameters for a density-based clustering algorithm
    Gan, WY
    Li, DY
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, 2003, 2639 : 603 - 606
  • [28] An Efficient Density-based clustering algorithm for face groping
    Pei, Shenfei
    Nie, Feiping
    Wang, Rong
    Li, Xuelong
    NEUROCOMPUTING, 2021, 462 : 331 - 343
  • [29] A density-based evolutionary clustering algorithm for intelligent development
    Xie, Haibin
    Li, Peng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 104
  • [30] Density-based clustering algorithm for mixture data sets
    Huang, De-Cai
    Wu, Tian-Hong
    Kongzhi yu Juece/Control and Decision, 2010, 25 (03): : 416 - 421