Flexible density peak clustering for real-world data

Cited by: 2
Authors
Hou, Jian [1]
Lin, Houshen [1]
Yuan, Huaqiang [1]
Pelillo, Marcello [2, 3]
Affiliations
[1] Dongguan Univ Technol, Sch Comp Sci & Technol, Dongguan 523808, Peoples R China
[2] Ca Foscari Univ, DAIS, I-30172 Venice, Italy
[3] Ca Foscari Univ, European Ctr Living Technol, I-30123 Venice, Italy
Funding
National Natural Science Foundation of China
Keywords
Clustering; Density peak; Real-world data; Number of clusters; FAST SEARCH; K-MEANS; FIND;
DOI
10.1016/j.patcog.2024.110772
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In density-based clustering, the density peak algorithm has attracted much attention for its effectiveness and simplicity, and a large number of clustering approaches have been built on it. Some of these works require cluster centers to be selected manually from a decision graph, and this human involvement makes the clustering results uncertain. To avoid human involvement, other algorithms rely on a user-specified number of clusters to determine cluster centers automatically. However, accurately estimating the number of clusters is a well-known, long-standing difficulty in data clustering. In this paper we present a sequential density peak clustering algorithm that extracts clusters one by one, thereby determining the number of clusters automatically while also avoiding manual selection of cluster centers. Starting from a density peak, the algorithm first generates an initial cluster surrounding the peak, and then obtains the final cluster by expanding the initial cluster based on the relative density relationship among neighboring data points. With a peeling-off strategy, all clusters are obtained sequentially. The algorithm works well on clusters with a Gaussian distribution and therefore has potential for clustering real-world data. Experiments on a large number of synthetic and real datasets, together with comparisons against existing algorithms, demonstrate the effectiveness of the proposed algorithm.
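The peeling-off idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not give the density estimate or the expansion rule, so the code below assumes an inverse mean k-nearest-neighbor distance as the density and a simple "expand only to neighbors of equal or lower density" rule, and it merges the initial-cluster and expansion steps into one. The names `sequential_density_peel` and `k` are hypothetical.

```python
import numpy as np

def sequential_density_peel(X, k=5):
    """Illustrative peel-off clustering (assumed rules, not the paper's)."""
    n = len(X)
    # Pairwise Euclidean distances between all points.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    # Indices of the k nearest neighbors (column 0 is normally the point itself).
    nn = np.argsort(dist, axis=1)[:, 1:k + 1]
    knn_dist = np.take_along_axis(dist, nn, axis=1)
    # Assumed density estimate: inverse of the mean k-NN distance.
    rho = 1.0 / (knn_dist.mean(axis=1) + 1e-12)

    labels = np.full(n, -1)
    cluster_id = 0
    while (labels == -1).any():
        # The densest unassigned point is taken as the next density peak.
        unassigned = np.flatnonzero(labels == -1)
        peak = unassigned[np.argmax(rho[unassigned])]
        labels[peak] = cluster_id
        frontier = [peak]
        # Assumed expansion rule: grow to unassigned neighbors whose
        # density does not exceed that of the point they are reached from.
        while frontier:
            p = frontier.pop()
            for q in nn[p]:
                if labels[q] == -1 and rho[q] <= rho[p]:
                    labels[q] = cluster_id
                    frontier.append(q)
        # Peel the finished cluster off and continue with the rest.
        cluster_id += 1
    return labels
```

Because each pass removes one cluster and the loop stops when no points remain, the number of clusters falls out of the procedure rather than being supplied by the user. With this naive rule a single noisy blob may be split into several clusters, which is the kind of problem a more careful expansion criterion would need to address.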
Pages: 13