On Probabilistic k-Richness of the k-Means Algorithms

Cited by: 2
Authors
Klopotek, Robert A. [1 ]
Klopotek, Mieczyslaw A. [2 ]
Affiliations
[1] Cardinal Stefan Wyszynski Univ Warsaw, Fac Math & Nat Sci, Sch Exact Sci, Warsaw, Poland
[2] Polish Acad Sci, Comp Sci Fundamental Res Inst, Warsaw, Poland
Source
MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE | 2019 / Vol. 11943
Keywords
k-means; k-means++; k-richness; Probabilistic k-richness; Weak probabilistic k-richness
DOI
10.1007/978-3-030-37599-7_22
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Kleinberg's axiomatic system for clustering initiated a discussion about which properties clustering algorithms should and should not have. As Ackerman et al. pointed out, the static properties studied by Kleinberg and others are not appropriate for clustering algorithms with elements of randomness. They therefore introduced the property of probabilistic k-richness and claimed, without proof, that k-means has this property both with random initialisation and with k-means++ initialisation. We prove that k-means++ has the property of probabilistic k-richness, while k-means with random initialisation does not have it for well-separated clusters. To characterize the latter, we introduce the notion of weak probabilistic k-richness and prove it for this algorithm. For completeness, we provide a constructive proof that the theoretical k-means has the (deterministic) k-richness property.
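The contrast the abstract draws between the two initialisations can be illustrated with a minimal sketch. It is not the paper's proof and covers only the seeding stage, not the subsequent k-means iterations. It assumes one-dimensional data with five tight, widely separated clusters; all function names (`make_data`, `uniform_seeds`, `kmeanspp_seeds`, `coverage_rate`) are hypothetical and chosen for this example.

```python
import random

def make_data(rng, centers=(0.0, 1000.0, 2000.0, 3000.0, 4000.0), per_cluster=20):
    """Assumed toy data: tight 1-D Gaussian blobs around far-apart centers."""
    points = [rng.gauss(c, 1.0) for c in centers for _ in range(per_cluster)]
    return points, list(centers)

def uniform_seeds(points, k, rng):
    """Random initialisation: k distinct data points chosen uniformly."""
    return rng.sample(points, k)

def kmeanspp_seeds(points, k, rng):
    """k-means++ seeding: the first seed is uniform; each later seed is drawn
    with probability proportional to its squared distance to the nearest
    seed chosen so far."""
    seeds = [rng.choice(points)]
    while len(seeds) < k:
        d2 = [min((p - s) ** 2 for s in seeds) for p in points]
        seeds.append(rng.choices(points, weights=d2, k=1)[0])
    return seeds

def hits_every_cluster(seeds, centers):
    """True if every true cluster center is the nearest center of some seed."""
    nearest = {min(range(len(centers)), key=lambda i: abs(s - centers[i]))
               for s in seeds}
    return len(nearest) == len(centers)

def coverage_rate(seeder, points, centers, trials, rng):
    """Fraction of trials in which the seeding places one seed per cluster."""
    hits = sum(hits_every_cluster(seeder(points, len(centers), rng), centers)
               for _ in range(trials))
    return hits / trials

if __name__ == "__main__":
    rng = random.Random(0)
    points, centers = make_data(rng)
    print("uniform  :", coverage_rate(uniform_seeds, points, centers, 200, rng))
    print("k-means++:", coverage_rate(kmeanspp_seeds, points, centers, 200, rng))
```

On data like this, uniform seeding often places two seeds in the same cluster, whereas the distance-weighted sampling of k-means++ strongly favours points in clusters that have no seed yet. The sketch only measures how often the seeds cover all clusters; it makes no claim about the formal k-richness definitions used in the paper.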
Pages: 259 - 271
Number of pages: 13
Related papers
50 in total
  • [31] K and starting means for k-means algorithm
    Fahim, Ahmed
    JOURNAL OF COMPUTATIONAL SCIENCE, 2021, 55
  • [32] K-Means Genetic Algorithms with Greedy Genetic Operators
    Kazakovtsev, Lev
    Rozhnov, Ivan
    Shkaberina, Guzel
    Orlov, Viktor
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [33] A Survey on Feature Weighting Based K-Means Algorithms
    de Amorim, Renato Cordeiro
    JOURNAL OF CLASSIFICATION, 2016, 33 (02) : 210 - 242
  • [34] A practical comparison of two K-Means clustering algorithms
    Wilkin, Gregory A.
    Huang, Xiuzhen
    BMC BIOINFORMATICS, 2008, 9 (Suppl 6)
  • [35] A Survey on Feature Weighting Based K-Means Algorithms
    Renato Cordeiro de Amorim
    Journal of Classification, 2016, 33 : 210 - 242
  • [36] An Extensive Empirical Comparison of k-means Initialization Algorithms
    Harris, Simon
    De Amorim, Renato Cordeiro
    IEEE ACCESS, 2022, 10 : 58752 - 58768
  • [37] PREDICTION OF DEMOGRAPHICAL CHARACTERISTICS USING K-MEANS ALGORITHMS
    Sari, Murat
    Tuna, Can
    Demir, Ibrahim
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2020, 38 (02) : 1051 - 1059
  • [38] Optimal Differentially Private Algorithms for k-Means Clustering
    Huang, Zhiyi
    Liu, Jinyan
    PODS'18: PROCEEDINGS OF THE 37TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2018 : 395 - 408
  • [39] Comparative Analysis of K-Means and Traversal Optimisation Algorithms
    Adama, David Ada
    Olatunji, Timilehin Yinka
    Yahaya, Salisu Wada
    Lotfi, Ahmad
    ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, 2022, 1409 : 300 - 311
  • [40] Fair Coresets and Streaming Algorithms for Fair k-means
    Schmidt, Melanie
    Schwiegelshohn, Chris
    Sohler, Christian
    APPROXIMATION AND ONLINE ALGORITHMS (WAOA 2019), 2020, 11926 : 232 - 251