A Review of Unsupervised K-Value Selection Techniques in Clustering Algorithms

被引:1
|
作者
Pegado-Bardayo, Ana [1 ]
Lorenzo-Espejo, Antonio [1 ]
Munuzuri, Jesus [1 ]
Escudero-Santana, Alejandro [1 ]
机构
[1] Univ Seville, Seville, Spain
关键词
clustering; k-means; unsupervised learning; k-value; DATA SET; VALIDATION; NUMBER;
D O I
10.3926/jiem.6791
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Purpose: Automatic grouping of data according to certain characteristics is made possible by clustering algorithms, which makes them an essential tool when working with large datasets. However, although they are unsupervised tools, they generally require the specification of the number of clusters to be formed, k , a task that may be simple for a human, but quite complex to automate. Despite the most commonly used k-value selection techniques offer acceptable results, they are not without shortcomings, suggesting that there is ample room for improvement. This paper briefly introduces clustering techniques, discusses the main shortcomings of conventional k-value selection techniques and examines the advantages and limitations of nine promising alternatives presented in recent years. Design/methodology/approach: An evaluation of the main shortcomings of classic k- value estimation techniques has been carried out, and the newest proposals have been explained and compared. Findings: New k- value estimation indices and methodologies proposed by authors guarantee better results, extending the use of these techniques to large volumes of data, and complex shapes and structures. However, no generical methodology able to overcome all the described shortcomings has still been developed. Research limitations/implications: This research is limited to the newest developed techniques for k- value estimation, including proposals published since 2019. Older proposals have not been considered, as the newest ones overcome the former's shortcomings. A k- value estimation techniques review published in 2019 is cited in the test as a base reference. Practical implications: Although the examples listed in the text apply to industry, the techniques described and discussed in this review are applicable to any area of science that can benefit from the use of clustering techniques. Originality/value: To date, there has been no paper comparing the new k- value estimation techniques. Although there are literature reviews comparing the classical methods, these methods are nowadays nearly obsolete due to the complexity of the data usually faced.
引用
收藏
页码:641 / 649
页数:9
相关论文
共 50 条
  • [31] NEONATAL CHANGES IN K-VALUE IN NEWBORN INFANTS OF DIABETIC MOTHERS
    PEDERSEN, LM
    PEDERSEN, J
    DIABETOLOGIA, 1969, 5 (01) : 51 - &
  • [32] A novel trajectory learning method for robotic arms based on Gaussian Mixture Model and k-value selection algorithm
    Yan, Jingnan
    Wu, Yue
    Ji, Kexin
    Cheng, Cheng
    Zheng, Yili
    PLOS ONE, 2025, 20 (02):
  • [33] Sampling Algorithms for Unsupervised Prototype Selection
    Ortiz-Bejar, Jose
    Solorzano-Rodriguez, Arturo A.
    Silva-Chavez, Juan C.
    Tellez, Eric S.
    Graff, Mario
    Ortiz-Bejar, Jesus
    2022 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC), 2022,
  • [34] On Algorithms Selection for Unsupervised Anomaly Detection
    Zoppi, Tommaso
    Ceccarelli, Andrea
    Bondavalli, Andrea
    2018 IEEE 23RD PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING (PRDC), 2018, : 279 - 288
  • [35] A Comparison of Unsupervised Learning Algorithms for Gesture Clustering
    Ball, Adrian
    Rye, David
    Ramos, Fabio
    Velonaki, Mari
    PROCEEDINGS OF THE 6TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTIONS (HRI 2011), 2011, : 111 - 112
  • [36] Exploration of Feature Engineering Techniques and Unsupervised Machine Learning Clustering Algorithms for Geophysical Data on Levees
    Russo, Brittany M.
    Athanasopoulos-Zekkos, Adda
    GEO-CONGRESS 2024: GEOTECHNICAL DATA ANALYSIS AND COMPUTATION, 2024, 352 : 454 - 463
  • [37] OXYGEN K-VALUE IN RELATION TO RADIATION QUALITY AND MODEL TUMORS
    HENDRY, JH
    GILBERT, CW
    HOWARD, A
    BRITISH JOURNAL OF RADIOLOGY, 1975, 48 (571): : 608 - 608
  • [38] DEFINITION OF EXCHANGEABLE QUANTITIES IN K-VALUE LOGIC AND MAXIMALITY PROBLEM
    HARNAU, W
    ZEITSCHRIFT FUR MATHEMATISCHE LOGIK UND GRUNDLAGEN DER MATHEMATIK, 1974, 20 (04): : 339 - 352
  • [39] Analysis of applicability to build K-value model in southwest China
    Liu, Lilong
    Wu, Pituan
    Chen, Jun
    Li, Junyu
    Fen, Haiyang
    Li, Feida
    INTERNATIONAL CONFERENCE ON INTELLIGENT EARTH OBSERVING AND APPLICATIONS 2015, 2015, 9808
  • [40] K-VALUE PREDICTIONS FOR THE METHANE-ETHANE-PROPANE SYSTEM
    MUKHOPADHYAY, M
    AWASTHI, R
    CRYOGENICS, 1981, 21 (06) : 345 - 348