A Review of Unsupervised K-Value Selection Techniques in Clustering Algorithms

被引:1
|
作者
Pegado-Bardayo, Ana [1 ]
Lorenzo-Espejo, Antonio [1 ]
Munuzuri, Jesus [1 ]
Escudero-Santana, Alejandro [1 ]
机构
[1] Univ Seville, Seville, Spain
来源
JOURNAL OF INDUSTRIAL ENGINEERING AND MANAGEMENT-JIEM | 2024年 / 17卷 / 03期
关键词
clustering; k-means; unsupervised learning; k-value; DATA SET; VALIDATION; NUMBER;
D O I
10.3926/jiem.6791
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Purpose: Automatic grouping of data according to certain characteristics is made possible by clustering algorithms, which makes them an essential tool when working with large datasets. However, although they are unsupervised tools, they generally require the specification of the number of clusters to be formed, k , a task that may be simple for a human, but quite complex to automate. Despite the most commonly used k-value selection techniques offer acceptable results, they are not without shortcomings, suggesting that there is ample room for improvement. This paper briefly introduces clustering techniques, discusses the main shortcomings of conventional k-value selection techniques and examines the advantages and limitations of nine promising alternatives presented in recent years. Design/methodology/approach: An evaluation of the main shortcomings of classic k- value estimation techniques has been carried out, and the newest proposals have been explained and compared. Findings: New k- value estimation indices and methodologies proposed by authors guarantee better results, extending the use of these techniques to large volumes of data, and complex shapes and structures. However, no generical methodology able to overcome all the described shortcomings has still been developed. Research limitations/implications: This research is limited to the newest developed techniques for k- value estimation, including proposals published since 2019. Older proposals have not been considered, as the newest ones overcome the former's shortcomings. A k- value estimation techniques review published in 2019 is cited in the test as a base reference. Practical implications: Although the examples listed in the text apply to industry, the techniques described and discussed in this review are applicable to any area of science that can benefit from the use of clustering techniques. Originality/value: To date, there has been no paper comparing the new k- value estimation techniques. Although there are literature reviews comparing the classical methods, these methods are nowadays nearly obsolete due to the complexity of the data usually faced.
引用
收藏
页码:641 / 649
页数:9
相关论文
共 50 条
  • [21] An Unsupervised Attribute Clustering Algorithm for Unsupervised Feature Selection
    Zhou, Pei-Yuan
    Chan, Keith C. C.
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 710 - 716
  • [22] K-value, an index for estimating fish freshness and quality
    Lakshmanam, PT
    Gopakumar, P
    CURRENT SCIENCE, 1999, 76 (03): : 400 - 404
  • [23] A new empirical K-value equation for reservoir fluids
    Ghafoori, Mohammad Javad
    Aghamiri, Seyed Foad
    Talaie, Mohammad Reza
    FUEL, 2012, 98 : 236 - 242
  • [24] Effect of K-Value on Plasticizer Diffusion in PVC.
    Mozisek, Max
    1973, 26 (06): : 474 - 480
  • [25] Capacitance measurements and k-value extractions of low-k films
    Ciofi, Ivan
    Baklanov, Mikhail R.
    Tokei, Zsolt
    Beyer, Gerald P.
    MICROELECTRONIC ENGINEERING, 2010, 87 (11) : 2391 - 2406
  • [26] Simple predictive procedure calculates heptane K-value
    Moshfeghian, M
    Johannes, AH
    Maddox, RN
    OIL & GAS JOURNAL, 2003, 101 (14) : 62 - 65
  • [27] THE K-VALUE CONCEPT APPLIED FOR GGBFS - PRINCIPLES AND EXPERIENCES
    Haerdtl, R.
    INTERNATIONAL RILEM CONFERENCE ON MATERIAL SCIENCE (MATSCI), VOL III, 2010, 77 : 189 - 198
  • [28] Unsupervised Feature Selection with Feature Clustering
    Cheung, Yiu-ming
    Jia, Hong
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 9 - 15
  • [29] Unsupervised feature selection for balanced clustering
    Zhou, Peng
    Chen, Jiangyong
    Fan, Mingyu
    Du, Liang
    Shen, Yi-Dong
    Li, Xuejun
    KNOWLEDGE-BASED SYSTEMS, 2020, 193
  • [30] Climate classifications: the value of unsupervised clustering
    Zscheischler, Jakob
    Mahecha, Miguel D.
    Harmeling, Stefan
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012, 2012, 9 : 897 - 906