Trends in Unsupervised Methodologies for Optimal K-Value Selection in Clustering Algorithms

被引:0
|
作者
Pegado-Bardayo, Ana [1 ]
Munuzuri, Jesus [1 ]
Escudero-Santana, Alejandro [1 ]
Lorenzo-Espejo, Antonio [1 ]
机构
[1] Univ Seville, Dpto Organizac Ind & Gest Empresas 2, Escuela Tecn Super Ingn, Camino Descubrimientos S-N, Seville 41092, Spain
关键词
Clustering; k-value; k-means; unsupervised learning; DATA SET; NUMBER;
D O I
10.1007/978-3-031-57996-7_49
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Clustering algorithms are a powerful machine learning tool when working with large datasets, as they allow data to be grouped according to certain characteristics without the need to manually label the data. These algorithms generally request the number of clusters to be formed (k) as a parameter of the model and, while in some instances it is possible to indicate this number manually, most situations require this estimation to be an unsupervised task. The most widespread techniques offer acceptable results, but there is still much room for improvement. This study highlights their main shortcomings and reviews some of the advances in the estimation of this parameter presented in recent years, exploring their advantages and limitations.
引用
收藏
页码:282 / 287
页数:6
相关论文
共 41 条
  • [31] Adaptive Binary Bat and Markov Clustering Algorithms for Optimal Text Feature Selection in News Events Detection Model
    Al-Dyani, Wafa Zubair
    Ahmad, Farzana Kabir
    Kamaruddin, Siti Sakira
    IEEE ACCESS, 2022, 10 : 85655 - 85676
  • [32] Selection of Optimal Number of Clusters and Centroids for K-means and Fuzzy C-means Clustering: A Review
    Pugazhenthi, A.
    Kumar, Lakshmi Sutha
    PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020), 2020,
  • [33] UNSUPERVISED K-MEANS CLUSTERING BASED OUT-OF-SET CANDIDATE SELECTION FOR ROBUST OPEN-SET LANGUAGE RECOGNITION
    Zhang, Qian
    Hansen, John H. L.
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 324 - 329
  • [34] Addressing limitations of the K-means clustering algorithm: outliers, non-spherical data, and optimal cluster selection
    Khan, Iliyas Karim
    Daud, Hanita Binti
    Zainuddin, Nooraini binti
    Sokkalingam, Rajalingam
    Museeb, Abdul
    Inayat, Agha
    AIMS MATHEMATICS, 2024, 9 (09): : 25070 - 25097
  • [35] A NEAR-OPTIMAL INITIAL SEED VALUE SELECTION IN K-MEANS ALGORITHM USING A GENETIC ALGORITHM
    BABU, GP
    MURTY, MN
    PATTERN RECOGNITION LETTERS, 1993, 14 (10) : 763 - 769
  • [36] Unsupervised Feature Selection Using an Integrated Strategy of Hierarchical Clustering With Singular Value Decomposition: An Integrative Biomarker Discovery Method With Application to Acute Myeloid Leukemia
    Bhadra, Tapas
    Mallik, Saurav
    Sohel, Amir
    Zhao, Zhongming
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (03) : 1354 - 1364
  • [37] Performance analysis of optimal cluster selection and intrusion detection by hierarchical K-means clustering with hybrid ABC-DT
    Jesuretnam, Josemila Baby
    Rose, Jeba James
    INTERNATIONAL JOURNAL OF PERVASIVE COMPUTING AND COMMUNICATIONS, 2021, 17 (01) : 49 - 63
  • [38] Optimal allocation of multiple distributed generations including uncertainties in distribution networks by k-means clustering and particle swarm optimization algorithms
    Eyüboğlu O.H.
    Gül Ö.
    Renewable Energy and Power Quality Journal, 2021, 19 : 79 - 84
  • [39] Utilizing Clonal Selection Theory Inspired Algorithms and K-Means Clustering for Predicting OPEC Carbon Dioxide Emissions from Petroleum Consumption
    Lasisi, Ayodele
    Ghazali, Rozaida
    Chiroma, Haruna
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING, 2017, 549 : 101 - 110
  • [40] Optimal selection based K-mean clustering technique to improve the energy efficiency in cognitive radio networks for 6G applications
    Rajavel, S. Esakki
    Aruna, T.
    Rajakumar, G.
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2021, 34 (18)