Automatic centroid initialization in k-means using artificial hummingbird algorithm

Authors
Kusum Preeti [1]
Deep [2]
Affiliations
[1] Indian Institute of Technology Roorkee, Department of Mathematics
[2] The University of Tennessee Health Science Centre
Keywords
K-means; Nature inspired algorithm; Data clustering; Clustering analysis
DOI
10.1007/s00521-024-10764-4
Abstract
K-means is a widely used clustering technique whose results depend heavily on the initial locations of the cluster centroids. Poorly chosen centroids can cause the algorithm to get trapped in suboptimal solutions. Additionally, determining the optimal number of clusters for large datasets is computationally expensive. To address these challenges, the recently developed Artificial Hummingbird Algorithm (AHA) is used to initialize the cluster centroid locations and to automatically determine the best estimate for the number of clusters. AHA simulates the specialized flight skills and intelligent foraging strategies of hummingbirds, striking a fine balance between exploration and exploitation during the search process. Unlike other data clustering approaches that use a fixed threshold in heuristic methods, we propose a dynamic threshold, based on the variance of the data with respect to its centroids, for activating cluster centroids in AHA. The data are automatically partitioned into k clusters such that cohesion, measured by cluster diameters, and separation, measured by nearest neighbor distance, are optimized. The algorithm is tested on various datasets, including real-world data, fundamental clustering benchmarks, synthetic data, and high-dimensional data. Performance is evaluated using metrics such as fitness value, inter-cluster distance, and intra-cluster distance. Results indicate that the proposed method ranked first and achieved superior clustering performance compared to state-of-the-art algorithms.
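The abstract names the two quantities being optimized, cohesion and separation, but does not give their formulas. The following is a minimal sketch under stated assumptions: Euclidean distance, cohesion taken as the largest within-cluster diameter, and separation taken as the smallest distance between two points in different clusters. The function names and toy data are hypothetical, and this is not the authors' fitness function.

```python
import numpy as np

def pairwise_distances(A, B):
    """Euclidean distances between every row of A and every row of B."""
    diff = A[:, None, :] - B[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=2))

def cohesion_and_separation(X, labels):
    """Return (largest cluster diameter, smallest nearest-neighbour gap).

    Assumed definitions (not taken from the paper):
      diameter = largest distance between two points in the same cluster
      gap      = smallest distance between two points in different clusters
    Requires at least two distinct labels.
    """
    ids = np.unique(labels)
    diameters = [
        pairwise_distances(X[labels == c], X[labels == c]).max() for c in ids
    ]
    gaps = [
        pairwise_distances(X[labels == a], X[labels == b]).min()
        for i, a in enumerate(ids)
        for b in ids[i + 1:]
    ]
    return max(diameters), min(gaps)

# Toy data: two tight, well-separated groups on a line (illustrative only).
X = np.array([[0.0, 0.0], [1.0, 0.0], [10.0, 0.0], [11.0, 0.0]])
good = np.array([0, 0, 1, 1])  # follows the visible grouping
bad = np.array([0, 1, 0, 1])   # mixes the two groups

good_diameter, good_gap = cohesion_and_separation(X, good)
bad_diameter, bad_gap = cohesion_and_separation(X, bad)
print(good_diameter, good_gap)
print(bad_diameter, bad_gap)
```

Under these assumed definitions, a partition that follows the natural grouping has a small diameter and a large gap, while a partition that mixes the groups shows the opposite, which is the kind of trade-off a cohesion-and-separation objective is meant to reward.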
Pages: 3373-3398
Page count: 25