Reuse-centric k-means configuration

被引:0
|
作者
Zhang, Lijun [1 ]
Guan, Hui [1 ]
Ding, Yufei [2 ]
Shen, Xipeng [3 ]
Krim, Hamid [3 ]
机构
[1] Univ Massachusetts, Amherst, MA 01002 USA
[2] Univ Calif Santa Barbara, Santa Barbara, CA 93106 USA
[3] North Carolina State Univ, Raleigh, NC 27606 USA
基金
美国国家科学基金会;
关键词
K-means; Algorithm configuration; Computation reuse; TOP;
D O I
10.1016/j.is.2021.101787
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
K-means configuration is to find a configuration of k-means (e.g., the number of clusters, feature sets) that maximize some objectives. It is a time-consuming process due to the iterative nature of k-means. This paper proposes reuse-centric k-means configuration to accelerate k-means configuration. It is based on the observation that the explorations of different configurations share lots of common or similar computations. Effectively reusing the computations from prior trials of different configurations could largely shorten the configuration time. To materialize the idea, the paper presents a set of novel techniques, including reuse-based filtering, center reuse, and a two-phase design to capitalize on the reuse opportunities on three levels: validation, number of clusters, and feature sets. Experiments on k-means-based data classification tasks show that reuse-centric k-means configuration can speed up a heuristic search-based configuration process by a factor of 5.8, and a uniform search-based attainment of classification error surfaces by a factor of 9.1. The paper meanwhile provides some important insights on how to effectively apply the acceleration techniques to tap into a full potential. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Cartesian k-means
    Norouzi, Mohammad
    Fleet, David J.
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3017 - 3024
  • [22] Balanced k-Means
    Tai, Chen-Ling
    Wang, Chen-Shu
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2017), PT II, 2017, 10192 : 75 - 82
  • [23] Integrating Curriculum Learning With k-Means: A Data-Centric Approach to Faster Clustering
    Majeed, Abdul
    Hwang, Seong Oun
    IT PROFESSIONAL, 2024, 26 (05) : 36 - 46
  • [24] A Modified K-means Algorithms - Bi-Level K-Means Algorithm
    Yu, Shyr-Shen
    Chu, Shao-Wei
    Wang, Ching-Lin
    Chan, Yung-Kuan
    Chuang, Chia-Yi
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON SOFT COMPUTING IN INFORMATION COMMUNICATION TECHNOLOGY, 2014, : 10 - 13
  • [25] A Modified K-means Algorithm - Two-Layer K-means Algorithm
    Liu, Chen-Chung
    Chu, Shao-Wei
    Chan, Yung-Kuan
    Yu, Shyr-Shen
    2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 2014, : 447 - 450
  • [26] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [27] Sorted K-Means Towards the Enhancement of K-Means to Form Stable Clusters
    Arora, Preeti
    Virmani, Deepali
    Jindal, Himanshu
    Sharma, Mritunjaya
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMMUNICATION AND NETWORKS, 2017, 508 : 479 - 486
  • [28] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
  • [29] Task models, scenarios, and critical parameters: Toward the establishment of an effective infrastructure for reuse-centric requirements analysis
    Montabert, Cyril
    Mccrickard, Scott
    3RD INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS, AND APPLICAT/4TH INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2006, : 186 - +
  • [30] Selection of K in K-means clustering
    Pham, DT
    Dimov, SS
    Nguyen, CD
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2005, 219 (01) : 103 - 119