Reuse-centric k-means configuration

被引:0
|
作者
Zhang, Lijun [1 ]
Guan, Hui [1 ]
Ding, Yufei [2 ]
Shen, Xipeng [3 ]
Krim, Hamid [3 ]
机构
[1] Univ Massachusetts, Amherst, MA 01002 USA
[2] Univ Calif Santa Barbara, Santa Barbara, CA 93106 USA
[3] North Carolina State Univ, Raleigh, NC 27606 USA
基金
美国国家科学基金会;
关键词
K-means; Algorithm configuration; Computation reuse; TOP;
D O I
10.1016/j.is.2021.101787
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
K-means configuration is to find a configuration of k-means (e.g., the number of clusters, feature sets) that maximize some objectives. It is a time-consuming process due to the iterative nature of k-means. This paper proposes reuse-centric k-means configuration to accelerate k-means configuration. It is based on the observation that the explorations of different configurations share lots of common or similar computations. Effectively reusing the computations from prior trials of different configurations could largely shorten the configuration time. To materialize the idea, the paper presents a set of novel techniques, including reuse-based filtering, center reuse, and a two-phase design to capitalize on the reuse opportunities on three levels: validation, number of clusters, and feature sets. Experiments on k-means-based data classification tasks show that reuse-centric k-means configuration can speed up a heuristic search-based configuration process by a factor of 5.8, and a uniform search-based attainment of classification error surfaces by a factor of 9.1. The paper meanwhile provides some important insights on how to effectively apply the acceleration techniques to tap into a full potential. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Reuse-Centric K-Means Configuration
    Guan, Hui
    Ding, Yufei
    Shen, Xipeng
    Krim, Hamid
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1224 - 1227
  • [2] General Reuse-Centric CNN Accelerator
    Cicek, Nihat Mert
    Ning, Lin
    Ozturk, Ozcan
    Shen, Xipeng
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (04) : 880 - 891
  • [3] IIP Framework: A Tool for Reuse-Centric Analog Circuit Design
    Prautsch, Benjamin
    Eichler, Uwe
    Rao, Sunil
    Zeugmann, Bjoern
    Puppala, Ajith
    Reich, Torsten
    Lienig, Jens
    2016 13TH INTERNATIONAL CONFERENCE ON SYNTHESIS, MODELING, ANALYSIS AND SIMULATION METHODS AND APPLICATIONS TO CIRCUIT DESIGN (SMACD), 2016,
  • [4] K-means - a fast and efficient K-means algorithms
    Nguyen C.D.
    Duong T.H.
    Nguyen, Cuong Duc (nguyenduccuong@tdt.edu.vn), 2018, Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (11) : 27 - 45
  • [5] Exact Acceleration of K-Means plus plus and K-Means∥
    Raff, Edward
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2928 - 2935
  • [6] K-Means Cloning: Adaptive Spherical K-Means Clustering
    Hedar, Abdel-Rahman
    Ibrahim, Abdel-Monem M.
    Abdel-Hakim, Alaa E.
    Sewisy, Adel A.
    ALGORITHMS, 2018, 11 (10):
  • [7] Empirical Evaluation of K-Means, Bisecting K-Means, Fuzzy C-Means and Genetic K-Means Clustering Algorithms
    Banerjee, Shreya
    Choudhary, Ankit
    Pal, Somnath
    2015 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE), 2015, : 172 - 176
  • [8] Modular configuration of service elements based on the improved K-means algorithm
    Zhang, Fei
    EXPERT SYSTEMS, 2019, 36 (05)
  • [9] A Computation Algorithm for the Configuration of BLE Devices Using k-Means Method
    Onishi, Kensuke
    NEW TRENDS IN SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2016, 286 : 15 - 26
  • [10] Clustering of Image Data Using K-Means and Fuzzy K-Means
    Rahmani, Md. Khalid Imam
    Pal, Naina
    Arora, Kamiya
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (07) : 160 - 163