Near-Optimal Clustering in the k-machine model

被引:9
|
作者
Bandyapadhyay, Sayan [1 ]
Inamdar, Tanmay [1 ]
Pai, Shreyas [1 ]
Pemmaraju, Sriram V. [1 ]
机构
[1] Univ Iowa, Iowa City, IA 52242 USA
关键词
Clustering; Facility location; k-median; k-center; k-machine model; large-scale clustering; distributed clustering;
D O I
10.1145/3154273.3154317
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The clustering problem, in its many variants, has numerous applications in operations research and computer science (e.g., in applications in bioinformatics, image processing, social network analysis, etc.). As sizes of data sets have grown rapidly, researchers have focused on designing algorithms for clustering problems in models of computation suited for large-scale computation such as MapReduce, Pregel, and streaming models. The k-machine model (Klauck et al., SODA 2015) is a simple, message-passing model for large-scale distributed graph processing. This paper considers three of the most prominent examples of clustering problems: the uncapacitated facility location problem, the p-median problem, and the p-center problem and presents O(1)-factor approximation algorithms for these problems running in (O) over tilde (n/k) rounds in the k-machine model. These algorithms are optimal upto polylogarithmic factors because this paper also shows (Omega) over tilde (n/k) lower bounds for obtaining poly(n)-factor approximation algorithms for these problems. These are the first results for clustering problems in the k-machine model. We assume that the metric provided as input for these clustering problems in only implicitly provided, as an edge-weighted graph and in a nutshell, our main technical contribution is to show that constant-factor approximation algorithms for all three clustering problems can be obtained by learning only a small portion of the input metric.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] The dynamics and near-optimal controls of a dengue model with threshold policy
    You, Wei
    Meyer-Baese, Anke
    Xu, Xinzhong
    Zhang, Qimin
    MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2024, 47 (17) : 13313 - 13335
  • [42] Near-optimal experimental design for model selection in systems biology
    Busetto, Alberto Giovanni
    Hauser, Alain
    Krummenacher, Gabriel
    Sunnaker, Mikael
    Dimopoulos, Sotiris
    Ong, Cheng Soon
    Stelling, Joerg
    Buhmann, Joachim M.
    BIOINFORMATICS, 2013, 29 (20) : 2625 - 2632
  • [43] Near-optimal control for a stochastic SIRS model with imprecise parameters
    Mu, Xiaojie
    Zhang, Qimin
    Rong, Libin
    ASIAN JOURNAL OF CONTROL, 2020, 22 (05) : 2090 - 2105
  • [44] Optimal, near-optimal, and robust epidemic control
    Dylan H. Morris
    Fernando W. Rossine
    Joshua B. Plotkin
    Simon A. Levin
    Communications Physics, 4
  • [45] Optimal, near-optimal, and robust epidemic control
    Morris, Dylan H.
    Rossine, Fernando W.
    Plotkin, Joshua B.
    Levin, Simon A.
    COMMUNICATIONS PHYSICS, 2021, 4 (01)
  • [46] OPTIMAL AND NEAR-OPTIMAL BROADCAST IN RANDOM GRAPHS
    SCHEINERMAN, ER
    WIERMAN, JC
    DISCRETE APPLIED MATHEMATICS, 1989, 25 (03) : 289 - 297
  • [47] A Fast and Near-Optimal Clustering Algorithm for Low-Power Clock Tree Synthesis
    Shelar, Rupesh S.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2012, 31 (11) : 1781 - 1786
  • [48] Near-Optimal User Clustering and Power Control for Uplink MISO-NOMA Networks
    Zhang, Junxia
    Liu, Ming
    Xiong, Ke
    Zhang, Mingshan
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [49] Near-optimal Virtual Machine Placement with Product Traffic Pattern in Data Centers
    You, Kun
    Tang, Bin
    Ding, Feng
    2013 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2013, : 3705 - +
  • [50] Near-optimal Linear Decision Trees for k-SUM and Related Problems
    Kane, Daniel M.
    Lovett, Shachar
    Moran, Shay
    JOURNAL OF THE ACM, 2019, 66 (03)