Near-Optimal Clustering in the k-machine model

Cited by: 9
Authors
Bandyapadhyay, Sayan [1]
Inamdar, Tanmay [1]
Pai, Shreyas [1]
Pemmaraju, Sriram V. [1]
Affiliations
[1] Univ Iowa, Iowa City, IA 52242 USA
Keywords
Clustering; Facility location; k-median; k-center; k-machine model; large-scale clustering; distributed clustering
DOI
10.1145/3154273.3154317
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The clustering problem, in its many variants, has numerous applications in operations research and computer science (e.g., in bioinformatics, image processing, and social network analysis). As data sets have grown rapidly, researchers have focused on designing clustering algorithms in models of computation suited to large-scale computation, such as MapReduce, Pregel, and streaming models. The k-machine model (Klauck et al., SODA 2015) is a simple message-passing model for large-scale distributed graph processing. This paper considers three of the most prominent clustering problems: the uncapacitated facility location problem, the p-median problem, and the p-center problem, and presents O(1)-factor approximation algorithms for these problems running in $\tilde{O}(n/k)$ rounds in the k-machine model. These algorithms are optimal up to polylogarithmic factors because this paper also shows $\tilde{\Omega}(n/k)$ lower bounds for obtaining poly(n)-factor approximation algorithms for these problems. These are the first results for clustering problems in the k-machine model. We assume that the metric provided as input for these clustering problems is only implicitly provided, as an edge-weighted graph. In a nutshell, our main technical contribution is to show that constant-factor approximation algorithms for all three clustering problems can be obtained by learning only a small portion of the input metric.
Pages: 10