Near-Optimal Clustering in the k-machine model

被引：9

作者：

Bandyapadhyay, Sayan ^{[1
]}

Inamdar, Tanmay ^{[1
]}

Pai, Shreyas ^{[1
]}

Pemmaraju, Sriram V. ^{[1
]}

机构：

[1] Univ Iowa, Iowa City, IA 52242 USA

来源：

ICDCN'18: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING | 2018年

关键词：

Clustering; Facility location; k-median; k-center; k-machine model; large-scale clustering; distributed clustering;

D O I：

10.1145/3154273.3154317

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The clustering problem, in its many variants, has numerous applications in operations research and computer science (e.g., in applications in bioinformatics, image processing, social network analysis, etc.). As sizes of data sets have grown rapidly, researchers have focused on designing algorithms for clustering problems in models of computation suited for large-scale computation such as MapReduce, Pregel, and streaming models. The k-machine model (Klauck et al., SODA 2015) is a simple, message-passing model for large-scale distributed graph processing. This paper considers three of the most prominent examples of clustering problems: the uncapacitated facility location problem, the p-median problem, and the p-center problem and presents O(1)-factor approximation algorithms for these problems running in (O) over tilde (n/k) rounds in the k-machine model. These algorithms are optimal upto polylogarithmic factors because this paper also shows (Omega) over tilde (n/k) lower bounds for obtaining poly(n)-factor approximation algorithms for these problems. These are the first results for clustering problems in the k-machine model. We assume that the metric provided as input for these clustering problems in only implicitly provided, as an edge-weighted graph and in a nutshell, our main technical contribution is to show that constant-factor approximation algorithms for all three clustering problems can be obtained by learning only a small portion of the input metric.

引用

页数：10

共 50 条

[31] THE NEAR-OPTIMAL INSTRUCTION SET
SMITH, T
IEEE MICRO, 1982, 2 (03) : 5 - 6
[32] Near-Optimal Online Auctions
Blum, Avrim
Hartline, Jason D.
PROCEEDINGS OF THE SIXTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2005, : 1156 - 1163
[33] Near-optimal terrain collision
Malaek, S. M.
Abbasi, A.
2006 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2006, : 2990 - +
[34] Near-optimal adaptive polygonization
Seibold, W
Joy, KI
COMPUTER GRAPHICS INTERNATIONAL, PROCEEDINGS, 1999, : 206 - 213
[35] Near-Optimal Light Spanners
Chechik, Shiri
Wulff-Nilsen, Christian
ACM TRANSACTIONS ON ALGORITHMS, 2018, 14 (03)
[36] Near-optimal block alignments
Tseng, Kuo-Tsung
Yang, Chang-Biau
Huang, Kuo-Si
Peng, Yung-Hsing
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 789 - 795
[37] Near-optimal list colorings
Molloy, M
Reed, B
RANDOM STRUCTURES & ALGORITHMS, 2000, 17 (3-4) : 376 - 402
[38] Near-optimal sequence alignment
Vingron, M
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) : 346 - 352
[39] Near-optimal frequency-weighted interpolatory model reduction
Breiten, Tobias
Beattie, Christopher
Gugercin, Serkan
SYSTEMS & CONTROL LETTERS, 2015, 78 : 8 - 18
[40] The near-optimal feasible space of a renewable power system model
Neumann, Fabian
Brown, Tom
ELECTRIC POWER SYSTEMS RESEARCH, 2021, 190

← 1 2 3 4 5 →