The Communication Complexity of Distributed ε-Approximations

被引:1
|
作者
Huang, Zengfeng [1 ]
Yi, Ke [2 ]
机构
[1] Aarhus Univ, MADALGO, DK-8000 Aarhus C, Denmark
[2] HKUST, Dept CSE, Hong Kong, Hong Kong, Peoples R China
关键词
epsilon-approximations; communication complexity; discrepancy; distributed data; BOUNDS;
D O I
10.1109/FOCS.2014.69
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data summarization is an effective approach to dealing with the "big data" problem. While data summarization problems traditionally have been studied is the streaming model, the focus is starting to shift to distributed models, as distributed/parallel computation seems to be the only viable way to handle today's massive data sets. In this paper, we study epsilon-approximations, a classical data summary that, intuitively speaking, preserves approximately the density of the underlying data set over a certain range space. We consider the problem of computing epsilon-approximations for a data set which is held jointly by k players, and give general communication upper and lower bounds that hold for any range space whose discrepancy is known.
引用
收藏
页码:591 / 600
页数:10
相关论文
共 50 条
  • [31] Projection-free Distributed Online Learning with Sublinear Communication Complexity
    Wan, Yuanyu
    Wang, Guanghui
    Tu, Wei-Wei
    Zhang, Lijun
    Journal of Machine Learning Research, 2022, 23 : 1 - 53
  • [32] Projection-free Distributed Online Learning with Sublinear Communication Complexity
    Wan, Yuanyu
    Wang, Guanghui
    Tu, Wei-Wei
    Zhang, Lijun
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [33] PltcRB:a practical distributed randomness beacon with optimal amortized communication complexity
    Zheyi WU
    Haolin LIU
    Lei WANG
    Science China(Information Sciences), 2025, 68 (02) : 269 - 284
  • [34] Towards Optimal Communication Complexity in Distributed Non-Convex Optimization
    Patel, Kumar Kshitij
    Wang, Lingxiao
    Woodworth, Blake
    Bullins, Brian
    Srebro, Nati
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [35] The complexity of approximations for epistemic synthesis
    Huang, Xiaowei
    van der Meyden, Ron
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2016, (202): : 120 - 137
  • [36] Multimessage Multicasting: Complexity and approximations
    Gonzalez, TF
    THIRTIETH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL 1: SOFTWARE TECHNOLOGY AND ARCHITECTURE, 1997, : 211 - 220
  • [37] Complexity and approximations for multimessage multicasting
    Gonzalez, TF
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1998, 55 (02) : 215 - 235
  • [38] On the Complexity of Communication Complexity
    Kushilevitz, Eyal
    Weinreb, Enav
    STOC'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2009, : 465 - 473
  • [39] Distributed Optimization for Overparameterized Problems: Achieving Optimal Dimension Independent Communication Complexity
    Song, Bingqing
    Tsaknakis, Ioannis
    Yau, Chung-Yiu
    Wai, Hoi-To
    Hong, Mingyi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [40] Storage-Computation-Communication Tradeoff in Distributed Computing: Fundamental Limits and Complexity
    Yan, Qifa
    Yang, Sheng
    Wigger, Michele
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2022, 68 (08) : 5496 - 5512