Distributed Submodular Maximization

Cited by: 0
Authors
Mirzasoleiman, Baharan [1]
Karbasi, Amin [2]
Sarkar, Rik [3]
Krause, Andreas [1]
Affiliations
[1] ETH, Dept Comp Sci, Univ Str 6, CH-8092 Zurich, Switzerland
[2] Yale Univ, Sch Engn & Appl Sci, New Haven, CT USA
[3] Univ Edinburgh, Dept Informat, 10 Crichton St, Edinburgh EH8 9AB, Midlothian, Scotland
Keywords
distributed computing; submodular functions; approximation algorithms; greedy algorithms; map-reduce; SET; APPROXIMATIONS; ALGORITHM; NETWORKS
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Many large-scale machine learning problems (clustering, non-parametric learning, kernel machines, etc.) require selecting a small yet representative subset from a large dataset. Such problems can often be reduced to maximizing a submodular set function subject to various constraints. Classical approaches to submodular optimization require centralized access to the full dataset, which is impractical for truly large-scale problems. In this paper, we consider the problem of submodular function maximization in a distributed fashion. We develop a simple, two-stage protocol, GREEDI, that is easily implemented using MapReduce-style computations. We theoretically analyze our approach and show that, under certain natural conditions, performance close to that of the centralized approach can be achieved. We begin with monotone submodular maximization subject to a cardinality constraint, and then extend this approach to obtain approximation guarantees for (not necessarily monotone) submodular maximization subject to more general constraints, including matroid or knapsack constraints. In our extensive experiments, we demonstrate the effectiveness of our approach on several applications, including sparse Gaussian process inference and exemplar-based clustering on tens of millions of examples using Hadoop.
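The two-stage idea in the abstract can be illustrated with a small single-process sketch for the monotone, cardinality-constrained case. This is not the authors' implementation: the abstract gives no algorithmic details, so the partitioning rule, the final "best of local and merged" selection, the toy coverage objective, and all names (`greedy`, `two_stage_greedy`, `coverage`, `sets`) are assumptions made for illustration, and the "machines" are simulated with an ordinary loop rather than MapReduce.

```python
from typing import Callable, Dict, Iterable, List, Set

def greedy(candidates: Iterable[int], f: Callable[[Set[int]], float], k: int) -> Set[int]:
    """Standard greedy: repeatedly add the element with the largest marginal gain."""
    selected: Set[int] = set()
    pool = set(candidates)
    for _ in range(min(k, len(pool))):
        base = f(selected)
        # sorted() makes tie-breaking deterministic (smallest id wins).
        best = max(sorted(pool), key=lambda e: f(selected | {e}) - base)
        selected.add(best)
        pool.remove(best)
    return selected

def two_stage_greedy(ground: Iterable[int], f: Callable[[Set[int]], float],
                     k: int, m: int) -> Set[int]:
    """Hypothetical two-stage scheme: local greedy on m parts, then greedy on the merge."""
    items = sorted(ground)
    parts: List[List[int]] = [items[i::m] for i in range(m)]  # assumed round-robin split
    # Stage 1 ("map"): each simulated machine picks up to k elements from its part.
    local = [greedy(p, f, k) for p in parts]
    # Stage 2 ("reduce"): run greedy again on the union of the local picks.
    merged = set().union(*local)
    central = greedy(merged, f, k)
    # Assumed final step: return the best candidate solution seen.
    return max(local + [central], key=f)

# Toy monotone submodular objective: number of distinct items covered.
sets: Dict[int, Set[int]] = {0: {1, 2, 3}, 1: {3, 4}, 2: {5}, 3: {1, 2}, 4: {4, 5, 6}, 5: {6}}

def coverage(selection: Set[int]) -> float:
    return float(len(set().union(*(sets[e] for e in selection)))) if selection else 0.0

result = two_stage_greedy(sets.keys(), coverage, k=2, m=2)
```

In this toy instance the second stage only sees the elements chosen locally, which is what keeps communication small; how close the result is to the centralized greedy solution depends on the conditions analyzed in the paper.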
Pages: 44
Related papers
50 in total
  • [1] Distributed Maximization of Submodular and Approximately Submodular Functions
    Ye, Lintao
    Sundaram, Shreyas
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 2979 - 2984
  • [2] On Distributed Submodular Maximization with Limited Information
    Gharesifard, Bahman
    Smith, Stephen L.
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 1048 - 1053
  • [3] Distributed Submodular Maximization with Parallel Execution
    Sun, Haoyuan
    Grimsman, David
    Marden, Jason R.
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 1477 - 1482
  • [4] The Impact of Information in Distributed Submodular Maximization
    Grimsman, David
    Ali, Mohd Shabbir
    Hespanha, Joao P.
    Marden, Jason R.
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2019, 6 (04): : 1334 - 1343
  • [5] A New Framework for Distributed Submodular Maximization
    Barbosa, Rafael da Ponte
    Ene, Alina
    Nguyen, Huy L.
    Ward, Justin
    2016 IEEE 57TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2016, : 645 - 654
  • [6] Distributed Submodular Maximization With Limited Information
    Gharesifard, Bahman
    Smith, Stephen L.
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2018, 5 (04): : 1635 - 1645
  • [7] Distributed Submodular Maximization with Bounded Communication Cost
    Castiglia, Timothy
    Patterson, Stacy
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 3006 - 3011
  • [8] A Distributed Algorithm for Partitioned Robust Submodular Maximization
    Bogunovic, Ilija
    Mitrovic, Slobodan
    Scarlett, Jonathan
    Cevher, Volkan
    2017 IEEE 7TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP), 2017,
  • [9] Bicriteria Distributed Submodular Maximization in a Few Rounds
    Epasto, Alessandro
    Mirrokni, Vahab
    Zadimoghaddam, Morteza
    PROCEEDINGS OF THE 29TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES (SPAA'17), 2017, : 25 - 33
  • [10] Distributed submodular maximization: trading performance for privacy
    Rezazadeh, Navid
    Kia, Solmaz S.
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 5953 - 5958