A core-set approach for distributed quadratic programming in big-data classification

被引:0
|
作者
Notarstefano, Giuseppe [1 ]
机构
[1] Univ Salento, Dept Engn, Via Monteroni, I-73100 Lecce, Italy
关键词
Distributed optimization; Big-Data Optimization; Support Vector Machine (SVM); Machine Learning; Core Set; Asynchronous networks;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A new challenge for learning algorithms in cyber-physical network systems is the distributed solution of big-data classification problems, i.e., problems in which both the number of training samples and their dimension is high. Motivated by several problem set-ups in Machine Learning, in this paper we consider a special class of quadratic optimization problems involving a "large" number of input data, whose dimension is "big". To solve these quadratic optimization problems over peer-to-peer networks, we propose an asynchronous, distributed algorithm that scales with both the number and the dimension of the input data (training samples in the classification problem). The proposed distributed optimization algorithm relies on the notion of "core-set" which is used in geometric optimization to approximate the value function associated to a given set of points with a smaller subset of points. By computing local core-sets on a smaller version of the global problem and exchanging them with neighbors, the nodes reach consensus on a set of active constraints representing an approximate solution for the global quadratic program.
引用
收藏
页码:1372 / 1377
页数:6
相关论文
共 50 条
  • [1] A Minimax Approach for Classification with Big-data
    Krishnan, R.
    Jagannathan, S.
    Samaranayake, V. A.
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1437 - 1444
  • [2] Harmony: An Approach for Geo-distributed Processing of Big-Data Applications
    Zhang, Han
    Ramapantulu, Lavanya
    Teo, Yong Meng
    2019 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2019, : 160 - 170
  • [3] DQPFS: Distributed quadratic programming based feature selection for big data
    Soheili, Majid
    Eftekhari-Moghadam, Amir Masoud
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 138 : 1 - 14
  • [4] Distributed Big-Data Optimization via Block Communications
    Notarnicola, Ivano
    Sun, Ying
    Scutari, Gesualdo
    Notarstefano, Giuseppe
    2017 IEEE 7TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP), 2017,
  • [5] Classification techniques for Disease detection using Big-data
    Shah, Jaimin
    Patel, Raj
    2019 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER TECHNOLOGIES AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2019, : 140 - 145
  • [6] Improved Locally Linear Embedding for Big-data Classification
    Ramirez, Andres
    Rahnemoonfar, Maryam
    BIGSPATIAL 2017: PROCEEDINGS OF THE 6TH ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON ANALYTICS FOR BIG GEOSPATIAL DATA (BIGSPATIAL-2017), 2017, : 37 - 41
  • [7] A Big-Data Approach to Contemporary French Politics
    Sobanet, Andrew
    Singh, Lisa
    CONTEMPORARY FRENCH AND FRANCOPHONE STUDIES, 2020, 24 (05) : 625 - 634
  • [8] A Survey on Geographically Distributed Big-Data Processing Using MapReduce
    Dolev, Shlomi
    Florissi, Patricia
    Gudes, Ehud
    Sharma, Shantanu
    Singer, Ido
    IEEE TRANSACTIONS ON BIG DATA, 2019, 5 (01) : 60 - 80
  • [9] Distributed Big-Data Optimization via Blockwise Gradient Tracking
    Notarnicola, Ivano
    Sun, Ying
    Scutari, Gesualdo
    Notarstefano, Giuseppe
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (05) : 2045 - 2060
  • [10] Role of big-data in classification and novel class detection in data streams
    Chandak M.B.
    Journal of Big Data, 3 (1)