Serverless Federated AUPRC Optimization for Multi-Party Collaborative Imbalanced Data Mining

被引:2
|
作者
Wu, Xidong [1 ]
Hu, Zhengmian [1 ]
Pei, Jian [2 ]
Huang, Heng [3 ]
机构
[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA
[2] Duke Univ, Dept Comp Sci, Durham, NC 27706 USA
[3] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
关键词
AUPRC; federated learning; imbalanced data; stochastic optimization; serverless federated learning;
D O I
10.1145/3580305.3599499
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To address the big data challenges, serverless multi-party collaborative training has recently attracted attention in the data mining community, since they can cut down the communications cost by avoiding the server node bottleneck. However, traditional serverless multi-party collaborative training algorithms were mainly designed for balanced data mining tasks and are intended to optimize accuracy (e.g., cross-entropy). The data distribution in many real-world applications is skewed and classifiers, which are trained to improve accuracy, perform poorly when applied to imbalanced data tasks since models could be significantly biased toward the primary class. Therefore, the Area Under Precision-Recall Curve (AUPRC) was introduced as an effective metric. Although multiple single-machine methods have been designed to train models for AUPRC maximization, the algorithm for multi-party collaborative training has never been studied. The change from the single-machine to the multi-party setting poses critical challenges. For example, existing single-machine-based AUPRC maximization algorithms maintain an inner state for local each data point, thus these methods are not applicable to large-scale multi-party collaborative training due to the dependence on each local data point. To address the above challenge, in this paper, we reformulate the serverless multi-party collaborative AUPRC maximization problem as a conditional stochastic optimization problem in a serverless multi-party collaborative learning setting and propose a new ServerLess biAsed sTochastic gradiEnt (SLATE) algorithm to directly optimize the AUPRC. After that, we use the variance reduction technique and propose ServerLess biAsed sTochastic gradiEnt with Momentum-based variance reduction (SLATE-M) algorithm to improve the convergence rate, which matches the best theoretical convergence result reached by the single-machine online method. To the best of our knowledge, this is the first work to solve the multi-party collaborative AUPRC maximization problem. Finally, extensive experiments show the advantages of directly optimizing the AUPRC with distributed learning methods and also verify the efficiency of our new algorithms (i.e., SLATE and SLATE-M).
引用
收藏
页码:2648 / 2659
页数:12
相关论文
共 50 条
  • [21] EFMVFL: An Efficient and Flexible Multi-party Vertical Federated Learning without a Third Party
    Huang, Yimin
    Wang, Wanwan
    Zhao, Xingying
    Wang, Yukun
    Feng, Xinyu
    He, Hao
    Yao, Ming
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (03)
  • [22] Homeland security and privacy sensitive data mining from multi-party distributed resources
    Kargupta, H
    Liu, K
    Datta, S
    Ryan, J
    Sivakumar, K
    PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 1257 - 1260
  • [23] Augmented Multi-Party Computation Against Gradient Leakage in Federated Learning
    Zhang, Chi
    Ekanut, Sotthiwat
    Zhen, Liangli
    Li, Zengxiang
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (06) : 742 - 751
  • [24] A Privacy-Preserving Scheme for Multi-Party Vertical Federated Learning
    FAN Mochan
    ZHANG Zhipeng
    LI Difei
    ZHANG Qiming
    YAO Haidong
    ZTE Communications, 2024, 22 (04) : 89 - 96
  • [25] Secure Byzantine resilient federated learning based on multi-party computation
    Gao, Hongfeng
    Huang, Hao
    Tian, Youliang
    Tongxin Xuebao/Journal on Communications, 2025, 46 (02): : 108 - 122
  • [26] A Verifiable Federated Learning Scheme Based on Secure Multi-party Computation
    Mou, Wenhao
    Fu, Chunlei
    Lei, Yan
    Hu, Chunqiang
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT II, 2021, 12938 : 198 - 209
  • [27] Multi-Party Federated Recommendation Based on Semi-Supervised Learning
    Liu, Xin
    Lv, Jiuluan
    Chen, Feng
    Wei, Qingjie
    He, Hangxuan
    Qian, Ying
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (04) : 356 - 370
  • [28] MPCFL: Towards Multi-party Computation for Secure Federated Learning Aggregation
    Kaminaga, Hiroki
    Awaysheh, Feras M.
    Alawadi, Sadi
    Kamm, Liina
    16TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC 2023, 2023,
  • [29] A Serverless Federated Learning Service Ecosystem for Multi-Cloud Collaborative Environments
    Hu, Cong
    Guan, Zhitao
    Yu, Pengfei
    Yao, Zhen
    Zhang, Cuicui
    Lu, Ruixuan
    Wang, Peng
    2023 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING, CLOUDNET, 2023, : 364 - 371
  • [30] Differential Privacy-Preserving of Multi-Party Collaboration Under Federated Learning in Data Center Networks
    Wang, Xi
    Fan, Weibei
    Hu, Xinzhi
    He, Jing
    Chi, Chi-Hung
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (02): : 1223 - 1237