FEDBERT: When Federated Learning Meets Pre-training

被引:48
|
作者
Tian, Yuanyishu [1 ]
Wan, Yao [1 ]
Lyu, Lingjuan [2 ]
Yao, Dezhong [1 ]
Jin, Hai [1 ]
Sun, Lichao [3 ]
机构
[1] Huazhong Univ Sci & Technol, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Sch Comp Sci & Technol,Cluster & Grid Comp Lab, 1037 Luoyu Rd, Wuhan 430074, Peoples R China
[2] Sony AI, Minato Ku, 1-7-1 Konan, Tokyo, Japan
[3] Lehigh Univ, 113 Res Dr, Bethlehem, PA 18015 USA
基金
中国国家自然科学基金;
关键词
Federated learning; pre-training; BERT; NLP;
D O I
10.1145/3510033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fast growth of pre-trained models (PTMs) has brought natural language processing to a new era, which has become a dominant technique for various natural language processing (NLP) applications. Every user can download the weights of PTMs, then fine-tune the weights for a task on the local side. However, the pre-training of a model relies heavily on accessing a large-scale of training data and requires a vast amount of computing resources. These strict requirements make it impossible for any single client to pre-train such a model. To grant clients with limited computing capability to participate in pre-training a large model, we propose a new learning approach, FEDBERT, that takes advantage of the federated learning and split learning approaches, resorting to pre-training BERT in a federated way. FEDBERT can prevent sharing the raw data information and obtain excellent performance. Extensive experiments on seven GLUE tasks demonstrate that FEDBERT can maintain its effectiveness without communicating to the sensitive local data of clients.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Learning Visual Prior via Generative Pre-Training
    Xie, Jinheng
    Ye, Kai
    Li, Yudong
    Li, Yuexiang
    Lin, Kevin Qinghong
    Zheng, Yefeng
    Shen, Linlin
    Shou, Mike Zheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [32] Improving Reinforcement Learning Pre-Training with Variational Dropout
    Blau, Tom
    Ott, Lionel
    Ramos, Fabio
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 4115 - 4122
  • [33] When Collaborative Federated Learning Meets Blockchain to Preserve Privacy in Healthcare
    El Houda, Zakaria Abou
    Hafid, Abdelhakim Senhaji
    Khoukhi, Lyes
    Brik, Bouziane
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2455 - 2465
  • [34] When Federated Learning Meets Oligopoly Competition: Stability and Model Differentiation
    Huang, Chao
    Dachille, Justin
    Liu, Xin
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (16): : 27409 - 27420
  • [35] Learning to See before Learning to Act: Visual Pre-training for Manipulation
    Lin Yen-Chen
    Zeng, Andy
    Song, Shuran
    Isola, Phillip
    Lin, Tsung-Yi
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 7286 - 7293
  • [36] Hybrid Learning: When Centralized Learning Meets Federated Learning in the Mobile Edge Computing Systems
    Feng, Chenyuan
    Yang, Howard H.
    Wang, Siye
    Zhao, Zhongyuan
    Quek, Tony Q. S.
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (12) : 7008 - 7022
  • [37] FedPETuning: When Federated Learning Meets the Parameter-Efficient Tuning Methods of Pre-trained Language Models
    Zhang, Zhuo
    Yang, Yuanhang
    Dai, Yong
    Wang, Qifan
    Yu, Yue
    Que, Lizhen
    Xu, Zenglin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9963 - 9977
  • [38] PoisonedEncoder: Poisoning the Unlabeled Pre-training Data in Contrastive Learning
    Liu, Hongbin
    Jia, Jinyuan
    Gong, Neil Zhenqiang
    PROCEEDINGS OF THE 31ST USENIX SECURITY SYMPOSIUM, 2022, : 3629 - 3645
  • [39] Multilingual Molecular Representation Learning via Contrastive Pre-training
    Guo, Zhihui
    Sharma, Pramod
    Martinez, Andy
    Du, Liang
    Abraham, Robin
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 3441 - 3453
  • [40] A Contrastive Learning Pre-Training Method for Motif Occupancy Identification
    Lin, Ken
    Quan, Xiongwen
    Yin, Wenya
    Zhang, Han
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (09)