Data heterogeneous federated learning algorithm for industrial entity extraction

Cited by: 2
Authors
Fu, Shengze [1]
Zhao, Xiaoli [1]
Yang, Chi [1]
Fang, Zhijun [2]
Affiliations
[1] Shanghai Univ Engn Sci, Coll Elect & Elect Engn, Shanghai 201600, Peoples R China
[2] Donghua Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
Keywords
Entity extraction; Federated learning; Non-IID; Data quality performance; BLIND QUALITY ASSESSMENT;
DOI
10.1016/j.displa.2023.102504
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Entity extraction is an important step toward digital transformation in industry. Building an industrial entity extraction model requires large amounts of data, but the parties involved often cannot share data because of commercial competition and security and privacy concerns, which creates "data islands". Federated learning offers a solution: it is a distributed machine learning framework in which each party trains locally and independently on its own private data. The model parameters or gradients of each party are then aggregated on a central server, yielding a model jointly trained by all parties. This approach protects the security and privacy of every party's data while making full use of their data resources. Although federated learning effectively addresses the data island problem, it still faces challenges, the most typical of which is data heterogeneity. To address both the data island and data heterogeneity problems in industrial entity extraction, this paper adopts a federated learning framework and proposes the FedDP algorithm, which assigns aggregation weights based on each participant's data quality performance. Participants with relatively good data quality performance receive higher weights in the aggregation stage and those with relatively poor performance receive lower weights, which optimizes the performance of federated learning in heterogeneous data scenarios.
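The aggregation idea described in the abstract can be illustrated with a minimal sketch. The abstract does not say how FedDP measures "data quality performance" or how it turns that measure into weights, so everything specific here is an assumption: each participant is assumed to report one non-negative score (for example, a validation F1), and weights are assumed to be those scores normalized to sum to 1. The names `quality_weights` and `aggregate` are hypothetical and are not taken from the paper.

```python
from typing import Dict, List

# A model is represented as a mapping from parameter name to a flat list of floats.
Params = Dict[str, List[float]]


def quality_weights(scores: List[float]) -> List[float]:
    """Normalize non-negative quality scores so they sum to 1.

    ASSUMPTION: plain normalization; the paper's actual weighting rule
    is not given in the abstract.
    """
    if not scores:
        raise ValueError("at least one participant is required")
    if any(s < 0 for s in scores):
        raise ValueError("quality scores must be non-negative")
    total = sum(scores)
    if total == 0:
        # No participant has a positive score: fall back to uniform weights.
        return [1.0 / len(scores)] * len(scores)
    return [s / total for s in scores]


def aggregate(client_params: List[Params], scores: List[float]) -> Params:
    """Server-side weighted average of the participants' parameters."""
    if len(client_params) != len(scores):
        raise ValueError("need exactly one score per participant")
    weights = quality_weights(scores)
    aggregated: Params = {}
    for name, reference in client_params[0].items():
        aggregated[name] = [
            sum(w * params[name][i] for w, params in zip(weights, client_params))
            for i in range(len(reference))
        ]
    return aggregated


if __name__ == "__main__":
    # Two hypothetical participants; the first has the higher quality score.
    clients = [{"w": [1.0, 2.0]}, {"w": [5.0, 6.0]}]
    print(aggregate(clients, scores=[3.0, 1.0]))
```

With scores 3 and 1, the first participant contributes three quarters of each aggregated parameter, so a participant with better data quality performance pulls the global model further toward its local update. Setting all scores equal gives an unweighted average of the participants' parameters.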
Pages: 7