Data heterogeneous federated learning algorithm for industrial entity extraction

被引:2
|
作者
Fu, Shengze [1 ]
Zhao, Xiaoli [1 ]
Yang, Chi [1 ]
Fang, Zhijun [2 ]
机构
[1] Shanghai Univ Engn Sci, Coll Elect & Elect Engn, Shanghai 201600, Peoples R China
[2] Donghua Univ, Sch Comp Sci & technol, Shanghai, Peoples R China
关键词
Entity extraction; Federated learning; Non-IID; Data quality performance; BLIND QUALITY ASSESSMENT;
D O I
10.1016/j.displa.2023.102504
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Entity extraction is an important part to realize digital transformation in the industrial field. Building an entity extraction model in the industrial field requires a lot of data. The parties in industry often cannot share data due to commercial competition and security and privacy issues, thus forming "Data Island". Federated learning provides a solution to this problem. Federated learning is a distributed machine learning framework that allows each party to train locally and independently using their own private data. The model parameters or gradient information of each party will be aggregated to the central server, thus forming a model jointly trained by all parties. This approach can not only protect the security and privacy of data from all parties, but also fully utilize their data resources. Federated learning can effectively solve the problem of data island, but it still faces some problems and challenges, among which the most typical problem is data heterogeneity. To address the data islanding problem and data heterogeneity problem faced by industrial entity extraction, this paper uses a federated learning framework to solve the data islanding problem and proposes the FedDP algorithm. This algorithm assigns weights based on the data quality performance of each participant. Participants with relatively good data quality performance have higher weights in the aggregation stage, while participants with relatively poor data quality performance have lower weights in the aggregation stage, thus optimizing the performance of federated learning in heterogeneous data scenarios.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] HFedMS: Heterogeneous Federated Learning With Memorable Data Semantics in Industrial Metaverse
    Zeng, Shenglai
    Li, Zonghang
    Yu, Hongfang
    Zhang, Zhihao
    Luo, Long
    Li, Bo
    Niyato, Dusit
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (03) : 3055 - 3069
  • [2] Fair Federated Learning for Heterogeneous Data
    Kanaparthy, Samhita
    Padala, Manisha
    Damle, Sankarshan
    Gujar, Sujit
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 298 - 299
  • [3] High-performance federated continual learning algorithm for heterogeneous streaming data
    Jiang H.
    He T.
    Liu M.
    Sun S.
    Wang Y.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (05): : 123 - 136
  • [4] FedCPD: A Federated Learning Algorithm for Processing and Securing Distributed Heterogeneous Data in the Metaverse
    Sun, Le
    Zhang, Zhimeng
    Muhammad, Ghulam
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 5540 - 5551
  • [5] A multi-modal heterogeneous data mining algorithm using federated learning
    Wei, Xianyong
    Journal of Engineering, 2021, 2021 (08): : 458 - 466
  • [6] Multi-source Heterogeneous Data Fusion Algorithm Based on Federated Learning
    Zhou, Jincheng
    Lei, Yang
    SOFT COMPUTING IN DATA SCIENCE, SCDS 2023, 2023, 1771 : 46 - 60
  • [7] A federated learning differential privacy algorithm for non-Gaussian heterogeneous data
    Yang, Xinyu
    Wu, Weisan
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [8] A federated learning differential privacy algorithm for non-Gaussian heterogeneous data
    Xinyu Yang
    Weisan Wu
    Scientific Reports, 13
  • [9] A multi-modal heterogeneous data mining algorithm using federated learning
    Wei, Xianyong
    JOURNAL OF ENGINEERING-JOE, 2021, 2021 (08): : 458 - 466
  • [10] Federated learning with incremental clustering for heterogeneous data
    Espinoza Castellon, Fabiola
    Mayoue, Aurelien
    Sublemontier, Jacques-Henri
    Gouy-Pailler, Cedric
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,