A Spark-based parallel genetic algorithm for Bayesian network structure learning

被引:0
|
作者
Wu, Naixin [1 ]
机构
[1] Wuxi Inst Technol, Informat Ctr, Wuxi 214121, Jiangsu, Peoples R China
关键词
Bayesian networks; structure learning; genetic algorithm; parallel; BIC score; learning accuracy;
D O I
10.1504/IJCSM.2024.140876
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The Bayesian network structure learning (BNSL) algorithm based on genetic algorithm (GA) has the problem of long search time and being prone to falling into local optima. When the sampling data is large, the single machine BNSL algorithm cannot obtain the BN structure within a limited time. To address this issue, this paper proposes a parallel BNSL algorithm based on the Spark framework with GA (PGA-BN). The three main stages of the proposed PGABN are population initialisation, BIC score calculation, and evolution operators, which are all designed in parallel on each partition to accelerate based on Spark. The experiments are studied on two typical BN datasets with different sample sizes to evaluate the parallel performance of the PGA-BN algorithm. Experimental results showed that the PGA-BN is significantly faster than its single-machine version with the satisfied accuracy.
引用
收藏
页码:109 / 117
页数:10
相关论文
共 50 条
  • [31] Parallel and Distributed Bayesian Network Structure Learning
    Yang, Jian
    Jiang, Jiantong
    Wen, Zeyi
    Mian, Ajmal
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (04) : 517 - 530
  • [32] ASCF: Optimization of the Apriori Algorithm Using Spark-Based Cuckoo Filter Structure
    Alrahwan, Bana Ahmad
    Farouk, Mona
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
  • [33] Research on learning Bayesian network structure based on genetic algorithms
    Liu, D.Y.
    Wang, F.
    Lu, Y.N.
    Xue, W.X.
    Wang, S.X.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2001, 38 (08):
  • [34] Bayesian network structure learning based on cuckoo search algorithm
    Askari, Mahbobe Bani Asad
    Ahsaee, Mostafa Ghazizadeh
    2018 6TH IRANIAN JOINT CONGRESS ON FUZZY AND INTELLIGENT SYSTEMS (CFIS), 2018, : 127 - 130
  • [35] A Spark-based Apriori algorithm with reduced shuffle overhead
    Shashi Raj
    Dharavath Ramesh
    Krishan Kumar Sethi
    The Journal of Supercomputing, 2021, 77 : 133 - 151
  • [36] A Spark-based Incremental Algorithm for Frequent Itemset Mining
    Wen, Haoxing
    Li, Xiaoguang
    Kou, Mingdong
    Tou, Huaixiao
    He, Hengyi
    Yang, Yulu
    BDIOT 2018: PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS, 2018, : 53 - 58
  • [37] LEARNING BAYESIAN NETWORK BY GENETIC ALGORITHM USING STRUCTURE-PARAMETER RESTRICTIONS
    Zhang, Chongyang
    Cao, Ming
    Peng, Biao
    Zheng, Shibao
    ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,
  • [38] Leveraging spark-based machine learning algorithm for audience sentiment analysis in youtube content
    Subha, K.
    Bharathi, N.
    INTELLIGENT DATA ANALYSIS, 2024, 28 (05) : 1395 - 1405
  • [39] Spark-based ensemble learning for imbalanced data classification
    Ding J.
    Wang S.
    Jia L.
    You J.
    Jiang Y.
    International Journal of Performability Engineering, 2018, 14 (05) : 945 - 964
  • [40] A Spark-based Apriori algorithm with reduced shuffle overhead
    Raj, Shashi
    Ramesh, Dharavath
    Sethi, Krishan Kumar
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (01): : 133 - 151