A Spark-based parallel genetic algorithm for Bayesian network structure learning

被引:0
|
作者
Wu, Naixin [1 ]
机构
[1] Wuxi Inst Technol, Informat Ctr, Wuxi 214121, Jiangsu, Peoples R China
关键词
Bayesian networks; structure learning; genetic algorithm; parallel; BIC score; learning accuracy;
D O I
10.1504/IJCSM.2024.140876
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The Bayesian network structure learning (BNSL) algorithm based on genetic algorithm (GA) has the problem of long search time and being prone to falling into local optima. When the sampling data is large, the single machine BNSL algorithm cannot obtain the BN structure within a limited time. To address this issue, this paper proposes a parallel BNSL algorithm based on the Spark framework with GA (PGA-BN). The three main stages of the proposed PGABN are population initialisation, BIC score calculation, and evolution operators, which are all designed in parallel on each partition to accelerate based on Spark. The experiments are studied on two typical BN datasets with different sample sizes to evaluate the parallel performance of the PGA-BN algorithm. Experimental results showed that the PGA-BN is significantly faster than its single-machine version with the satisfied accuracy.
引用
收藏
页码:109 / 117
页数:10
相关论文
共 50 条
  • [41] Spark-Based Monte Carlo Parallel Simulation Approach for FTA
    Liu, Yan
    Ren, Yi
    Liu, Linlin
    Li, Zhifeng
    PROCEEDINGS OF THE 2015 FIRST INTERNATIONAL CONFERENCE ON RELIABILITY SYSTEMS ENGINEERING 2015 ICRSE, 2015,
  • [42] A Spark-based parallel framework for geospatial raster data processing
    Gao, Fan
    Yue, Peng
    2018 7TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS (AGRO-GEOINFORMATICS), 2018, : 53 - 56
  • [43] Integrating expert knowledge for Bayesian network structure learning based on intuitionistic fuzzy set and Genetic Algorithm
    Li, Hui
    Wang, Fuli
    Li, Hongru
    INTELLIGENT DATA ANALYSIS, 2019, 23 (01) : 41 - 56
  • [44] A Spark-Based Parallel Fuzzy c-Means Segmentation Algorithm for Agricultural Image Big Data
    Liu, Bin
    He, Songrui
    He, Dongjian
    Zhang, Yin
    Guizani, Mohsen
    IEEE ACCESS, 2019, 7 : 42169 - 42180
  • [45] A Spark-based genetic algorithm for sensor placement in large scale drinking water distribution systems
    Chengyu Hu
    Guo Ren
    Chao Liu
    Ming Li
    Wei Jie
    Cluster Computing, 2017, 20 : 1089 - 1099
  • [46] Learning Bayesian network structure with immune algorithm
    Cai, Zhiqiang
    Si, Shubin
    Sun, Shudong
    Dui, Hongyan
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2015, 26 (02) : 282 - 291
  • [47] A hybrid algorithm for Bayesian network structure learning
    Ji, Junzhong
    Hu, Renbing
    Zhang, Hongxun
    Liu, Chunnian
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2009, 46 (09): : 1498 - 1507
  • [48] Learning Bayesian network structure with immune algorithm
    Zhiqiang Cai
    Shubin Si
    Shudong Sun
    Hongyan Dui
    Journal of Systems Engineering and Electronics, 2015, 26 (02) : 282 - 291
  • [49] A Spark-based genetic algorithm for sensor placement in large scale drinking water distribution systems
    Hu, Chengyu
    Ren, Guo
    Liu, Chao
    Li, Ming
    Jie, Wei
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (02): : 1089 - 1099
  • [50] An efficient Bayesian network structure learning algorithm based on structural information
    Fang, Wei
    Zhang, Weijian
    Ma, Li
    Wu, Yunlin
    Yan, Kefei
    Lu, Hengyang
    Sun, Jun
    Wu, Xiaojun
    Yuan, Bo
    SWARM AND EVOLUTIONARY COMPUTATION, 2023, 76