Parallelization of FP-growth Algorithm for Mining Probabilistic Numerical Data based on MapReduce

被引:0
|
作者
Pei, Bin [1 ]
Wang, Xiuzhen [1 ]
Wang, Fenmei [1 ]
机构
[1] New Star Res Inst Appl Technol, Comp Res & Teaching Sect, Hefei, Peoples R China
关键词
FP-growth Algorithm; MapReduce; parallelization; probabilistic data; FUZZY ASSOCIATION RULES;
D O I
10.1109/ISCID.2016.165
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many association rule mining algorithms find associations and correlations from traditional transaction databases, in which the content of each transaction is definitely precise. However, due to instrument errors, imprecise of sensor monitoring systems, and so on, real-world data tend to be numerical data with inherent uncertainty. To deal with these situations, we propose a FP-growth-based mining algorithm PNFP-growth to efficiently find association rules from probabilistic numerical data, where each numerical item in the transactions is associated with an existential probability. In addition, to deal with big data situation, we also introduce a parallelized PNFPGrowth in the MapReduce framework, which scales well with the size of the dataset while minimizing data replication and communication cost.
引用
收藏
页码:223 / 226
页数:4
相关论文
共 50 条
  • [1] FP-growth algorithm based on Boolean matrix and MapReduce
    College of Computer, Sichuan University, Chengdu 610065, Sichuan, China
    Huanan Ligong Daxue Xuebao, 1 (135-141):
  • [2] The Research and Improvement Based on FP-Growth Data Mining Algorithm
    Yao, Quanzhu
    Gao, Xingxing
    Lei, Xueli
    Zhang, Tong
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON MODELING, SIMULATION AND OPTIMIZATION TECHNOLOGIES AND APPLICATIONS (MSOTA2016), 2016, 58 : 287 - 293
  • [3] The Study of Improved FP-Growth Algorithm in MapReduce
    Sun, Hong
    Zhang, Huaxuan
    Chen, Shiping
    Hu, Chunyan
    PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON CLOUD COMPUTING AND INFORMATION SECURITY (CCIS 2013), 2013, 52 : 250 - 253
  • [4] A Novel Incremental Data Mining Algorithm based on FP-Growth for Big Data
    Chang, Hong-Yi
    Lin, Jia-Chi
    Cheng, Mei-Li
    Huang, Shih-Chang
    PROCEEDINGS 2016 INTERNATIONAL CONFERENCE ON NETWORKING AND NETWORK APPLICATIONS NANA 2016, 2016, : 375 - 378
  • [5] An optimized fuzzy based FP-growth algorithm for mining temporal data
    Kumar, B. Praveen
    Padmavathy, T.
    Muthunagai, S. U.
    Paulraj, D.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (01) : 41 - 51
  • [6] AN OPTIMISED FP-GROWTH ALGORITHM USING MAPREDUCE PARADIGMS
    Wu, Xiuguo
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2021, 22 (10) : 2127 - 2137
  • [7] Deep Parallelization of Parallel FP-Growth Using Parent-Child MapReduce
    Makanju, Adetokunbo
    Farzanyar, Zahra
    An, Aijun
    Cercone, Nick
    Hu, Zane Zhenhua
    Hu, Yonggang
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1422 - 1431
  • [8] A heuristic approach for load balancing the FP-growth algorithm on MapReduce
    Bagui, Sikha
    Devulapalli, Keerthi
    Coffey, John
    Bagui, Sikha (bagui@uwf.edu), 1600, Elsevier B.V. (07):
  • [9] Incremental FP-Growth Mining Strategy for Dynamic Threshold Value and Database Based on MapReduce
    Wei, Xiaoting
    Ma, Yunlong
    Zhang, Feng
    Liu, Min
    Shen, Weiming
    PROCEEDINGS OF THE 2014 IEEE 18TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2014, : 271 - 276
  • [10] Paths sharing based FP-Growth data mining algorithms
    Ji, Shandong
    Zhang, Dengyin
    Zhang, Liu
    2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,