Map-optimize-reduce: CAN tree assisted FP-growth algorithm for clusters based FP mining on Hadoop

被引:21
|
作者
Ragaventhiran, J. [1 ]
Kavithadevi, M. K. [2 ]
机构
[1] Syed Ammal Engn Coll, Dept CSE, Ramanathapuram, India
[2] Thiagarajar Coll Engn, Dept CSE, Madurai, Tamil Nadu, India
关键词
Frequent pattern mining; Map-optimize-reduce; Clustering; Load balancing; CAN tree based FP growth; User query; FREQUENT PATTERNS; SEQUENTIAL PATTERNS; SKEWED DATA; MAPREDUCE;
D O I
10.1016/j.future.2019.09.041
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Over the past era, Frequent Pattern Mining (FPM) is emerging as a significant approach to discover fascinating knowledge concealed in the data. However, preceding works failed to address the validation of FPM with user queries and also achieving better scalability and execution time is still bottleneck owing to difficulties in handling large dataset. To address this downside, our proposed work establishes FPM using extend version of MapReduce framework in Hadoop environment. Our proposed work comprises of five processes that are: 1) Preprocessing 2) Affinity Propagation (AP) based Clustering 3) Load Balancing 4) Map-Optimize-Reduce 5) Mining User Queries. Primarily, our proposed work performs preprocessing to remove data redundancy. To speed up the MapReduce framework, we propose AP clustering which generates effective clusters from the given dataset. Load balancing is executed to balance load among different blocks concerning where reputation is computed. To avoid oversight in scanning and minimal searching space in MapReduce, optimizer is included between Mapper and Reducer where Emperor Penguin Colony (EPC) optimization is used. Frequent patterns are mined using CANonical order (CAN) tree based Frequent Pattern (FP) growth which reduces execution time and frequent tree construction. User provides Mining_Request to the Hadoop and frequent patterns are mined for given query which is send back to the user. If user given query is not present in the CAN tree, then it sends Relevance Feedback as a recommendation to the user. Finally, we validate our proposed work performance with the previous works for succeeding metrics that are Execution Time, Response Time, Load Balancing Rate, and Scalability. (C) 2019 Published by Elsevier B.V.
引用
收藏
页码:111 / 122
页数:12
相关论文
共 50 条
  • [21] Improvement of FP-Growth Algorithm for Mining Description-Oriented Rules
    Gruca, Aleksandra
    MAN-MACHINE INTERACTIONS 3, 2014, 242 : 183 - 192
  • [22] An Improved Association Rule Mining Algorithm Based on Ant Lion Optimizer Algorithm and FP-Growth
    Dong, Dawei
    Ye, Zhiwei
    Cao, Yu
    Xie, Shiwei
    Wang, Fengwen
    Ming, Wei
    PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 1, 2019, : 458 - 463
  • [23] A Power Load Association Rules Mining Method Based on Improved FP-Growth Algorithm
    Wang, Ze-Zhong
    Cao, Shuo
    2018 CHINA INTERNATIONAL CONFERENCE ON ELECTRICITY DISTRIBUTION (CICED), 2018, : 2833 - 2837
  • [24] Correlation Failure Analysis Based on the Improved FP-Growth Algorithm
    Wang, Yanhui
    Wang, Shujun
    Lin, Shuai
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES FOR RAIL TRANSPORTATION: TRANSPORTATION, 2016, 378 : 135 - 145
  • [25] Improvement and research of FP-growth algorithm based on distributed spark
    Deng Lingling
    Lou Yuansheng
    2015 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2015, : 105 - 108
  • [26] Research on Association Rules Parallel Algorithm Based on FP-Growth
    Chen, Ke
    Zhang, Lijun
    Li, Sansi
    Ke, Wende
    INFORMATION COMPUTING AND APPLICATIONS, PT II, 2011, 244 : 249 - +
  • [27] Chinese Document Keyword Extraction Algorithm based on FP-Growth
    Zhao, Meng
    Yu, Wanjun
    Lu, Wenjing
    Liu, Quan
    Li, Jinxiao
    2016 INTERNATIONAL CONFERENCE ON SMART CITY AND SYSTEMS ENGINEERING (ICSCSE), 2016, : 202 - 205
  • [28] THE EVALUATION OF ETHNIC COSTUME COURSES BASED ON FP-GROWTH ALGORITHM
    Xu, Rui
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (01): : 313 - 326
  • [29] An Empirical Analysis and Comparison of Apriori and FP-Growth Algorithm for Frequent Pattern Mining
    Singh, Avadh Kishor
    Kumar, Ajeet
    Maurya, Ashish K.
    2014 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2014, : 1599 - 1602
  • [30] Mining research on correlation factors of residential electricity stability based on improved FP-growth algorithm
    Pan, Hua
    Liu, Rong
    MANAGEMENT OF ENVIRONMENTAL QUALITY, 2024, 35 (03) : 547 - 566