A MapReduce-based Fuzzy Associative Classifier for Big Data

被引:0
|
作者
Ducange, Pietro [1 ]
Marcelloni, Francesco [2 ]
Segatori, Armando [2 ]
机构
[1] ECampus Univ, Fac Ingn, I-22060 Novedrate, Italy
[2] Univ Pisa, Dip Ingn Informaz, I-56122 Pisa, Italy
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose an efficient distributed fuzzy associative classification model based on the MapReduce paradigm. The learning algorithm first mines a set of fuzzy association classification rules by employing a distributed version of a fuzzy extension of the well-known FP-Growth algorithm. Then, it prunes this set by using three purposely adapted types of pruning. We implemented the distributed fuzzy associative classifier using the Hadoop framework. We show the scalability of our approach by carrying out a number of experiments on a real-world big dataset. In particular, we evaluate the achievable speedup on a small computer cluster, highlighting that the proposed approach allows handling big datasets even with modest hardware support.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] LandQυ2: A MapReduce-Based System for Processing Arable Land Quality Big Data
    Yao, Xiaochuang
    Mokbel, Mohamed E.
    Ye, Sijing
    Li, Guoqing
    Alarabi, Louai
    Eldawy, Ahmed
    Zhao, Zuliang
    Zhao, Long
    Zhu, Dehai
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2018, 7 (07)
  • [32] MapReduce-Based Bayesian Automatic Text Classifier Used in Digital Library
    Niu, Zhen
    Yin, Zelong
    Cui, Huayang
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2012, 316 : 121 - 126
  • [33] Analysis of microarray leukemia data using an efficient MapReduce-based K-nearest-neighbor classifier
    Kumar, Mukesh
    Rath, Nitish Kumar
    Rath, Santanu Kumar
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 60 : 395 - 409
  • [34] Gaussian relevance vector MapReduce-based annealed Glowworm optimization for big medical data scheduling
    Patan, Rizwan
    Kallam, Suresh
    Gandomi, Amir H.
    Hanne, Thomas
    Ramachandran, Manikandan
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2022, 73 (10) : 2204 - 2215
  • [35] A MapReduce-Based Nearest Neighbor Approach for Big-Data-Driven Traffic Flow Prediction
    Xia, Dawen
    Li, Huaqing
    Wang, Binfeng
    Li, Yantao
    Zhang, Zili
    IEEE ACCESS, 2016, 4 : 2920 - 2934
  • [36] A MapReduce-Based Big Spatial Data Framework for Solving the Problem of Covering a Polygon with Orthogonal Rectangles
    Eken, Suleyman
    Sayar, Ahmet
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (01): : 36 - 42
  • [37] Knowledge Extraction from Big Data using MapReduce-based Parallel-Reduct Algorithm
    Chowdhury, Tapan
    Chakraborty, Susanta
    Setua, S. K.
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2016, : 240 - 246
  • [38] CloudEC: A MapReduce-based Algorithm for Correcting Errors in Next-generation Sequencing Big Data
    Chung, Wei-Chun
    Ho, Jan-Ming
    Lin, Chung-Yen
    Lee, D. T.
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 2836 - 2842
  • [39] MapReduce-based Parallel Algorithms for Multidimensionnal Data Analysis
    Pan, Jie
    Magoules, Frederic
    Le Biannic, Yann
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2012, 6 (02) : 325 - 350
  • [40] A Mapreduce Fuzzy Techniques of Big Data Classification
    El Bakry, Malak
    Safwat, Soha
    Hegazy, Osman
    PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 118 - 128