An overview of recent distributed algorithms for learning fuzzy models in Big Data classification

Cited by: 15
Authors
Ducange, Pietro [1 ]
Fazzolari, Michela [2 ]
Marcelloni, Francesco [1 ]
Affiliations
[1] Dipartimento Ingn Informaz, Largo Lucio Lazzarino 1, I-56122 Pisa, Italy
[2] CNR, Ist Informat & Telemat, Via Giuseppe Moruzzi 1, I-56124 Pisa, Italy
Keywords
Big Data; Fuzzy models; Data mining; Classification algorithms; Distributed computing; MULTIOBJECTIVE EVOLUTIONARY APPROACH; ASSOCIATIVE CLASSIFICATION; CLUSTERING-ALGORITHM; SYSTEMS; MAPREDUCE; ANALYTICS; DESIGN; GRANULARITY; CLASSIFIERS; SELECTION;
DOI
10.1186/s40537-020-00298-6
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
Nowadays, huge amounts of data are generated, often in very short time intervals and in various formats, by a number of heterogeneous sources such as social networks and media, mobile devices, internet transactions, and networked devices and sensors. These data, identified as Big Data in the literature, are characterized by the popular Vs features, such as Value, Veracity, Variety, Velocity and Volume. In particular, Value focuses on the useful knowledge that may be mined from data. Thus, in recent years, a number of data mining and machine learning algorithms have been proposed to extract knowledge from Big Data. These algorithms have generally been implemented using ad hoc programming paradigms, such as MapReduce, on specific distributed computing frameworks, such as Apache Hadoop and Apache Spark. In the context of Big Data, fuzzy models currently play a significant role, thanks to their capability of handling vague and imprecise data and their inherent interpretability. In this work, we give an overview of the most recent distributed learning algorithms for generating fuzzy classification models for Big Data. In particular, we first show some design and implementation details of these learning algorithms. Thereafter, we compare them in terms of accuracy and interpretability. Finally, we discuss their scalability.
Pages: 29
Related papers
50 in total
  • [1] An overview of recent distributed algorithms for learning fuzzy models in Big Data classification
    Pietro Ducange
    Michela Fazzolari
    Francesco Marcelloni
    Journal of Big Data, 7
  • [2] A STUDY ON THE ERROR OF DISTRIBUTED ALGORITHMS FOR BIG DATA CLASSIFICATION WITH SVM
    Wang, Cheng
    Cao, Feilong
    ANZIAM JOURNAL, 2017, 58 (3-4): 231-237
  • [3] Reliable Distributed Fuzzy Discretizer for Associative Classification of Big Data
    Pushparani, Hepzi Jeya
    Goldena, Nancy Jasmine
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2022, 12 (01)
  • [4] Models and algorithms for classifying big data based on distributed data streams
    Mao G.-J.
    Hu D.-J.
    Xie S.-Y.
    1600, Science Press (40): 161-175
  • [5] Distributed Fuzzy Cognitive Maps for Feature Selection in Big Data Classification
    Haritha, K.
    Judy, M. V.
    Papageorgiou, Konstantinos
    Georgiannis, Vassilis C.
    Papageorgiou, Elpiniki
    ALGORITHMS, 2022, 15 (10)
  • [6] Comparison of machine learning algorithms for classification of Big Data sets
    Singh, Barkha
    Indu, Sreedevi
    Majumdar, Sudipta
    THEORETICAL COMPUTER SCIENCE, 2025, 1024
  • [7] Parallel and Distributed Machine Learning Algorithms for Scalable Big Data Analytics
    Bal, Henri
    Pal, Arindam
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 108: 1159-1161
  • [8] Consensus Learning for Distributed Fuzzy Neural Network in Big Data Environment
    Shi, Ye
    Lin, Chin-Teng
    Chang, Yu-Cheng
    Ding, Weiping
    Shi, Yuhui
    Yao, Xin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (01): 29-41
  • [9] Performance Analysis of Machine Learning Algorithms for Big Data Classification: ML and AI-Based Algorithms for Big Data Analysis
    Punia, Sanjeev Kumar
    Kumar, Manoj
    Stephan, Thompson
    Deverajan, Ganesh Gopal
    Patan, Rizwan
    INTERNATIONAL JOURNAL OF E-HEALTH AND MEDICAL COMMUNICATIONS, 2021, 12 (04): 60-75
  • [10] Incremental fuzzy learning algorithms in big data problems: a study on the size of learning subsets
    Romero-Zaliz, Rocio
    Gonzalez, Antonio
    Perez, Raul
    2017 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2017