An overview of recent distributed algorithms for learning fuzzy models in Big Data classification

被引:15
|
作者
Ducange, Pietro [1 ]
Fazzolari, Michela [2 ]
Marcelloni, Francesco [1 ]
机构
[1] Dipartimento Ingn Informaz, Largo Lucio Lazzarino 1, I-56122 Pisa, Italy
[2] CNR, Ist Informat & Telemat, Via Giuseppe Moruzzi 1, I-56124 Pisa, Italy
关键词
Big Data; Fuzzy models; Data mining; Classification algorithms; Distributed computing; MULTIOBJECTIVE EVOLUTIONARY APPROACH; ASSOCIATIVE CLASSIFICATION; CLUSTERING-ALGORITHM; SYSTEMS; MAPREDUCE; ANALYTICS; DESIGN; GRANULARITY; CLASSIFIERS; SELECTION;
D O I
10.1186/s40537-020-00298-6
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nowadays, a huge amount of data are generated, often in very short time intervals and in various formats, by a number of different heterogeneous sources such as social networks and media, mobile devices, internet transactions, networked devices and sensors. These data, identified as Big Data in the literature, are characterized by the popular Vs features, such as Value, Veracity, Variety, Velocity and Volume. In particular, Value focuses on the useful knowledge that may be mined from data. Thus, in the last years, a number of data mining and machine learning algorithms have been proposed to extract knowledge from Big Data. These algorithms have been generally implemented by using ad-hoc programming paradigms, such as MapReduce, on specific distributed computing frameworks, such as Apache Hadoop and Apache Spark. In the context of Big Data, fuzzy models are currently playing a significant role, thanks to their capability of handling vague and imprecise data and their innate characteristic to be interpretable. In this work, we give an overview of the most recent distributed learning algorithms for generating fuzzy classification models for Big Data. In particular, we first show some design and implementation details of these learning algorithms. Thereafter, we compare them in terms of accuracy and interpretability. Finally, we argue about their scalability.
引用
收藏
页数:29
相关论文
共 50 条
  • [41] Runtime prediction of big data jobs: performance comparison of machine learning algorithms and analytical models
    Ahmed, Nasim
    Barczak, Andre L. C.
    Rashid, Mohammad A.
    Susnjak, Teo
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [42] A Study of Recent Classification Algorithms and a Novel Approach for EEG Data Classification
    Cinar, Eyup
    Sahin, Ferat
    IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010, : 3366 - 3372
  • [43] A Review of Distributed Data Models for Learning
    Angel Rodriguez, Miguel
    Fernandez, Alberto
    Peregrin, Antonio
    Herrera, Francisco
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2017, 2017, 10334 : 88 - 97
  • [44] Machine learning algorithms for oncology big data treatment
    Mohammed, Zouiten
    ICCWCS'17: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND WIRELESS COMMUNICATION SYSTEMS, 2017,
  • [45] Streaming Machine Learning Algorithms with Big Data Systems
    Abeykoon, Vibhatha
    Kamburugamuve, Supun
    Govindrarajan, Kannan
    Wickramasinghe, Pulasthi
    Widanage, Chathura
    Perera, Niranda
    Uyar, Ahmet
    Gunduz, Gurhan
    Akkas, Selahattin
    Von Laszewski, Gregor
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 5661 - 5666
  • [46] CLASSIFICATION ALGORITHMS FOR BIG DATA ANALYSIS, A MAP REDUCE APPROACH
    Ayma, V. A.
    Ferreira, R. S.
    Happ, P.
    Oliveira, D.
    Feitosaa, R.
    Costa, G.
    Plaza, A.
    Gamba, P.
    PIA15+HRIGI15 - JOINT ISPRS CONFERENCE, VOL. I, 2015, 40-3 (W2): : 17 - 21
  • [47] Online learning algorithms for big data analytics: A survey
    Li, Zhijie
    Li, Yuanxiang
    Wang, Feng
    He, Guoliang
    Kuang, Li
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (08): : 1707 - 1721
  • [48] A SURVEY OF MACHINE LEARNING ALGORITHMS FOR BIG DATA ANALYTICS
    Athmaja, S.
    Hanumanthappa, M.
    Kavitha, Vasantha
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [49] Attribute-Distributed Learning: Models, Limits, and Algorithms
    Zheng, Haipeng
    Kulkarni, Sanjeev R.
    Poor, H. Vincent
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (01) : 386 - 398
  • [50] Comparison of Machine Learning Algorithms in Data classification
    ul Hassan, Ch Anwar
    Khan, Muhammad Sufyan
    Shah, Munam Ali
    2018 24TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC' 18), 2018, : 270 - 275