A MapReduce-Based ELM for Regression in Big Data

被引:2
|
作者
Wu, B. [1 ]
Yan, T. H. [1 ]
Xu, X. S. [1 ]
He, B. [2 ,3 ]
Li, W. H. [4 ]
机构
[1] China Jiliang Univ, Sch Mech & Elect Engn, Hangzhou 310018, Zhejiang, Peoples R China
[2] Ocean Univ China, Sch Informat Sci, Qingdao 266100, Peoples R China
[3] Ocean Univ China, Engn Coll, Qingdao 266100, Peoples R China
[4] Univ Wollongong, Sch Mech Mat & Mechatron Engn, Wollongong, NSW, Australia
关键词
ELM; Regression; Machine learning; Mapreduce; Big data; EXTREME LEARNING-MACHINE; CLASSIFICATION;
D O I
10.1007/978-3-319-46257-8_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Regression is one of the most basic problems in machine learning. In big data era, for regression problem, extreme learning machine (ELM) can get better generalization performance and much fast training speed. However, the enlarging volume of dataset for training makes regression by ELM a challenging task, and it is hard to finish the training in a reasonable time or it will be out of memory. In this paper, through analyzing the theory of ELM, a MapReduce-Based ELM method is proposed. Under the MapReduce framework, ELM submodels are trained in every slave node parallelly. A combination method is designed to combine all the submodels as a complete model. The experiment results demonstrate that the MapReduce-Based ELM can efficient process big dataset on commodity hardware and it has a good performance on speedup under the cloud environment where the dataset is stored as data block in different machines.
引用
收藏
页码:164 / 173
页数:10
相关论文
共 50 条
  • [41] Tri-training and MapReduce-based massive data learning
    Guo, Mao-Zu
    Deng, Chao
    Liu, Yang
    Li, Ping
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2011, 40 (04) : 355 - 380
  • [42] The HiBench Benchmark Suite: Characterization of the MapReduce-Based Data Analysis
    Huang, Shengsheng
    Huang, Jie
    Dai, Jinquan
    Xie, Tao
    Huang, Bo
    NEW FRONTIERS IN INFORMATION AND SOFTWARE AS SERVICES: SERVICE AND APPLICATION DESIGN CHALLENGES IN THE CLOUD, 2011, 74 : 209 - 228
  • [43] A MapReduce-Based Parallel Frequent Pattern Growth Algorithm for Spatiotemporal Association Analysis of Mobile Trajectory Big Data
    Xia, Dawen
    Lu, Xiaonan
    Li, Huaqing
    Wang, Wendong
    Li, Yantao
    Zhang, Zili
    COMPLEXITY, 2018,
  • [44] Analysis of the Big Data based on MapReduce
    Tian, Zi-de
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 224 - 228
  • [45] MapReduce-Based Warehouse Systems: A Survey
    Sureshrao, Gore Sumit
    Ambulgekar, H. P.
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ENGINEERING AND TECHNOLOGY RESEARCH (ICAETR), 2014,
  • [46] MapReduce-based big data classification model using feature subset selection and hyperparameter tuned deep belief network
    Rajendran, Surendran
    Khalaf, Osamah Ibrahim
    Alotaibi, Youseef
    Alghamdi, Saleh
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [47] A Demonstration of Shahed: A MapReduce-based System for Querying and Visualizing Satellite Data
    Eldawy, Ahmed
    Alharthi, Saif
    Alzaidy, Abdulhadi
    Daghistani, Anas
    Ghani, Sohaib
    Basalamah, Saleh
    Mokbel, Mohamed F.
    2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1444 - 1447
  • [48] MapReduce-based big data classification model using feature subset selection and hyperparameter tuned deep belief network
    Surendran Rajendran
    Osamah Ibrahim Khalaf
    Youseef Alotaibi
    Saleh Alghamdi
    Scientific Reports, 11
  • [49] MapReduce-based Parallelized Approximation of Frequent Itemsets Mining in Uncertain Data
    Xu, Jing
    Mao, Xiao-Jiao
    Lu, Wen-Yang
    Zhu, Qi-Hai
    Li, Ning
    Yang, Yu-Bin
    NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 136 - 144
  • [50] A MapReduce-based improvement algorithm for DBSCAN
    Hu, Xiaojuan
    Liu, Lei
    Qiu, Ningjia
    Yang, Di
    Li, Meng
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2018, 12 (01) : 53 - 61