A MapReduce-Based ELM for Regression in Big Data

被引:2
|
作者
Wu, B. [1 ]
Yan, T. H. [1 ]
Xu, X. S. [1 ]
He, B. [2 ,3 ]
Li, W. H. [4 ]
机构
[1] China Jiliang Univ, Sch Mech & Elect Engn, Hangzhou 310018, Zhejiang, Peoples R China
[2] Ocean Univ China, Sch Informat Sci, Qingdao 266100, Peoples R China
[3] Ocean Univ China, Engn Coll, Qingdao 266100, Peoples R China
[4] Univ Wollongong, Sch Mech Mat & Mechatron Engn, Wollongong, NSW, Australia
关键词
ELM; Regression; Machine learning; Mapreduce; Big data; EXTREME LEARNING-MACHINE; CLASSIFICATION;
D O I
10.1007/978-3-319-46257-8_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Regression is one of the most basic problems in machine learning. In big data era, for regression problem, extreme learning machine (ELM) can get better generalization performance and much fast training speed. However, the enlarging volume of dataset for training makes regression by ELM a challenging task, and it is hard to finish the training in a reasonable time or it will be out of memory. In this paper, through analyzing the theory of ELM, a MapReduce-Based ELM method is proposed. Under the MapReduce framework, ELM submodels are trained in every slave node parallelly. A combination method is designed to combine all the submodels as a complete model. The experiment results demonstrate that the MapReduce-Based ELM can efficient process big dataset on commodity hardware and it has a good performance on speedup under the cloud environment where the dataset is stored as data block in different machines.
引用
收藏
页码:164 / 173
页数:10
相关论文
共 50 条
  • [21] MapReduce-based Parallel Linear Regression for Face Recognition
    Zhang, LiSheng
    Liu, HuaYong
    Lei, DaJiang
    MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 2628 - 2632
  • [22] Scaling up MapReduce-based Big Data Processing on Multi-GPU systems
    Jiang, Hai
    Chen, Yi
    Qiao, Zhi
    Weng, Tien-Hsiung
    Li, Kuan-Ching
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (01): : 369 - 383
  • [23] Distributed Big Data Clustering using MapReduce-based Fuzzy C-Medoids
    Sardar T.H.
    Ansari Z.
    Journal of The Institution of Engineers (India): Series B, 2022, 103 (01) : 73 - 82
  • [24] MapReduce-Based Complex Big Data Analytics over Uncertain and Imprecise Social Networks
    Braun, Peter
    Cuzzocrea, Alfredo
    Jiang, Fan
    Leung, Carson Kai-Sang
    Pazdor, Adam G. M.
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2017, 2017, 10440 : 130 - 145
  • [25] MapReduce-Based D_ELT Framework to Address the Challenges of Geospatial Big Data
    Jo, Junghee
    Lee, Kang-Woo
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (11)
  • [26] Scaling up MapReduce-based Big Data Processing on Multi-GPU systems
    Hai Jiang
    Yi Chen
    Zhi Qiao
    Tien-Hsiung Weng
    Kuan-Ching Li
    Cluster Computing, 2015, 18 : 369 - 383
  • [27] MapReduce-based parallel GEP algorithm for efficient function mining in big data applications
    Liu, Yang
    Ma, Chenxiao
    Xu, Lixiong
    Shen, Xiaodong
    Li, Maozhen
    Li, Pengcheng
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (23):
  • [28] LandQυ2: A MapReduce-Based System for Processing Arable Land Quality Big Data
    Yao, Xiaochuang
    Mokbel, Mohamed E.
    Ye, Sijing
    Li, Guoqing
    Alarabi, Louai
    Eldawy, Ahmed
    Zhao, Zuliang
    Zhao, Long
    Zhu, Dehai
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2018, 7 (07)
  • [29] Gaussian relevance vector MapReduce-based annealed Glowworm optimization for big medical data scheduling
    Patan, Rizwan
    Kallam, Suresh
    Gandomi, Amir H.
    Hanne, Thomas
    Ramachandran, Manikandan
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2022, 73 (10) : 2204 - 2215
  • [30] A MapReduce-Based Nearest Neighbor Approach for Big-Data-Driven Traffic Flow Prediction
    Xia, Dawen
    Li, Huaqing
    Wang, Binfeng
    Li, Yantao
    Zhang, Zili
    IEEE ACCESS, 2016, 4 : 2920 - 2934