Scalable Generalized Multitarget Linear Regression With Output Dependence Estimation

被引:0
|
作者
Camejo Corona, Julio [1 ]
Gonzalez, Hector [2 ]
Morell, Carlos [3 ]
机构
[1] Univ Cienfuegos Carlos Rafael Rodriguez UCF, Carretera Rodas Km 4, Cienfuegos, Cuba
[2] Univ Ciencias Informat UCI, Havana, Cuba
[3] Univ Cent Marta Abreu UCLV, Villa Clara, Santa Clara, Cuba
关键词
Big data; Apache Spark; Multi-target regression;
D O I
10.1007/978-3-030-89691-1_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays the phenomenon of Big Data is overwhelming our capacity to extract relevant knowledge through classical machine learning techniques. Multitarget regression has arisen in several interesting industrial and environmental application domains, such as ecological modeling and energy forecasting. However, standard multi-target regressors are not designed to perform well with such amounts of data. This paper proposes a scalable implementation for a multi-target linear regression algorithm with output dependence estimation for Big Data analytics in Apache Spark. Our experiments on large-scale datasets show an accurate analysis compared to standard implementation and order of training time reduction as the available number of working nodes in the processing cluster increases.
引用
收藏
页码:60 / 68
页数:9
相关论文
共 50 条
  • [1] Truncated estimation in functional generalized linear regression models
    Liu, Xi
    Divani, Afshin A.
    Petersen, Alexander
    Computational Statistics and Data Analysis, 2022, 169
  • [2] PRINCIPAL COMPONENT ESTIMATION FOR GENERALIZED LINEAR-REGRESSION
    MARX, BD
    SMITH, EP
    BIOMETRIKA, 1990, 77 (01) : 23 - 31
  • [3] Truncated estimation in functional generalized linear regression models
    Liu, Xi
    Divani, Afshin A.
    Petersen, Alexander
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 169
  • [5] Testing the hypothesis of a generalized linear regression model using nonparametric regression estimation
    Celia, Rodriguez-Campos, M.
    Gonzalez-Manteiga, W.
    Cao, R.
    Journal of Statistical Planning and Inference, 67 (01):
  • [6] Testing the hypothesis of a generalized linear regression model using nonparametric regression estimation
    Rodriguez-Campos, MC
    Gonzalez-Manteiga, W
    Cao, R
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1998, 67 (01) : 99 - 122
  • [7] IMPLICATIONS OF SURVEY DESIGN FOR GENERALIZED REGRESSION ESTIMATION OF LINEAR FUNCTIONS
    SARNDAL, CE
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1982, 7 (02) : 155 - 170
  • [8] Estimation for generalized partially functional linear additive regression model
    Du, Jiang
    Cao, Ruiyuan
    Kwessi, Eddy
    Zhang, Zhongzhan
    JOURNAL OF APPLIED STATISTICS, 2019, 46 (05) : 914 - 925
  • [9] THE ESTIMATION OF LINEAR REGRESSION IS BASED ON THE GENERALIZED LEAST MODULES METHOD
    Tyrsin, A. N.
    Sokolov, L. A.
    VESTNIK SAMARSKOGO GOSUDARSTVENNOGO TEKHNICHESKOGO UNIVERSITETA-SERIYA-FIZIKO-MATEMATICHESKIYE NAUKI, 2010, (05): : 134 - 142
  • [10] Scalable holistic linear regression
    Bertsimas, Dimitris
    Li, Michael Lingzhi
    OPERATIONS RESEARCH LETTERS, 2020, 48 (03) : 203 - 208