SMOTEBoost for Regression: Improving the Prediction of Extreme Values

被引:25
|
作者
Moniz, Nuno [1 ]
Ribeiro, Rita P. [1 ]
Cerqueira, Vitor [1 ]
Chawla, Nitesh [2 ]
机构
[1] Univ Porto, INESC TEC, Porto, Portugal
[2] Univ Notre Dame, Indiana, PA USA
关键词
Imbalanced Domain Learning; Ensemble Learning; Boosting; Regression;
D O I
10.1109/DSAA.2018.00025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Supervised learning with imbalanced domains is one of the biggest challenges in machine learning. Such tasks differ from standard learning tasks by assuming a skewed distribution of target variables, and user domain preference towards under-represented cases. Most research has focused on imbalanced classification tasks, where a wide range of solutions has been tested. Still, little work has been done concerning imbalanced regression tasks. In this paper, we propose an adaptation of the SMOTEBoost approach for the problem of imbalanced regression. Originally designed for classification tasks, it combines boosting methods and the SMOTE resampling strategy. We present four variants of SMOTEBoost and provide an experimental evaluation using 30 datasets with an extensive analysis of results in order to assess the ability of SMOTEBoost methods in predicting extreme target values, and their predictive trade-off concerning baseline boosting methods. SMOTEBoost is publicly available in a software package.
引用
收藏
页码:150 / 159
页数:10
相关论文
共 50 条
  • [41] An Optimized Extreme Learning Machine Algorithm for Improving Software Maintainability Prediction
    Gupta, Shkha
    Chug, Anuradha
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 829 - 836
  • [42] The role of clouds in improving the regression model for hourly values of diffuse solar radiation
    Furlan, Claudia
    de Oliveira, Amauri Pereira
    Soares, Jacyra
    Codato, Georgia
    Escobedo, Joao Francisco
    APPLIED ENERGY, 2012, 92 : 240 - 254
  • [43] Extreme Rainfall Prediction using Bayesian Quantile Regression in Statistical Downscaling Modeling
    Rachmawati, Ro'fah Nur
    Sungkawa, Iwa
    Rahayu, Anita
    4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 : 406 - 413
  • [44] Seasonal prediction of winter extreme precipitation over Canada by support vector regression
    Zeng, Z.
    Hsieh, W. W.
    Shabbar, A.
    Burrows, W. R.
    HYDROLOGY AND EARTH SYSTEM SCIENCES, 2011, 15 (01) : 65 - 74
  • [46] Prediction of the Extreme Values and the OptimalRatios of Triple Jump Based on the Grey System Theory
    XU Ming(Department of physical Education
    Journal of Systems Science and Systems Engineering, 1999, (01) : 40 - 50
  • [47] PREDICTION OF EXTREME VALUES OF OFFSHORE STRUCTURAL RESPONSE BY AN EFFICIENT TIME SIMULATION TECHNIQUE
    Abu Husain, M. K.
    Zaki, N. I. Mohd
    Najafian, G.
    33RD INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, 2014, VOL 4A, 2014,
  • [48] Improving the calibration of the best member method using quantile regression to forecast extreme temperatures
    Gogonel, A.
    Collet, J.
    Bar-Hen, A.
    NATURAL HAZARDS AND EARTH SYSTEM SCIENCES, 2013, 13 (05) : 1161 - 1168
  • [49] Evaluation of Gaussian process regression kernel functions for improving groundwater prediction
    Pan, Yue
    Zeng, Xiankui
    Xu, Hongxia
    Sun, Yuanyuan
    Wang, Dong
    Wu, Jichun
    JOURNAL OF HYDROLOGY, 2021, 603
  • [50] Evaluation of Gaussian process regression kernel functions for improving groundwater prediction
    Pan, Yue
    Zeng, Xiankui
    Xu, Hongxia
    Sun, Yuanyuan
    Wang, Dong
    Wu, Jichun
    Journal of Hydrology, 2021, 603