A Hybrid Regression Model for Improving Prediction Accuracy

被引:1
|
作者
Poojari, Satyanarayana [1 ]
Ismail, B. [2 ]
机构
[1] Mangalore Univ, Dept Stat, Mangalagangothri, India
[2] Yenepoya Deemed Be Univ, Dept Stat, Mangalore, India
关键词
Regression Tree; KNN; Hybrid model; SVR; Simulation;
D O I
10.1285/i20705948v16n3p784
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Regression Tree (RT) and K-Nearest Neighbor (KNN) models play significant roles in machine learning. RT facilitates interpretable decision-making, aiding in the comprehension of complex data relationships, while KNN is valued for its simplicity, adaptability to non-linear data, and robustness to noise, making it a versatile tool across various applications. The primary drawback of Regression Tree is its tendency to assign the same predicted value (average value) to all tuples satisfying the same corresponding splitting criterion. K-Nearest Neighbors (KNN) is sensitive to irrelevant or redundant features since all features contribute to similarity. This paper proposes a hybrid regression model based on Regression Tree (RT) and KNN, addressing the aforementioned issues. The model's performance is compared with KNN using 10 types of distance measures and further assessed against RT, K Nearest Neighbor regression (KNN), and Support Vector Regression (SVR) through a Monte Carlo simulation study. Simulation results indicate that the hybrid model outperforms all other regression models, regardless of sample size, when observations follow normal distributions or t-distributions.The proposed model's effectiveness is demonstrated through a real-life application using data on global warming in Delhi.
引用
收藏
页码:784 / 801
页数:19
相关论文
共 50 条
  • [41] Hybrid prediction model for improving reliability in self-healing system
    Yoo, Gijong
    Park, Jeongmin
    Lee, Eunseok
    FOURTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS, PROCEEDINGS, 2006, : 108 - +
  • [42] A hybrid RF-LSTM based on CEEMDAN for improving the accuracy of building energy consumption prediction
    Karijadi, Irene
    Chou, Shuo-Yan
    ENERGY AND BUILDINGS, 2022, 259
  • [43] Model Selection of Symbolic Regression to Improve the Accuracy of PM2.5 Concentration Prediction
    Yang, Guangfei
    Huang, Jian
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2015, 2015, 9441 : 189 - 197
  • [44] A hybrid FSRF model based on regression algorithm for diabetes medical expense prediction
    Luo, Min
    Xiao, Fei
    Chen, Zi-yu
    Wang, Xiao-kang
    Hou, Wen-hui
    Wang, Jian-qiang
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2024, 207
  • [45] Improving Time Series Regression Model Accuracy via Systematic Training Dataset Augmentation and Sampling
    Stroebel, Robin
    Mau, Marcus
    Puchta, Alexander
    Fleischer, Juergen
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (02): : 1072 - 1086
  • [46] SMOTEBoost for Regression: Improving the Prediction of Extreme Values
    Moniz, Nuno
    Ribeiro, Rita P.
    Cerqueira, Vitor
    Chawla, Nitesh
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 150 - 159
  • [47] Prediction of Accuracy in Emergency Health Records using Hybrid Machine Learning Model
    Raghavendra, G. S.
    Mahesh, Shanthi
    Rao, M. V. P. Chandra Sekhara
    JOURNAL OF PHARMACEUTICAL RESEARCH INTERNATIONAL, 2021, 33 (58A) : 206 - 212
  • [48] Hybrid XGBoost model with hyperparameter tuning for prediction of liver disease with better accuracy
    Surjeet Dalal
    Edeh Michael Onyema
    Amit Malik
    World Journal of Gastroenterology, 2022, 28 (46) : 6551 - 6563
  • [49] Hybrid XGBoost model with hyperparameter tuning for prediction of liver disease with better accuracy
    Dalal, Surjeet
    Onyema, Edeh Michael
    Malik, Amit
    WORLD JOURNAL OF GASTROENTEROLOGY, 2022, 28 (46) : 6551 - 6563
  • [50] Improving accuracy of neural prediction by market segmentation
    Quah, TS
    Srinivasan, B
    IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 935 - 939