A Hybrid Regression Model for Improving Prediction Accuracy

Cited by: 1

Authors:

Poojari, Satyanarayana [1]

Ismail, B. [2]

Affiliations:

[1] Mangalore Univ, Dept Stat, Mangalagangothri, India

[2] Yenepoya Deemed Be Univ, Dept Stat, Mangalore, India

Source:

ELECTRONIC JOURNAL OF APPLIED STATISTICAL ANALYSIS | 2023, Vol. 16, No. 3

Keywords:

Regression Tree; KNN; Hybrid model; SVR; Simulation;

DOI:

10.1285/i20705948v16n3p784

Chinese Library Classification (CLC):

O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];

Subject classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

Regression Tree (RT) and K-Nearest Neighbor (KNN) models play significant roles in machine learning. RT facilitates interpretable decision-making, aiding in the comprehension of complex data relationships, while KNN is valued for its simplicity, adaptability to non-linear data, and robustness to noise, making it a versatile tool across various applications. The primary drawback of RT is its tendency to assign the same predicted value (the average) to all tuples satisfying the same splitting criterion. KNN is sensitive to irrelevant or redundant features, since all features contribute to similarity. This paper proposes a hybrid regression model based on RT and KNN that addresses these issues. The model's performance is compared with KNN using 10 types of distance measures and further assessed against RT, K-Nearest Neighbor regression (KNN), and Support Vector Regression (SVR) through a Monte Carlo simulation study. Simulation results indicate that the hybrid model outperforms all other regression models, regardless of sample size, when observations follow normal distributions or t-distributions. The proposed model's effectiveness is demonstrated through a real-life application using data on global warming in Delhi.
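
The abstract does not spell out how RT and KNN are combined, so the following is only a minimal sketch of one plausible reading, not the authors' method: a regression tree partitions the data, and a prediction is the mean of the k nearest training points inside the leaf the query falls into, rather than the leaf-wide average. All names here (`build_tree`, `find_leaf`, `tree_predict`, `hybrid_predict`), the Euclidean distance, the parameter defaults, and the toy data are assumptions made for illustration.

```python
import math
from statistics import mean


def sse(ys):
    """Sum of squared deviations from the mean (0 for an empty list)."""
    if not ys:
        return 0.0
    m = mean(ys)
    return sum((v - m) ** 2 for v in ys)


def build_tree(X, y, depth=0, max_depth=2, min_leaf=3):
    """Greedy regression tree; each leaf keeps its own training rows."""
    best = None
    if depth < max_depth and len(y) >= 2 * min_leaf:
        for j in range(len(X[0])):
            for t in sorted({row[j] for row in X}):
                left = [i for i, row in enumerate(X) if row[j] <= t]
                right = [i for i, row in enumerate(X) if row[j] > t]
                if len(left) < min_leaf or len(right) < min_leaf:
                    continue
                cost = sse([y[i] for i in left]) + sse([y[i] for i in right])
                if best is None or cost < best[0]:
                    best = (cost, j, t, left, right)
    if best is None:
        return {"leaf": True, "X": X, "y": y}
    _, j, t, left, right = best
    return {
        "leaf": False, "feature": j, "threshold": t,
        "left": build_tree([X[i] for i in left], [y[i] for i in left],
                           depth + 1, max_depth, min_leaf),
        "right": build_tree([X[i] for i in right], [y[i] for i in right],
                            depth + 1, max_depth, min_leaf),
    }


def find_leaf(tree, x):
    """Follow the splits down to the leaf that contains x."""
    while not tree["leaf"]:
        branch = "left" if x[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree


def tree_predict(tree, x):
    """Plain RT prediction: the average response of the whole leaf."""
    return mean(find_leaf(tree, x)["y"])


def hybrid_predict(tree, x, k=3):
    """Assumed hybrid: mean of the k nearest training points within the leaf."""
    leaf = find_leaf(tree, x)
    pairs = sorted(zip(leaf["X"], leaf["y"]), key=lambda p: math.dist(p[0], x))
    return mean(v for _, v in pairs[:k])


# Toy data (hypothetical): one feature, a linear trend plus a jump at x = 2.
X = [[i / 10] for i in range(40)]
y = [2 * row[0] + (5 if i >= 20 else 0) for i, row in enumerate(X)]
tree = build_tree(X, y)

for query in ([0.1], [0.8]):
    print(query, tree_predict(tree, query), hybrid_predict(tree, query))
```

On this toy data both queries fall into the same leaf, so the plain tree returns one shared average for them, while the leaf-local KNN step returns different values that follow the trend inside the leaf. Restricting the neighbor search to a single leaf is one way the tree's splits could limit the influence of irrelevant features, though whether the paper does this is not stated in the abstract.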

Pages: 784-801

Page count: 19
