A Novel Method for Disease Prediction: Hybrid of Random Forest and Multivariate Adaptive Regression Splines

被引:26
|
作者
Yao, Dengju [1 ]
Yang, Jing [1 ]
Zhan, Xiaojuan [2 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
[2] Heilongjiang Inst Technol, Dept Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
data mining; medical data; random forest; multivariate adaptive regression splines;
D O I
10.4304/jcp.8.1.170-177
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Using data mining technology for disease prediction and diagnosis has become the focus of attention. Data mining technology provides an important means for extracting valuable medical rules hidden in medical data and acts as an important role in disease prediction and clinical diagnosis. This paper surveys some kind of popular data mining techniques for disease prediction and diagnosis, such as decision tree, associated rule analysis and clustering analysis. Then, a novel hybrid method of random forest and multivariate adaptive regression splines is proposed for building disease prediction model. Firstly, random forest algorithm is used to perform a preliminary screening of variables and to gain an importance ranks. Then, the new dataset selected by top-k important predictors is input into the MARS procedure, which is responsible for building interpretable models for predicting disease survivability. The capability of this combination method is evaluated using basic performance measurements (e.g., accuracy, sensitivity, and specificity) along with a 10-fold crossvalidation. Experimental results show that the proposed method provides a higher accuracy and a relatively simple model.
引用
收藏
页码:170 / 177
页数:8
相关论文
共 50 条
  • [41] A nonparametric regression method for multiple longitudinal phenotypes using multivariate adaptive splines
    Wensheng Zhu
    Heping Zhang
    Frontiers of Mathematics in China, 2013, 8 : 731 - 743
  • [42] A nonparametric regression method for multiple longitudinal phenotypes using multivariate adaptive splines
    Zhu, Wensheng
    Zhang, Heping
    FRONTIERS OF MATHEMATICS IN CHINA, 2013, 8 (03) : 731 - 743
  • [43] Multivariate Adaptive Regression Splines: A Potential Method for Tissue Culture Data Analysis
    Akin, Meleksen
    Eyduran, Sadiye Peral
    Ercisli, Sezai
    Reed, Barbara M.
    IN VITRO CELLULAR & DEVELOPMENTAL BIOLOGY-ANIMAL, 2020, 56 (01) : S29 - S29
  • [44] Rock slope damage level prediction by using multivariate adaptive regression splines (MARS)
    Erdik, Tarkan
    Pektas, Ali Osman
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (07): : 2269 - 2278
  • [45] Multivariate Adaptive Regression Splines Enhance Genomic Prediction of Non-Additive Traits
    de Oliveira Celeri, Mauricio
    da Costa, Weverton Gomes
    Nascimento, Ana Carolina Campana
    Azevedo, Camila Ferreira
    Cruz, Cosme Damiao
    Sagae, Vitor Seiti
    Nascimento, Moyses
    AGRONOMY-BASEL, 2024, 14 (10):
  • [46] Multivariate adaptive regression splines based prediction of peak expiratory flow with spirometric data
    Vijayaraghavan, Pramila
    Veezhinathan, Mahesh
    TECHNOLOGY AND HEALTH CARE, 2016, 24 : S253 - S260
  • [47] Rock slope damage level prediction by using multivariate adaptive regression splines (MARS)
    Tarkan Erdik
    Ali Osman Pektas
    Neural Computing and Applications, 2019, 31 : 2269 - 2278
  • [48] Regional VTEC modeling with multivariate adaptive regression splines
    Durmaz, Murat
    Karslioglu, Mahmut Onur
    Nohutcu, Metin
    ADVANCES IN SPACE RESEARCH, 2010, 46 (02) : 180 - 189
  • [49] Multivariate adaptive regression splines: a powerful method for detecting disease-risk relationship differences among subgroups
    York, TP
    Eaves, LJ
    van den Oord, EJCG
    STATISTICS IN MEDICINE, 2006, 25 (08) : 1355 - 1367
  • [50] An Improved Multivariate Adaptive Regression Splines (MARS) Method for Prediction of Compressive Strength of High-Strength (HS) Concrete
    Fu, Limei
    Peng, Zhihao
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (04) : 4511 - 4530