A two-stage stacked-based heterogeneous ensemble learning for cancer survival prediction

Cited by: 8
Authors
Yan, Fangzhou [1]
Feng, Yi [2]
Affiliations
[1] Sichuan Univ, Coll Elect Engn, Chengdu 610064, Peoples R China
[2] Sichuan Univ, Business Sch, Chengdu 610064, Peoples R China
Keywords
Stacked generalization strategy; Cancer survival prediction; Feature selection; Heterogeneous ensemble learning; MODEL; ALGORITHM; CLASSIFICATION; OPTIMIZATION; DIAGNOSIS; SYSTEM;
DOI
10.1007/s40747-022-00791-w
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Cancer survival prediction is one of the three major tasks of cancer prognosis. To improve its accuracy, we propose an a priori knowledge- and stability-based feature selection (PKSFS) method and develop a novel two-stage heterogeneous stacked ensemble learning model (BQAXR) to predict the survival status of cancer patients. Specifically, PKSFS first obtains the optimal feature subsets from high-dimensional cancer datasets to guide the subsequent model construction. BQAXR then generates five high-quality heterogeneous learners, using improved methods to overcome the shortcomings of the individual learners, and integrates them in two stages through the stacked generalization strategy based on the optimal feature subsets. To verify the merits of PKSFS and BQAXR, we collected real survival datasets for gastric cancer and skin cancer from the Surveillance, Epidemiology, and End Results (SEER) database of the National Cancer Institute and conducted extensive numerical experiments from different perspectives on these two datasets. The accuracy and AUC of the proposed method are 0.8209 and 0.8203 on the gastric cancer dataset, and 0.8336 and 0.8214 on the skin cancer dataset. The results show that PKSFS has marked advantages over popular feature selection methods in processing high-dimensional datasets. By taking full advantage of heterogeneous high-quality learners, BQAXR is superior not only to mainstream machine learning methods but also to improved machine learning methods, which indicates that it can effectively improve the accuracy of cancer survival prediction and provide a reference for doctors making medical decisions.
Pages: 4619-4639
Number of pages: 21