A novel non-negative Bayesian stacking modeling method for Cancer survival prediction using high-dimensional omics data

Authors
Shen, Junjie [1]
Wang, Shuo [2]
Sun, Hao [1]
Huang, Jie [1]
Bai, Lu [1]
Wang, Xichao [1]
Dong, Yongfei [1]
Tang, Zaixiang [1]
Affiliations
[1] Soochow Univ, Sch Publ Hlth, Jiangsu Key Lab Prevent & Translat Med Major Chron, Dept Biostat,Suzhou Med Coll, Suzhou 215123, Jiangsu, Peoples R China
[2] Univ Freiburg, Inst Med Biometry & Stat, Fac Med & Med Ctr, D-79085 Freiburg, Germany
Funding
National Natural Science Foundation of China;
Keywords
Survival stacking; Non-negative Bayesian model; Artificial neural network; GENERALIZED LINEAR-MODELS; REGULARIZATION PATHS; REGRESSION SHRINKAGE; HUNTINGTON-DISEASE; SELECTION; LASSO; GENES;
DOI
10.1186/s12874-024-02232-3
Chinese Library Classification number
R19 [Health care organization and services (health services administration)];
Abstract
Background: Survival prediction from high-dimensional molecular data is an active research topic in genomics and precision medicine, especially in cancer studies. Because carcinogenesis has a pathway-based pathogenesis, models that use such group structures more closely reflect disease progression and prognosis. Many approaches can integrate group information; however, most of them are single-model methods, which may lead to unstable predictions.
Methods: We introduced a novel survival stacking method that uses group structure information to improve the robustness of cancer survival prediction with high-dimensional omics data. With a super learner, survival stacking combines the predictions of multiple sub-models that are trained independently on the features in pre-grouped biological pathways. In addition to a non-negative linear combination of sub-models, we extended the super learner to a non-negative Bayesian hierarchical generalized linear model and an artificial neural network (ANN). We compared the proposed modeling strategy with the widely used penalized survival method Lasso Cox and with several group penalized methods, e.g., group Lasso Cox, in a simulation study and a real-world data application.
Results: The proposed survival stacking method showed superior and robust discrimination compared with single-model methods on high-noise simulated data and on real-world data. The non-negative Bayesian stacking method can identify important biological signaling pathways and genes that are associated with cancer prognosis.
Conclusions: This study proposed a novel survival stacking strategy that incorporates biological group information into cancer prognosis models. It also extended the super learner to a non-negative Bayesian model and an ANN, broadening the ways in which sub-models can be combined. The proposed Bayesian stacking strategy exhibited favorable properties in the prediction and interpretation of complex survival data, which may aid in discovering cancer targets.
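The non-negative linear combination of sub-models described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each pathway sub-model has already produced one risk score per patient (ideally out-of-fold), and it fits non-negative stacking weights by projected gradient descent on the Cox negative partial log-likelihood, ignoring ties. The function names, the simulated data, the step size, and the iteration count are all hypothetical choices made for illustration.

```python
import math
import random


def concordance_index(time, event, risk):
    """Harrell's C-index: share of comparable pairs the risk score orders correctly."""
    concordant, comparable = 0.0, 0
    for i in range(len(time)):
        if not event[i]:
            continue  # each comparable pair is anchored on an observed event
        for j in range(len(time)):
            if time[j] > time[i]:  # subject j was still at risk after i failed
                comparable += 1
                if risk[i] > risk[j]:
                    concordant += 1.0
                elif risk[i] == risk[j]:
                    concordant += 0.5
    return concordant / comparable


def cox_gradient(weights, scores, time, event):
    """Gradient of the mean Cox negative partial log-likelihood (ties ignored)."""
    n, k = len(time), len(weights)
    exp_eta = [math.exp(sum(w * s for w, s in zip(weights, row))) for row in scores]
    grad = [0.0] * k
    for i in range(n):
        if not event[i]:
            continue
        risk_set = [j for j in range(n) if time[j] >= time[i]]
        denom = sum(exp_eta[j] for j in risk_set)
        for m in range(k):
            weighted_mean = sum(exp_eta[j] * scores[j][m] for j in risk_set) / denom
            grad[m] += weighted_mean - scores[i][m]
    return [g / n for g in grad]


def fit_nonnegative_stack(scores, time, event, steps=100, lr=0.1):
    """Projected gradient descent: take a step, then clip the weights at zero."""
    k = len(scores[0])
    weights = [1.0 / k] * k
    for _ in range(steps):
        grad = cox_gradient(weights, scores, time, event)
        weights = [max(0.0, w - lr * g) for w, g in zip(weights, grad)]
    return weights


# Simulated stand-in for sub-model outputs (assumption): three "pathway" risk
# scores per patient, of which only the first two drive the hazard.
random.seed(0)
scores, time, event = [], [], []
for _ in range(60):
    row = [random.gauss(0.0, 1.0) for _ in range(3)]
    t_event = random.expovariate(math.exp(row[0] + 0.5 * row[1]))
    t_censor = random.expovariate(0.3)
    scores.append(row)
    time.append(min(t_event, t_censor))
    event.append(t_event <= t_censor)

weights = fit_nonnegative_stack(scores, time, event)
stacked = [sum(w * s for w, s in zip(weights, row)) for row in scores]
c_stacked = concordance_index(time, event, stacked)
print("stacking weights:", [round(w, 3) for w in weights])
print("in-sample C-index:", round(c_stacked, 3))
```

The C-index printed here is computed on the same data used to fit the weights, so it is optimistic; an honest assessment would use held-out patients. The Bayesian hierarchical and ANN super learners mentioned in the abstract are not covered by this sketch.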
Pages: 12