A novel non-negative Bayesian stacking modeling method for Cancer survival prediction using high-dimensional omics data

被引:0
|
作者
Shen, Junjie [1 ]
Wang, Shuo [2 ]
Sun, Hao [1 ]
Huang, Jie [1 ]
Bai, Lu [1 ]
Wang, Xichao [1 ]
Dong, Yongfei [1 ]
Tang, Zaixiang [1 ]
机构
[1] Soochow Univ, Sch Publ Hlth, Jiangsu Key Lab Prevent & Translat Med Major Chron, Dept Biostat,Suzhou Med Coll, Suzhou 215123, Jiangsu, Peoples R China
[2] Univ Freiburg, Inst Med Biometry & Stat, Fac Med & Med Ctr, D-79085 Freiburg, Germany
基金
中国国家自然科学基金;
关键词
Survival stacking; Non-negative Bayesian model; Artificial neural network; GENERALIZED LINEAR-MODELS; REGULARIZATION PATHS; REGRESSION SHRINKAGE; HUNTINGTON-DISEASE; SELECTION; LASSO; GENES;
D O I
10.1186/s12874-024-02232-3
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background Survival prediction using high-dimensional molecular data is a hot topic in the field of genomics and precision medicine, especially for cancer studies. Considering that carcinogenesis has a pathway-based pathogenesis, developing models using such group structures is a closer mimic of disease progression and prognosis. Many approaches can be used to integrate group information; however, most of them are single-model methods, which may account for unstable prediction.Methods We introduced a novel survival stacking method that modeled using group structure information to improve the robustness of cancer survival prediction in the context of high-dimensional omics data. With a super learner, survival stacking combines the prediction from multiple sub-models that are independently trained using the features in pre-grouped biological pathways. In addition to a non-negative linear combination of sub-models, we extended the super learner to non-negative Bayesian hierarchical generalized linear model and artificial neural network. We compared the proposed modeling strategy with the widely used survival penalized method Lasso Cox and several group penalized methods, e.g., group Lasso Cox, via simulation study and real-world data application.Results The proposed survival stacking method showed superior and robust performance in terms of discrimination compared with single-model methods in case of high-noise simulated data and real-world data. The non-negative Bayesian stacking method can identify important biological signal pathways and genes that are associated with the prognosis of cancer.Conclusions This study proposed a novel survival stacking strategy incorporating biological group information into the cancer prognosis models. Additionally, this study extended the super learner to non-negative Bayesian model and ANN, enriching the combination of sub-models. The proposed Bayesian stacking strategy exhibited favorable properties in the prediction and interpretation of complex survival data, which may aid in discovering cancer targets.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] AucPR: An AUC-based approach using penalized regression for disease prediction with high-dimensional omics data
    Yu, Wenbao
    Park, Taesung
    BMC GENOMICS, 2014, 15
  • [22] AucPR: An AUC-based approach using penalized regression for disease prediction with high-dimensional omics data
    Wenbao Yu
    Taesung Park
    BMC Genomics, 15
  • [23] Bayesian Method for Modeling Male Breast Cancer Survival Data
    Khan, Hafiz Mohammad Rafiqullah
    Saxena, Anshul
    Rana, Sagar
    Ahmed, Nasar Uddin
    ASIAN PACIFIC JOURNAL OF CANCER PREVENTION, 2014, 15 (02) : 663 - 669
  • [24] An Integrated Deep Network for Cancer Survival Prediction Using Omics Data
    Hassanzadeh, Hamid Reza
    Wang, May D.
    FRONTIERS IN BIG DATA, 2021, 4
  • [25] A penalized linear mixed model with generalized method of moments for prediction analysis on high-dimensional multi-omics data
    Wang, Xiaqiong
    Wen, Yalu
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (04)
  • [26] Supervised Context-Aware Non-Negative Matrix Factorization to Handle High-Dimensional High-Correlated Imbalanced Biomedical Data
    Braytee, Ali
    Liu, Wei
    Kennedy, Paul J.
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4512 - 4519
  • [27] Feature selection using autoencoders with Bayesian methods to high-dimensional data
    Shu, Lei
    Huang, Kun
    Jiang, Wenhao
    Wu, Wenming
    Liu, Hongling
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 7397 - 7406
  • [28] Filter and Wrapper Stacking Ensemble (FWSE): a robust approach for reliable biomarker discovery in high-dimensional omics data
    Budhraja, Sugam
    Doborjeh, Maryam
    Singh, Balkaran
    Tan, Samuel
    Doborjeh, Zohreh
    Lai, Edmund
    Merkin, Alexander
    Lee, Jimmy
    Goh, Wilson
    Kasabov, Nikola
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [29] A Novel Indexing Method for Improving Timeliness of High-Dimensional Data
    Lu, Jian
    Pham, Huong
    Zhu, Hongwei
    Chen, Cindy
    AMCIS 2014 PROCEEDINGS, 2014,
  • [30] A double regression method for graphical modeling of high-dimensional nonlinear and non-Gaussian data
    Liang, Siqi
    Liang, Faming
    STATISTICS AND ITS INTERFACE, 2024, 17 (04) : 669 - 680