A pathway-based computational framework for identification of a new modal of multi-omics biomarkers and its application in esophageal cancer

Cited by: 1
Authors
Zhou, Qi [1]
Ye, Weicai [2, 3]
Yu, Xiaolan [1, 4]
Bao, Yun-Juan [1]
Affiliations
[1] Hubei Univ, Sch Life Sci, State Key Lab Biocatalysis & Enzyme Engn, Wuhan, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Sci, Guangzhou, Peoples R China
[3] Sun Yat Sen Univ, Natl Engn Lab Big Data Anal & Applicat, Guangzhou, Peoples R China
[4] Hubei Jiangxia Lab, Wuhan, Peoples R China
Keywords
Multi-omics biomarkers; Machine learning; Pathway; Esophageal carcinoma; SQUAMOUS-CELL CARCINOMA; EXPRESSION PROFILES; EARLY-DIAGNOSIS; PROGNOSIS; PACKAGE; GROWTH;
DOI
10.1016/j.cmpb.2024.108077
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Background: The pathway-based strategy has recently been proposed for identifying biomarkers, offering higher biological interpretability and cross-data robustness than the conventional gene-based strategy. However, its utility in clinical applications has been limited by high computational complexity and ill-defined performance. Objective: The current study presents a machine learning-based computational framework that uses multi-omics data to identify a new modal of biomarkers, called pathway-derived core biomarkers, which combine the advantages of gene-based and pathway-based biomarkers. Methods: Machine-learning methods and a gene-pathway network were integrated to select the pathway-derived core biomarkers. Multiple machine-learning algorithms were used to construct and validate diagnostic models of the biomarkers based on more than 1400 multi-omics clinical samples of esophageal squamous cell carcinoma (ESCC). Results: The classifier models based on the new modal biomarkers achieved superior performance in the training datasets, with an average AUC/accuracy of 0.98/0.95 for mRNAs and 0.89/0.81 for miRNAs, higher than currently known classifier models based on the conventional gene-based and pathway-based strategies. In the testing cohorts, the AUC/accuracy increased by 6.1%/7.3% relative to the models based on the native gene-based biomarkers. The improved performance was further confirmed in independent validation cohorts. Specifically, the sensitivity/specificity increased by ~3% and the variance decreased significantly by ~69% compared with the native gene-based biomarkers.
Importantly, the pathway-derived core biomarkers also recovered 45% more previously reported biomarkers than the gene-based biomarkers and are more functionally relevant to ESCC etiology (involved in 14 versus 7 pathways related to ESCC or other cancers), highlighting the cross-data robustness of this new modal of biomarkers via enhanced functional relevance. Conclusions: The new modal of biomarkers not only has improved predictive performance and robustness but also exhibits higher functional interpretability, giving it potential for application in cancer diagnosis.
Pages: 14