Integration of pan-cancer multi-omics data for novel mixed subgroup identification using machine learning methods

被引:4
|
作者
Khadirnaikar, Seema [1 ]
Shukla, Sudhanshu [2 ]
Prasanna, S. R. M. [1 ]
机构
[1] Indian Inst Technol Dharwad, Dept Elect Engn, Dharwad, Karnataka, India
[2] Indian Inst Technol Dharwad, Dept Biosci & Bioengn, Dharwad, Karnataka, India
来源
PLOS ONE | 2023年 / 18卷 / 10期
关键词
MOLECULAR CLASSIFICATION; HETEROGENEITY;
D O I
10.1371/journal.pone.0287176
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cancer is a heterogeneous disease, and patients with tumors from different organs can share similar epigenetic and genetic alterations. Therefore, it is crucial to identify the novel subgroups of patients with similar molecular characteristics. It is possible to propose a better treatment strategy when the heterogeneity of the patient is accounted for during subgroup identification, irrespective of the tissue of origin. This work proposes a machine learning (ML) based pipeline for subgroup identification in pan-cancer. Here, mRNA, miRNA, DNA methylation, and protein expression features from pan-cancer samples were concatenated and non-linearly projected to a lower dimension using an ML algorithm. This data was then clustered to identify multi-omics-based novel subgroups. The clinical characterization of these ML subgroups indicated significant differences in overall survival (OS) and disease-free survival (DFS) (p-value<0.0001). The subgroups formed by the patients from different tumors shared similar molecular alterations in terms of immune microenvironment, mutation profile, and enriched pathways. Further, decision-level and feature-level fused classification models were built to identify the novel subgroups for unseen samples. Additionally, the classification models were used to obtain the class labels for the validation samples, and the molecular characteristics were verified. To summarize, this work identified novel ML subgroups using multi-omics data and showed that the patients with different tumor types could be similar molecularly. We also proposed and validated the classification models for subgroup identification. The proposed classification models can be used to identify the novel multi-omics subgroups, and the molecular characteristics of each subgroup can be used to design appropriate treatment regimen.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Machine learning multi-omics analysis reveals cancer driver dysregulation in pan-cancer cell lines compared to primary tumors
    Lauren M. Sanders
    Rahul Chandra
    Navid Zebarjadi
    Holly C. Beale
    A. Geoffrey Lyle
    Analiz Rodriguez
    Ellen Towle Kephart
    Jacob Pfeil
    Allison Cheney
    Katrina Learned
    Rob Currie
    Leonid Gitlin
    David Vengerov
    David Haussler
    Sofie R. Salama
    Olena M. Vaske
    Communications Biology, 5
  • [22] Tissue-specific identification of multi-omics features for pan-cancer drug response prediction
    Zhao, Zhi
    Wang, Shixiong
    Zucknick, Manuela
    Aittokallio, Tero
    ISCIENCE, 2022, 25 (08)
  • [23] Integrated Multi-omics Analysis Using Variational Autoencoders: Application to Pan-cancer Classification
    Zhang, Xiaoyu
    Zhang, Jingqing
    Sun, Kai
    Yang, Xian
    Dai, Chengliang
    Guo, Yike
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 765 - 769
  • [24] Multi-omics integration analysis of GPCRs in pan-cancer to uncover inter-omics relationships and potential driver genes
    Li, Shiqi
    Chen, Xin
    Chen, Jianfang
    Wu, Binjian
    Liu, Jing
    Guo, Yanzhi
    Li, Menglong
    Pu, Xuemei
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 161
  • [25] A multi-omics analysis of effector and resting treg cells in pan-cancer
    Chalepaki, Anna-Maria
    Gkoris, Marios
    Chondrou, Irene
    Kourti, Malamati
    Georgakopoulos-Soares, Ilias
    Zaravinos, Apostolos
    Computers in Biology and Medicine, 2025, 189
  • [26] Machine learning for the analysis of multi-omics data
    Sun, Yanni
    METHODS, 2021, 189 : 1 - 2
  • [27] Pan-cancer evidence of prognosis, immune infiltration, and immunotherapy efficacy for annexin family using multi-omics data
    Chong Shen
    Siyang Zhang
    Zhe Zhang
    Shaobo Yang
    Yu Zhang
    Yuda Lin
    Chong Fu
    Zhi Li
    Zhouliang Wu
    Zejin Wang
    Zhuolun Li
    Jian Guo
    Peng Li
    Hailong Hu
    Functional & Integrative Genomics, 2023, 23
  • [28] Evaluation and comparison of multi-omics data integration methods for cancer subtyping
    Duan, Ran
    Gao, Lin
    Gao, Yong
    Hu, Yuxuan
    Xu, Han
    Huang, Mingfeng
    Song, Kuo
    Wang, Hongda
    Dong, Yongqiang
    Jiang, Chaoqun
    Zhang, Chenxing
    Jia, Songwei
    PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (08)
  • [29] A Multi-Omics Analysis of a Mitophagy-Related Signature in Pan-Cancer
    Agir, Nora
    Georgakopoulos-Soares, Ilias
    Zaravinos, Apostolos
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2025, 26 (02)
  • [30] Pan-cancer evidence of prognosis, immune infiltration, and immunotherapy efficacy for annexin family using multi-omics data
    Shen, Chong
    Zhang, Siyang
    Zhang, Zhe
    Yang, Shaobo
    Zhang, Yu
    Lin, Yuda
    Fu, Chong
    Li, Zhi
    Wu, Zhouliang
    Wang, Zejin
    Li, Zhuolun
    Guo, Jian
    Li, Peng
    Hu, Hailong
    FUNCTIONAL & INTEGRATIVE GENOMICS, 2023, 23 (03)