Deep latent space fusion for adaptive representation of heterogeneous multi-omics data

被引:28
|
作者
Zhang, Chengming
Chen, Yabin
Zeng, Tao
Zhang, Chuanchao
Chen, Luonan
机构
[1] School of Mathematics and Statistics, Shandong University
[2] School of Life and Pharmaceutical Sciences, Dalian University of Technology
[3] Wuhan University, Wuhan
[4] The Huazhong University of Science and Technology, Wuhan
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
deep learning; latent space fusion; adaptive representation; omics data; complex disease; DATA INTEGRATION; VARIABLE MODEL; CANCER; NETWORK; BIOMARKERS; CLASSIFICATION; IDENTIFICATION; DISEASES; BREAST;
D O I
10.1093/bib/bbab600
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The integration of multi-omics data makes it possible to understand complex biological organisms at the system level. Numerous integration approaches have been developed by assuming a common underlying data space. Due to the noise and heterogeneity of biological data, the performance of these approaches is greatly affected. In this work, we propose a novel deep neural network architecture, named Deep Latent Space Fusion (DLSF), which integrates the multi-omics data by learning consistent manifold in the sample latent space for disease subtypes identification. DLSF is built upon a cycle autoencoder with a shared self-expressive layer, which can naturally and adaptively merge nonlinear features at each omics level into one unified sample manifold and produce adaptive representation of heterogeneous samples at the multi-omics level. We have assessed DLSF on various biological and biomedical datasets to validate its effectiveness. DLSF can efficiently and accurately capture the intrinsic manifold of the sample structures or sample clusters compared with other state-of-the-art methods, and DLSF yielded more significant outcomes for biological significance, survival prognosis and clinical relevance in application of cancer study in The Cancer Genome Atlas. Notably, as a deep case study, we determined a new molecular subtype of kidney renal clear cell carcinoma that may benefit immunotherapy in the viewpoint of multi-omics, and we further found potential subtype-specific biomarkers from multiple omics data, which were validated by independent datasets. In addition, we applied DLSF to identify potential therapeutic agents of different molecular subtypes of chronic lymphocytic leukemia, demonstrating the scalability of DLSF in diverse omics data types and application scenarios.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Making multi-omics data accessible to researchers
    Ana Conesa
    Stephan Beck
    Scientific Data, 6
  • [42] Towards multi-omics synthetic data integration
    Selvarajoo, Kumar
    Maurer-Stroh, Sebastian
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [43] Integrative clustering methods for multi-omics data
    Zhang, Xiaoyu
    Zhou, Zhenwei
    Xu, Hanfei
    Liu, Ching-Ti
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (03)
  • [44] Machine learning for the analysis of multi-omics data
    Sun, Yanni
    METHODS, 2021, 189 : 1 - 2
  • [45] Integrating multi-omics data for crop improvement
    Scossa, Federico
    Alseekh, Saleh
    Fernie, Alisdair R.
    JOURNAL OF PLANT PHYSIOLOGY, 2021, 257
  • [46] Multi-omics data fusion using adaptive GTO guided Non-negative matrix factorization for cancer subtype discovery
    Bansal, Bhavana
    Sahoo, Anita
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 228
  • [47] Deep learning on graphs for multi-omics classification of COPD
    Zhuang, Yonghua
    Xing, Fuyong
    Ghosh, Debashis
    Hobbs, Brian D. D.
    Hersh, Craig P. P.
    Banaei-Kashani, Farnoush
    Bowler, Russell P. P.
    Kechris, Katerina
    PLOS ONE, 2023, 18 (04):
  • [48] Autoencoder-assisted latent representation learning for survival prediction and multi-view clustering on multi-omics cancer subtyping
    Zhu, Shuwei
    Wang, Wenping
    Fang, Wei
    Cui, Meiji
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (12) : 21098 - 21119
  • [49] Intrinsic-dimension analysis for guiding dimensionality reduction and data fusion in multi-omics data processing
    Gliozzo, Jessica
    Soto-Gomez, Mauricio
    Guarino, Valentina
    Bonometti, Arturo
    Cabri, Alberto
    Cavalleri, Emanuele
    Reese, Justin
    Robinson, Peter N.
    Mesiti, Marco
    Valentini, Giorgio
    Casiraghi, Elena
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2025, 160
  • [50] Clustering of single-cell multi-omics data with a multimodal deep learning method
    Xiang Lin
    Tian Tian
    Zhi Wei
    Hakon Hakonarson
    Nature Communications, 13