Deep latent space fusion for adaptive representation of heterogeneous multi-omics data

被引:28
|
作者
Zhang, Chengming
Chen, Yabin
Zeng, Tao
Zhang, Chuanchao
Chen, Luonan
机构
[1] School of Mathematics and Statistics, Shandong University
[2] School of Life and Pharmaceutical Sciences, Dalian University of Technology
[3] Wuhan University, Wuhan
[4] The Huazhong University of Science and Technology, Wuhan
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
deep learning; latent space fusion; adaptive representation; omics data; complex disease; DATA INTEGRATION; VARIABLE MODEL; CANCER; NETWORK; BIOMARKERS; CLASSIFICATION; IDENTIFICATION; DISEASES; BREAST;
D O I
10.1093/bib/bbab600
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The integration of multi-omics data makes it possible to understand complex biological organisms at the system level. Numerous integration approaches have been developed by assuming a common underlying data space. Due to the noise and heterogeneity of biological data, the performance of these approaches is greatly affected. In this work, we propose a novel deep neural network architecture, named Deep Latent Space Fusion (DLSF), which integrates the multi-omics data by learning consistent manifold in the sample latent space for disease subtypes identification. DLSF is built upon a cycle autoencoder with a shared self-expressive layer, which can naturally and adaptively merge nonlinear features at each omics level into one unified sample manifold and produce adaptive representation of heterogeneous samples at the multi-omics level. We have assessed DLSF on various biological and biomedical datasets to validate its effectiveness. DLSF can efficiently and accurately capture the intrinsic manifold of the sample structures or sample clusters compared with other state-of-the-art methods, and DLSF yielded more significant outcomes for biological significance, survival prognosis and clinical relevance in application of cancer study in The Cancer Genome Atlas. Notably, as a deep case study, we determined a new molecular subtype of kidney renal clear cell carcinoma that may benefit immunotherapy in the viewpoint of multi-omics, and we further found potential subtype-specific biomarkers from multiple omics data, which were validated by independent datasets. In addition, we applied DLSF to identify potential therapeutic agents of different molecular subtypes of chronic lymphocytic leukemia, demonstrating the scalability of DLSF in diverse omics data types and application scenarios.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] The Omics Dashboard for Interactive Exploration of Metabolomics and Multi-Omics Data
    Paley, Suzanne
    Karp, Peter D.
    METABOLITES, 2024, 14 (01)
  • [32] CVQVAE: A representation learning based method for multi-omics single cell data integration
    Liu, Tianyu
    Greenberg, Grant
    Shomorony, Ilan
    MACHINE LEARNING IN COMPUTATIONAL BIOLOGY, VOL 200, 2022, 200
  • [33] Multi-omics Visualization Platform: An extensible Galaxy plug-in for multi-omics data visualization and exploration
    McGowan, Thomas
    Johnson, James E.
    Kumar, Praveen
    Sajulga, Ray
    Mehta, Subina
    Jagtap, Pratik D.
    Griffin, Timothy J.
    GIGASCIENCE, 2020, 9 (04):
  • [34] Multi-omics at single-cell resolution: comparison of experimental and data fusion approaches
    Leonavicius, Karolis
    Nainys, Juozas
    Kuciauskas, Dalius
    Mazutis, Linas
    CURRENT OPINION IN BIOTECHNOLOGY, 2019, 55 : 159 - 166
  • [35] A Similarity Regression Fusion Model for Integrating Multi-Omics Data to Identify Cancer Subtypes
    Guo, Yang
    Zheng, Jianning
    Shang, Xuequn
    Li, Zhanhuai
    GENES, 2018, 9 (07):
  • [36] Learning an Adaptive Self-expressive Fusion Model for Multi-omics Cancer Subtype Prediction
    Cai, Yueyi
    Zhou, Nan
    Zhao, Junran
    Wang, Shunfang
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PT I, ISBRA 2024, 2024, 14954 : 274 - 285
  • [37] Progressively Helical Multi-Omics Data Fusion GCN and Its Application in Lung Adenocarcinoma
    Zhu, Junxuan
    Zhang, Jinhan
    Wang, Liyan
    Huang, Hao
    Zhang, Zhibo
    Song, Kai
    Zhang, Xiaofei
    IEEE ACCESS, 2023, 11 : 73568 - 73582
  • [38] Attention-Fusion Model for Multi-omics (AMMO) Data Integration in Lung Adenocarcinoma
    Li, Wentao
    Muneer, Amgad
    Waqas, Muhammad
    Zhou, Xiaobo
    Wu, Jia
    COMPUTATIONAL MATHEMATICS MODELING IN CANCER ANALYSIS, CMMCA 2024, 2025, 15181 : 52 - 60
  • [39] Making multi-omics data accessible to researchers
    Conesa, Ana
    Beck, Stephan
    SCIENTIFIC DATA, 2019, 6 (1)
  • [40] A cloud solution for multi-omics data integration
    Tordini, Fabio
    2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 559 - 566