Stable biomarker discovery in multi-omics data via canonical correlation analysis

被引：0

作者：

Pusa, Taneli ^{[1
]}

Rousu, Juho ^{[1
]}

机构：

[1] Aalto Univ, Dept Comp Sci, Espoo, Finland

来源：

PLOS ONE | 2024年 / 19卷 / 09期

基金：

芬兰科学院;

关键词：

INFLAMMATORY-BOWEL-DISEASE; GUT MICROBIOTA; STABILITY; SETS;

D O I：

10.1371/journal.pone.0309921

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Multi-omics analysis offers a promising avenue to a better understanding of complex biological phenomena. In particular, untangling the pathophysiology of multifactorial health conditions such as the inflammatory bowel disease (IBD) could benefit from simultaneous consideration of several omics levels. However, taking full advantage of multi-omics data requires the adoption of suitable new tools. Multi-view learning, a machine learning technique that natively joins together heterogeneous data, is a natural source for such methods. Here we present a new approach to variable selection in unsupervised multi-view learning by applying stability selection to canonical correlation analysis (CCA). We apply our method, StabilityCCA, to simulated and real multi-omics data, and demonstrate its ability to find relevant variables and improve the stability of variable selection. In a case study on an IBD microbiome data set, we link together metagenomics and metabolomics, revealing a connection between their joint structure and the disease, and identifying potential biomarkers. Our results showcase the usefulness of multi-view learning in multi-omics analysis and demonstrate StabilityCCA as a powerful tool for biomarker discovery.

引用

页数：17