GSTRPCA: irregular tensor singular value decomposition for single-cell multi-omics data clustering

被引:0
|
作者
Cui, Lubin [1 ]
Guo, Guiliang [1 ]
Ng, Michael K. [2 ]
Zou, Quan [3 ]
Qiu, Yushan [4 ]
机构
[1] Henan Normal Univ, Sch Math & Stat, Xinxiang 453007, Peoples R China
[2] Hong Kong Baptist Univ, Dept Math, Hong Kong 999077, Peoples R China
[3] Elect Sci & Technol Univ, Inst Fundamental & Frontier Sci, Chengdu 611731, Peoples R China
[4] Shenzhen Univ, Sch Math Sci, Shenzhen 518000, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
single-cell multi-omics data; irregular tensor decomposition; weighted threshold; joint tensor; PROTEINS;
D O I
10.1093/bib/bbae649
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell multi-omics refers to the various types of biological data at the single-cell level. These data have enabled insight and resolution to cellular phenotypes, biological processes, and developmental stages. Current advances hold high potential for breakthroughs by integrating multiple different omics layers. However, singlecell multi-omics data usually have different feature dimensions and direct or indirect relationships. How to keep the data structure of these different data and extract hidden relationships is a major challenge for omics data integration, and effective integration models are urgently needed. In this paper, we propose an irregular tensor decomposition model (GSTRPCA) based on tensor robust principal component analysis (TRPCA). We developed a weighted threshold model for the decomposition of irregular tensor data by combining low-rank and sparsity constraints, which requires that the low-dimensional embeddings of the data remain lowrank and sparse. The major advantage of the GSTRPCA algorithm is its ability to keep the original data structure and explore hidden related features among omics data. For GSTRPCA, we also designed an effective algorithm that theoretically guarantees global convergence for the tensor decomposition. The computational experiments on irregular tensor datasets demonstrate that GSTRPCA significantly outperformed the state-of-the-art methods and hence confirm the superiority of GSTRPCA in clustering single-cell multiomics data. To our knowledge, this is the first tensor decomposition method for irregular tensor data to keep the data structure and hence improve the clustering performance for single-cell multi-omics data. GSTRPCA is a Matlabbased algorithm, and the code is available from https://github.com/GGL-B/GSTRPCA.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] The technological landscape and applications of single-cell multi-omics
    Baysoy, Alev
    Bai, Zhiliang
    Satija, Rahul
    Fan, Rong
    NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2023, 24 (10) : 695 - 713
  • [42] The technological landscape and applications of single-cell multi-omics
    Alev Baysoy
    Zhiliang Bai
    Rahul Satija
    Rong Fan
    Nature Reviews Molecular Cell Biology, 2023, 24 : 695 - 713
  • [43] How single-cell multi-omics builds relationships
    Vivien Marx
    Nature Methods, 2022, 19 : 142 - 146
  • [44] Applications of single-cell multi-omics in liver cancer
    Peeters, Frederik
    Cappuyns, Sarah
    Pique-Gili, Marta
    Phillips, Gino
    Verslype, Chris
    Lambrechts, Diether
    Dekervel, Jeroen
    JHEP REPORTS, 2024, 6 (07)
  • [45] Scbean: a python']python library for single-cell multi-omics data analysis
    Zhang, Haohui
    Wang, Yuwei
    Lian, Bin
    Wang, Yiran
    Li, Xingyi
    Wang, Tao
    Shang, Xuequn
    Yang, Hui
    Aziz, Ahmad
    Hu, Jialu
    BIOINFORMATICS, 2024, 40 (02)
  • [46] Molecular mechanisms reconstruction from single-cell multi-omics data with HuMMuS
    Trimbour, Remi
    Deutschmann, Ina Maria
    Cantini, Laura
    BIOINFORMATICS, 2024, 40 (05)
  • [47] Multi-omics at single-cell resolution: comparison of experimental and data fusion approaches
    Leonavicius, Karolis
    Nainys, Juozas
    Kuciauskas, Dalius
    Mazutis, Linas
    CURRENT OPINION IN BIOTECHNOLOGY, 2019, 55 : 159 - 166
  • [48] Integration of single-cell multi-omics data by regression analysis on unpaired observations
    Qiuyue Yuan
    Zhana Duren
    Genome Biology, 23
  • [49] Multimodal deep learning approaches for single-cell multi-omics data integration
    Athaya, Tasbiraha
    Ripan, Rony Chowdhury
    Li, Xiaoman
    Hu, Haiyan
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [50] GLUER: integrative analysis of multi-omics data at single-cell resolution.
    Peng, Tao
    Pourfarhangi, Kamyar Esmaeili
    Tan, Kai
    CANCER RESEARCH, 2020, 80 (21)