GSTRPCA: irregular tensor singular value decomposition for single-cell multi-omics data clustering

被引:0
|
作者
Cui, Lubin [1 ]
Guo, Guiliang [1 ]
Ng, Michael K. [2 ]
Zou, Quan [3 ]
Qiu, Yushan [4 ]
机构
[1] Henan Normal Univ, Sch Math & Stat, Xinxiang 453007, Peoples R China
[2] Hong Kong Baptist Univ, Dept Math, Hong Kong 999077, Peoples R China
[3] Elect Sci & Technol Univ, Inst Fundamental & Frontier Sci, Chengdu 611731, Peoples R China
[4] Shenzhen Univ, Sch Math Sci, Shenzhen 518000, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
single-cell multi-omics data; irregular tensor decomposition; weighted threshold; joint tensor; PROTEINS;
D O I
10.1093/bib/bbae649
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell multi-omics refers to the various types of biological data at the single-cell level. These data have enabled insight and resolution to cellular phenotypes, biological processes, and developmental stages. Current advances hold high potential for breakthroughs by integrating multiple different omics layers. However, singlecell multi-omics data usually have different feature dimensions and direct or indirect relationships. How to keep the data structure of these different data and extract hidden relationships is a major challenge for omics data integration, and effective integration models are urgently needed. In this paper, we propose an irregular tensor decomposition model (GSTRPCA) based on tensor robust principal component analysis (TRPCA). We developed a weighted threshold model for the decomposition of irregular tensor data by combining low-rank and sparsity constraints, which requires that the low-dimensional embeddings of the data remain lowrank and sparse. The major advantage of the GSTRPCA algorithm is its ability to keep the original data structure and explore hidden related features among omics data. For GSTRPCA, we also designed an effective algorithm that theoretically guarantees global convergence for the tensor decomposition. The computational experiments on irregular tensor datasets demonstrate that GSTRPCA significantly outperformed the state-of-the-art methods and hence confirm the superiority of GSTRPCA in clustering single-cell multiomics data. To our knowledge, this is the first tensor decomposition method for irregular tensor data to keep the data structure and hence improve the clustering performance for single-cell multi-omics data. GSTRPCA is a Matlabbased algorithm, and the code is available from https://github.com/GGL-B/GSTRPCA.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] How single-cell multi-omics builds relationships
    Marx, Vivien
    NATURE METHODS, 2022, 19 (02) : 142 - 146
  • [32] Single-cell sequencing to multi-omics: technologies and applications
    Wu, Xiangyu
    Yang, Xin
    Dai, Yunhan
    Zhao, Zihan
    Zhu, Junmeng
    Guo, Hongqian
    Yang, Rong
    BIOMARKER RESEARCH, 2024, 12 (01)
  • [33] EpiDamID, a new single-cell multi-omics tool
    Dorothy Clyde
    Nature Reviews Genetics, 2022, 23 : 323 - 323
  • [34] Computational strategies for single-cell multi-omics integration
    Adossa, Nigatu
    Khan, Sofia
    Rytkonen, Kalle T.
    Elo, Laura L.
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 2588 - 2596
  • [35] Methods and applications for single-cell and spatial multi-omics
    Vandereyken, Katy
    Sifrim, Alejandro
    Thienpont, Bernard
    Voet, Thierry
    NATURE REVIEWS GENETICS, 2023, 24 (08) : 494 - 515
  • [36] Arsenal of single-cell multi-omics methods expanded
    Lin Tang
    Nature Methods, 2021, 18 : 858 - 858
  • [37] Methods and applications for single-cell and spatial multi-omics
    Katy Vandereyken
    Alejandro Sifrim
    Bernard Thienpont
    Thierry Voet
    Nature Reviews Genetics, 2023, 24 : 494 - 515
  • [38] Identifying Cancer Biomarkers with Single-Cell Multi-Omics
    Genetic Engineering and Biotechnology News, 2021, 41 (07): : 48 - 53
  • [39] EpiDamID, a new single-cell multi-omics tool
    不详
    NATURE REVIEWS GENETICS, 2022, 23 (6) : 323 - 323
  • [40] Single-cell multi-omics in the medicinal plant Catharanthusroseus
    Li, Chenxin
    Wood, Joshua C.
    Vu, Anh Hai
    Hamilton, John P.
    Lopez, Carlos Eduardo Rodriguez
    Payne, Richard M. E.
    Guerrero, Delia Ayled Serna
    Gase, Klaus
    Yamamoto, Kotaro
    Vaillancourt, Brieanne
    Caputi, Lorenzo
    O'Connor, Sarah E.
    Buell, C. Robin
    NATURE CHEMICAL BIOLOGY, 2023, 19 (08) : 1031 - +