Automatically Detecting Anchor Cells and Clustering for scRNA-Seq Data Using scTSNN

被引:0
|
作者
Liu, Qiaoming [1 ,2 ]
Zhang, Dandan [3 ]
Wang, Dong [4 ]
Wang, Guohua [5 ]
Wang, Yadong [1 ,2 ]
机构
[1] Harbin Inst Technol, Sch Med & Hlth, Zhengzhou 150001, Peoples R China
[2] Harbin Inst Technol, Zhengzhou Res Inst, Zhengzhou 150001, Peoples R China
[3] Harbin Med Univ, Dept Obstet & Gynecol, Affiliated Hosp 1, Harbin 150001, Peoples R China
[4] North China Elect Power Univ, Dept Comp, Baoding 071003, Peoples R China
[5] Harbin Inst Technol, Fac Comp, Harbin 150001, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Tensors; Mathematical models; Synthetic data; Clustering algorithms; Noise; Accuracy; Spatial databases; Clustering; single-cell RNA-seq; anchor graph; tensor graph; cell heterogeneity; COMPUTER VISION; DENSITY PEAKS;
D O I
10.1109/JBHI.2024.3460761
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Advancing in single-cell RNA sequencing techniques enhances the resolution of cell heterogeneity study. Density-based unsupervised clustering has the potential to detect the representative anchor points and the number of clusters automatically. Meanwhile, discovering the true cell type of scRNA-seq data in the unsupervised scenario is still challenging. To this end, we proposed a tensor shared nearest neighbor anchor clustering for scRNA-seq data, named scTSNN, which first makes use of the tensor affinity learning module to mine the local-global balanced topological structures among cells, next designs density-based shared nearest neighbor measurement method to automatically detect anchor cells, finally partitions the non-anchor cells to obtain the clustering results. Validated on synthetic datasets and scRNA-seq datasets, scTSNN not only exactly detects the complicated structures but also has better performance in accuracy and robustness compared with the state-of-the-art methods. Moreover, case studies on mammalian cells and cervical cancer tumor cells demonstrate the selected anchor cells of scTSNN benefit the cell pseudotime inference and rare cell identification, which show good application and research value of scTSNN.
引用
收藏
页码:7015 / 7027
页数:13
相关论文
共 50 条
  • [1] Recursive Clustering of Cellular Diversity in scRNA-Seq Data
    Squires, Michael
    Qiu, Peng
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2025,
  • [2] Obstacles to detecting isoforms using full-length scRNA-seq data
    Westoby, Jennifer
    Artemov, Pavel
    Hemberg, Martin
    Ferguson-Smith, Anne
    GENOME BIOLOGY, 2020, 21 (01)
  • [3] Obstacles to detecting isoforms using full-length scRNA-seq data
    Jennifer Westoby
    Pavel Artemov
    Martin Hemberg
    Anne Ferguson-Smith
    Genome Biology, 21
  • [4] Contrastive self-supervised clustering of scRNA-seq data
    Ciortan, Madalina
    Defrance, Matthieu
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [5] scDSSC: Deep Sparse Subspace Clustering for scRNA-seq Data
    Wang, HaiYun
    Zhao, JianPing
    Zheng, ChunHou
    Su, YanSen
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (12)
  • [6] GNN-based embedding for clustering scRNA-seq data
    Ciortan, Madalina
    Defrance, Matthieu
    BIOINFORMATICS, 2022, 38 (04) : 1037 - 1044
  • [7] Contrastive self-supervised clustering of scRNA-seq data
    Madalina Ciortan
    Matthieu Defrance
    BMC Bioinformatics, 22
  • [8] Deep embedded clustering with multiple objectives on scRNA-seq data
    Li, Xiangtao
    Zhang, Shixiong
    Wong, Ka-Chun
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [9] scTPC: a novel semisupervised deep clustering model for scRNA-seq data
    Qiu, Yushan
    Yang, Lingfei
    Jiang, Hao
    Zou, Quan
    BIOINFORMATICS, 2024, 40 (05)
  • [10] Predicting lung aging using scRNA-Seq data
    Song, Qi
    Singh, Alex
    Mcdonough, John E.
    Adams, Taylor S.
    Vos, Robin
    De Man, Ruben
    Myers, Greg
    Ceulemans, Laurens J.
    Vanaudenaerde, Bart M.
    Wuyts, Wim A.
    Yan, Xiting
    Schuppe, Jonas
    Hagood, James S.
    Kaminski, Naftali
    Bar-Joseph, Ziv
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (12)