scAce: an adaptive embedding and clustering method for single-cell gene expression data

被引:3
|
作者
He, Xinwei [1 ]
Qian, Kun [1 ]
Wang, Ziqian [1 ]
Zeng, Shirou [1 ]
Li, Hongwei [1 ,4 ]
Li, Wei Vivian [2 ,3 ]
机构
[1] China Univ Geosci, Sch Math & Phys, Wuhan 430074, Peoples R China
[2] Univ Calif Riverside, Dept Stat, Riverside, CA 92521 USA
[3] Univ Calif Riverside, Dept Stat, 900 Univ Ave, Riverside, CA 92521 USA
[4] China Univ Geosci, Sch Math & Phys, 388 Lumo Rd, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金; 美国国家卫生研究院;
关键词
D O I
10.1093/bioinformatics/btad546
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Since the development of single-cell RNA sequencing (scRNA-seq) technologies, clustering analysis of single-cell gene expression data has been an essential tool for distinguishing cell types and identifying novel cell types. Even though many methods have been available for scRNA-seq clustering analysis, the majority of them are constrained by the requirement on predetermined cluster numbers or the dependence on selected initial cluster assignment.Results In this article, we propose an adaptive embedding and clustering method named scAce, which constructs a variational autoencoder to simultaneously learn cell embeddings and cluster assignments. In the scAce method, we develop an adaptive cluster merging approach which achieves improved clustering results without the need to estimate the number of clusters in advance. In addition, scAce provides an option to perform clustering enhancement, which can update and enhance cluster assignments based on previous clustering results from other methods. Based on computational analysis of both simulated and real datasets, we demonstrate that scAce outperforms state-of-the-art clustering methods for scRNA-seq data, and achieves better clustering accuracy and robustness.Availability and implementation The scAce package is implemented in python 3.8 and is freely available from https://github.com/sldyns/scAce.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Gene selection and clustering of single-cell data based on Fisher score and genetic algorithm
    Feng, Junhong
    Zhang, Jie
    Zhu, Xiaoshu
    Wang, Jian-Hong
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (07): : 7067 - 7093
  • [42] Label-aware distance mitigates temporal and spatial variability for clustering and visualization of single-cell gene expression data
    Shaoheng Liang
    Jinzhuang Dou
    Ramiz Iqbal
    Ken Chen
    Communications Biology, 7
  • [43] Comparison of Gene Selection Methods for Clustering Single-cell RNA-seq Data
    Zhu, Xiaoshu
    Wang, Jianxin
    Li, Rongruan
    Peng, Xiaoqing
    CURRENT BIOINFORMATICS, 2023, 18 (01) : 1 - 11
  • [44] Synthesising Executable Gene Regulatory Networks from Single-Cell Gene Expression Data
    Fisher, Jasmin
    Koeksal, Ali Sinan
    Piterman, Nir
    Woodhouse, Steven
    COMPUTER AIDED VERIFICATION, PT I, 2015, 9206 : 544 - 560
  • [45] Joint learning of multiple gene networks from single-cell gene expression data
    Wu, Nuosi
    Yin, Fu
    Ou-Yang, Le
    Zhu, Zexuan
    Xie, Weixin
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 2583 - 2595
  • [46] scSTEM: clustering pseudotime ordered single-cell data
    Song, Qi
    Wang, Jingtao
    Bar-Joseph, Ziv
    GENOME BIOLOGY, 2022, 23 (01)
  • [47] scSTEM: clustering pseudotime ordered single-cell data
    Qi Song
    Jingtao Wang
    Ziv Bar-Joseph
    Genome Biology, 23
  • [48] Orthogonal multimodality integration and clustering in single-cell data
    Liu, Yufang
    Chen, Yongkai
    Lu, Haoran
    Zhong, Wenxuan
    Yuan, Guo-Cheng
    Ma, Ping
    BMC BIOINFORMATICS, 2024, 25 (01)
  • [49] Clustering single-cell multimodal omics data with jrSiCKLSNMF
    Ellis, Dorothy
    Roy, Arkaprava
    Datta, Susmita
    FRONTIERS IN GENETICS, 2023, 14
  • [50] Unsupervised embedding of single-cell Hi-C data
    Liu, Jie
    Lin, Dejun
    Yardimci, Galip Gurkan
    Noble, William Stafford
    BIOINFORMATICS, 2018, 34 (13) : 96 - 104