scAce: an adaptive embedding and clustering method for single-cell gene expression data

被引:3
|
作者
He, Xinwei [1 ]
Qian, Kun [1 ]
Wang, Ziqian [1 ]
Zeng, Shirou [1 ]
Li, Hongwei [1 ,4 ]
Li, Wei Vivian [2 ,3 ]
机构
[1] China Univ Geosci, Sch Math & Phys, Wuhan 430074, Peoples R China
[2] Univ Calif Riverside, Dept Stat, Riverside, CA 92521 USA
[3] Univ Calif Riverside, Dept Stat, 900 Univ Ave, Riverside, CA 92521 USA
[4] China Univ Geosci, Sch Math & Phys, 388 Lumo Rd, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金; 美国国家卫生研究院;
关键词
D O I
10.1093/bioinformatics/btad546
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Since the development of single-cell RNA sequencing (scRNA-seq) technologies, clustering analysis of single-cell gene expression data has been an essential tool for distinguishing cell types and identifying novel cell types. Even though many methods have been available for scRNA-seq clustering analysis, the majority of them are constrained by the requirement on predetermined cluster numbers or the dependence on selected initial cluster assignment.Results In this article, we propose an adaptive embedding and clustering method named scAce, which constructs a variational autoencoder to simultaneously learn cell embeddings and cluster assignments. In the scAce method, we develop an adaptive cluster merging approach which achieves improved clustering results without the need to estimate the number of clusters in advance. In addition, scAce provides an option to perform clustering enhancement, which can update and enhance cluster assignments based on previous clustering results from other methods. Based on computational analysis of both simulated and real datasets, we demonstrate that scAce outperforms state-of-the-art clustering methods for scRNA-seq data, and achieves better clustering accuracy and robustness.Availability and implementation The scAce package is implemented in python 3.8 and is freely available from https://github.com/sldyns/scAce.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Putative cell type discovery from single-cell gene expression data
    Zhichao Miao
    Pablo Moreno
    Ni Huang
    Irene Papatheodorou
    Alvis Brazma
    Sarah A. Teichmann
    Nature Methods, 2020, 17 : 621 - 628
  • [32] A downsampling method enables robust clustering and integration of single-cell transcriptome data
    Ren, Jun
    Zhang, Quan
    Zhou, Ying
    Hu, Yudi
    Lyu, Xuejing
    Fang, Hongkun
    Yang, Jing
    Yu, Rongshan
    Shi, Xiaodong
    Li, Qiyuan
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 130
  • [33] ShinyCell: simple and sharable visualization of single-cell gene expression data
    Ouyang, John F.
    Kamaraj, Uma S.
    Cao, Elaine Y.
    Rackham, Owen J. L.
    BIOINFORMATICS, 2021, 37 (19) : 3374 - 3376
  • [34] Differential gene expression analysis in single-cell RNA sequencing data
    Wang, Tianyu
    Nabavi, Sheida
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 202 - 207
  • [35] Cancer classification of single-cell gene expression data by neural network
    Kim, Bong-Hyun
    Yu, Kijin
    Lee, Peter C. W.
    BIOINFORMATICS, 2020, 36 (05) : 1360 - 1366
  • [36] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Tian, Yu
    Zheng, Ruiqing
    Liang, Zhenlan
    Li, Suning
    Wu, Fang-Xiang
    Li, Min
    TSINGHUA SCIENCE AND TECHNOLOGY, 2021, 26 (05) : 772 - 789
  • [37] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Yu Tian
    Ruiqing Zheng
    Zhenlan Liang
    Suning Li
    Fang-Xiang Wu
    Min Li
    TsinghuaScienceandTechnology, 2021, 26 (05) : 772 - 789
  • [38] SAFE-clustering: Single-cell Aggregated (from Ensemble) clustering for single-cell RNA-seq data
    Yang, Yuchen
    Huh, Ruth
    Culpepper, Houston W.
    Lin, Yuan
    Love, Michael I.
    Li, Yun
    BIOINFORMATICS, 2019, 35 (08) : 1269 - 1277
  • [39] Gene selection and clustering of single-cell data based on Fisher score and genetic algorithm
    Junhong Feng
    Jie Zhang
    Xiaoshu Zhu
    Jian-Hong Wang
    The Journal of Supercomputing, 2023, 79 : 7067 - 7093
  • [40] Label-aware distance mitigates temporal and spatial variability for clustering and visualization of single-cell gene expression data
    Liang, Shaoheng
    Dou, Jinzhuang
    Iqbal, Ramiz
    Chen, Ken
    COMMUNICATIONS BIOLOGY, 2024, 7 (01)