A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data

Cited by: 67
Authors
Xiang, Ruizhi [1]
Wang, Wencan [2, 3]
Yang, Lei [1]
Wang, Shiyuan [1]
Xu, Chaohan [1]
Chen, Xiaowen [1]
Affiliations
[1] Harbin Med Univ, Coll Bioinformat Sci & Technol, Harbin, Peoples R China
[2] Wenzhou Med Univ, Sch Optometry & Ophthalmol, Wenzhou, Peoples R China
[3] Wenzhou Med Univ, Eye Hosp, Wenzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
single-cell RNA-seq; dimension reduction; benchmark; sequences analysis; deep learning; GENE-EXPRESSION;
DOI
10.3389/fgene.2021.646936
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Single-cell RNA sequencing (scRNA-seq) is a high-throughput sequencing technology performed at the level of individual cells, and it has the potential to reveal cellular heterogeneity. However, scRNA-seq data are high-dimensional, noisy, and sparse. Dimension reduction is therefore an important step in the downstream analysis of scRNA-seq data, and several dimension reduction methods have been developed. We developed a strategy to evaluate the stability, accuracy, and computing cost of 10 dimensionality reduction methods using 30 simulation datasets and five real datasets. Additionally, we investigated the sensitivity of all the methods to hyperparameter tuning and offer users suggestions accordingly. We found that t-distributed stochastic neighbor embedding (t-SNE) yielded the best overall performance, with the highest accuracy but also the highest computing cost. Meanwhile, uniform manifold approximation and projection (UMAP) exhibited the highest stability, as well as moderate accuracy and the second highest computing cost. UMAP preserves the original cohesion and separation of cell populations well. In addition, users need to set the hyperparameters according to the specific situation before using dimensionality reduction methods based on non-linear models and neural networks.
Pages: 12