Mugen-UMAP: UMAP visualization and clustering of mutated genes in single-cell DNA sequencing data

被引:1
|
作者
Li, Teng [1 ,2 ]
Zou, Yiran [2 ]
Li, Xianghan [2 ]
Wong, Thomas K. F. [2 ,3 ]
Rodrigo, Allen G. [1 ,2 ]
机构
[1] Univ Auckland, Sch Biol Sci, Auckland, New Zealand
[2] Australian Natl Univ, Res Sch Biol, Canberra, ACT, Australia
[3] Australian Natl Univ, Sch Comp, Canberra, ACT, Australia
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
UMAP; Visualization; Clustering; Single-cell DNA sequencing; Gene mutation; CANCER; EVOLUTION; PATTERNS;
D O I
10.1186/s12859-024-05928-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundThe application of Uniform Manifold Approximation and Projection (UMAP) for dimensionality reduction and visualization has revolutionized the analysis of single-cell RNA expression and population genetics. However, its potential in single-cell DNA sequencing data analysis, particularly for visualizing gene mutation information, has not been fully explored.ResultsWe introduce Mugen-UMAP, a novel Python-based program that extends UMAP's utility to single-cell DNA sequencing data. This innovative tool provides a comprehensive pipeline for processing gene annotation files of single-cell somatic single-nucleotide variants and metadata to the visualization of UMAP projections for identifying clusters, along with various statistical analyses. Employing Mugen-UMAP, we analyzed whole-exome sequencing data from 365 single-cell samples across 12 non-small cell lung cancer (NSCLC) patients, revealing distinct clusters associated with histological subtypes of NSCLC. Moreover, to demonstrate the general utility of Mugen-UMAP, we applied the program to 9 additional single-cell WES datasets from various cancer types, uncovering interesting patterns of cell clusters that warrant further investigation. In summary, Mugen-UMAP provides a quick and effective visualization method to uncover cell cluster patterns based on the gene mutation information from single-cell DNA sequencing data.ConclusionsThe application of Mugen-UMAP demonstrates its capacity to provide valuable insights into the visualization and interpretation of single-cell DNA sequencing data. Mugen-UMAP can be found at https://github.com/tengchn/Mugen-UMAP
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Joint learning dimension reduction and clustering of single-cell RNA-sequencing data
    Wu, Wenming
    Ma, Xiaoke
    BIOINFORMATICS, 2020, 36 (12) : 3825 - 3832
  • [42] scGAAC: A graph attention autoencoder for clustering single-cell RNA-sequencing data
    Zhang, Lin
    Xiang, Haiping
    Wang, Feng
    Chen, Zepeng
    Shen, Mo
    Ma, Jiani
    Liu, Hui
    Zheng, Hongdang
    METHODS, 2024, 229 : 115 - 124
  • [43] Evaluation of single-cell classifiers for single-cell RNA sequencing data sets
    Zhao, Xinlei
    Wu, Shuang
    Fang, Nan
    Sun, Xiao
    Fan, Jue
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1581 - 1595
  • [44] Statistical method scDEED for detecting dubious 2D single-cell embeddings and optimizing t-SNE and UMAP hyperparameters
    Xia, Lucy
    Lee, Christy
    Li, Jingyi Jessica
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [45] Automatic Cell Type Annotation Using Marker Genes for Single-Cell RNA Sequencing Data
    Chen, Yu
    Zhang, Shuqin
    BIOMOLECULES, 2022, 12 (10)
  • [46] Stably expressed genes in single-cell RNA sequencing
    Deeke, Julie M.
    Gagnon-Bartsch, Johann A.
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2020, 18 (01)
  • [47] Statistical method scDEED for detecting dubious 2D single-cell embeddings and optimizing t-SNE and UMAP hyperparameters
    Lucy Xia
    Christy Lee
    Jingyi Jessica Li
    Nature Communications, 15
  • [48] scConsensus: combining supervised and unsupervised clustering for cell type identification in single-cell RNA sequencing data
    Ranjan, Bobby
    Schmidt, Florian
    Sun, Wenjie
    Park, Jinyu
    Honardoost, Mohammad Amin
    Tan, Joanna
    Arul Rayan, Nirmala
    Prabhakar, Shyam
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [49] scConsensus: combining supervised and unsupervised clustering for cell type identification in single-cell RNA sequencing data
    Bobby Ranjan
    Florian Schmidt
    Wenjie Sun
    Jinyu Park
    Mohammad Amin Honardoost
    Joanna Tan
    Nirmala Arul Rayan
    Shyam Prabhakar
    BMC Bioinformatics, 22
  • [50] BayesHammer: Bayesian clustering for error correction in single-cell sequencing
    Sergey I Nikolenko
    Anton I Korobeynikov
    Max A Alekseyev
    BMC Genomics, 14