Mugen-UMAP: UMAP visualization and clustering of mutated genes in single-cell DNA sequencing data

被引:1
|
作者
Li, Teng [1 ,2 ]
Zou, Yiran [2 ]
Li, Xianghan [2 ]
Wong, Thomas K. F. [2 ,3 ]
Rodrigo, Allen G. [1 ,2 ]
机构
[1] Univ Auckland, Sch Biol Sci, Auckland, New Zealand
[2] Australian Natl Univ, Res Sch Biol, Canberra, ACT, Australia
[3] Australian Natl Univ, Sch Comp, Canberra, ACT, Australia
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
UMAP; Visualization; Clustering; Single-cell DNA sequencing; Gene mutation; CANCER; EVOLUTION; PATTERNS;
D O I
10.1186/s12859-024-05928-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundThe application of Uniform Manifold Approximation and Projection (UMAP) for dimensionality reduction and visualization has revolutionized the analysis of single-cell RNA expression and population genetics. However, its potential in single-cell DNA sequencing data analysis, particularly for visualizing gene mutation information, has not been fully explored.ResultsWe introduce Mugen-UMAP, a novel Python-based program that extends UMAP's utility to single-cell DNA sequencing data. This innovative tool provides a comprehensive pipeline for processing gene annotation files of single-cell somatic single-nucleotide variants and metadata to the visualization of UMAP projections for identifying clusters, along with various statistical analyses. Employing Mugen-UMAP, we analyzed whole-exome sequencing data from 365 single-cell samples across 12 non-small cell lung cancer (NSCLC) patients, revealing distinct clusters associated with histological subtypes of NSCLC. Moreover, to demonstrate the general utility of Mugen-UMAP, we applied the program to 9 additional single-cell WES datasets from various cancer types, uncovering interesting patterns of cell clusters that warrant further investigation. In summary, Mugen-UMAP provides a quick and effective visualization method to uncover cell cluster patterns based on the gene mutation information from single-cell DNA sequencing data.ConclusionsThe application of Mugen-UMAP demonstrates its capacity to provide valuable insights into the visualization and interpretation of single-cell DNA sequencing data. Mugen-UMAP can be found at https://github.com/tengchn/Mugen-UMAP
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Nonlinear dimensionality reduction based visualization of single-cell RNA sequencing data
    Mohamed Yousuff
    Rajasekhara Babu
    Anand Rathinam
    Journal of Analytical Science and Technology, 15
  • [22] SIEVE: identifying robust single cell variable genes for single-cell RNA sequencing data
    Zhang, Yinan
    Xie, Xiaowei
    Wu, Peng
    Zhu, Ping
    BLOOD SCIENCE, 2021, 3 (02): : 35 - 39
  • [23] Consensus Nature Inspired Clustering of Single-Cell RNA-Sequencing Data
    Abou El-Naga, Amany H.
    Sayed, Sabah
    Salah, Akram
    Mohsen, Heba
    IEEE ACCESS, 2022, 10 : 98079 - 98094
  • [24] SUSCC: Secondary Construction of Feature Space based on UMAP for Rapid and Accurate Clustering Large-scale Single Cell RNA-seq Data
    Wang, Hai-Yun
    Zhao, Jian-ping
    Zheng, Chun-Hou
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (01) : 83 - 90
  • [25] SUSCC: Secondary Construction of Feature Space based on UMAP for Rapid and Accurate Clustering Large-scale Single Cell RNA-seq Data
    Hai-Yun Wang
    Jian-ping Zhao
    Chun-Hou Zheng
    Interdisciplinary Sciences: Computational Life Sciences, 2021, 13 : 83 - 90
  • [26] Epiclomal: Probabilistic clustering of sparse single-cell DNA methylation data
    P. E. de Souza, Camila
    Andronescu, Mirela
    Masud, Tehmina
    Kabeer, Farhia
    Biele, Justina
    Laks, Emma
    Lai, Daniel
    Ye, Patricia
    Brimhall, Jazmine
    Wang, Beixi
    Su, Edmund
    Hui, Tony
    Cao, Qi
    Wong, Marcus
    Moksa, Michelle
    Moore, Richard A.
    Hirst, Martin
    Aparicio, Samuel
    Shah, Sohrab P.
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (09)
  • [27] Semisoft clustering of single-cell data
    Zhu, Lingxue
    Lei, Jing
    Klei, Lambertus
    Devlin, Bernie
    Roeder, Kathryn
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (02) : 466 - 471
  • [28] SCSilicon: a tool for synthetic single-cell DNA sequencing data generation
    Xikang Feng
    Lingxi Chen
    BMC Genomics, 23
  • [29] Somatic variant calling from single-cell DNA sequencing data
    Valecha, Monica
    Posada, David
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 2978 - 2985
  • [30] SCSilicon: a tool for synthetic single-cell DNA sequencing data generation
    Feng, Xikang
    Chen, Lingxi
    BMC GENOMICS, 2022, 23 (SUPPL 4)