Mugen-UMAP: UMAP visualization and clustering of mutated genes in single-cell DNA sequencing data

被引:1
|
作者
Li, Teng [1 ,2 ]
Zou, Yiran [2 ]
Li, Xianghan [2 ]
Wong, Thomas K. F. [2 ,3 ]
Rodrigo, Allen G. [1 ,2 ]
机构
[1] Univ Auckland, Sch Biol Sci, Auckland, New Zealand
[2] Australian Natl Univ, Res Sch Biol, Canberra, ACT, Australia
[3] Australian Natl Univ, Sch Comp, Canberra, ACT, Australia
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
UMAP; Visualization; Clustering; Single-cell DNA sequencing; Gene mutation; CANCER; EVOLUTION; PATTERNS;
D O I
10.1186/s12859-024-05928-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundThe application of Uniform Manifold Approximation and Projection (UMAP) for dimensionality reduction and visualization has revolutionized the analysis of single-cell RNA expression and population genetics. However, its potential in single-cell DNA sequencing data analysis, particularly for visualizing gene mutation information, has not been fully explored.ResultsWe introduce Mugen-UMAP, a novel Python-based program that extends UMAP's utility to single-cell DNA sequencing data. This innovative tool provides a comprehensive pipeline for processing gene annotation files of single-cell somatic single-nucleotide variants and metadata to the visualization of UMAP projections for identifying clusters, along with various statistical analyses. Employing Mugen-UMAP, we analyzed whole-exome sequencing data from 365 single-cell samples across 12 non-small cell lung cancer (NSCLC) patients, revealing distinct clusters associated with histological subtypes of NSCLC. Moreover, to demonstrate the general utility of Mugen-UMAP, we applied the program to 9 additional single-cell WES datasets from various cancer types, uncovering interesting patterns of cell clusters that warrant further investigation. In summary, Mugen-UMAP provides a quick and effective visualization method to uncover cell cluster patterns based on the gene mutation information from single-cell DNA sequencing data.ConclusionsThe application of Mugen-UMAP demonstrates its capacity to provide valuable insights into the visualization and interpretation of single-cell DNA sequencing data. Mugen-UMAP can be found at https://github.com/tengchn/Mugen-UMAP
引用
收藏
页数:8
相关论文
共 50 条
  • [31] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Tian, Yu
    Zheng, Ruiqing
    Liang, Zhenlan
    Li, Suning
    Wu, Fang-Xiang
    Li, Min
    TSINGHUA SCIENCE AND TECHNOLOGY, 2021, 26 (05) : 772 - 789
  • [32] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Yu Tian
    Ruiqing Zheng
    Zhenlan Liang
    Suning Li
    Fang-Xiang Wu
    Min Li
    TsinghuaScienceandTechnology, 2021, 26 (05) : 772 - 789
  • [33] Clustering and visualization of single-cell RNA-seq data using path metrics
    Manousidaki, Andriana
    Little, Anna
    Xie, Yuying
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (05)
  • [34] Topological Methods for Visualization and Analysis of High Dimensional Single-Cell RNA Sequencing Data
    Wang, Tongxin
    Johnson, Travis
    Zhang, Jie
    Huang, Kun
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2019, 2019, : 350 - 361
  • [35] Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study
    Feng, Chao
    Liu, Shufen
    Zhang, Hao
    Guan, Renchu
    Li, Dan
    Zhou, Fengfeng
    Liang, Yanchun
    Feng, Xiaoyue
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (06)
  • [36] Evaluating the performance of dropout imputation and clustering methods for single-cell RNA sequencing data
    Xu, Junlin
    Cui, Lingyu
    Zhuang, Jujuan
    Meng, Yajie
    Bing, Pingping
    He, Binsheng
    Tian, Geng
    Pui, Choi Kwok
    Wu, Taoyang
    Wang, Bing
    Yang, Jialiang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [37] Clustering single-cell rna-sequencing data based on matching clusters structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    Tehnicki Vjesnik, 2020, 27 (01): : 89 - 95
  • [38] Machine learning and statistical methods for clustering single-cell RNA-sequencing data
    Petegrosso, Raphael
    Li, Zhuliu
    Kuang, Rui
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1209 - 1223
  • [39] A HIERARCHICAL BAYESIAN MODEL FOR SINGLE-CELL CLUSTERING USING RNA-SEQUENCING DATA
    Liu, Yiyi
    Warren, Joshua L.
    Zhao, Hongyu
    ANNALS OF APPLIED STATISTICS, 2019, 13 (03): : 1733 - 1752
  • [40] Clustering Single-cell RNA-sequencing Data based on Matching Clusters Structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (01): : 89 - 95