Label-aware distance mitigates temporal and spatial variability for clustering and visualization of single-cell gene expression data

被引:0
|
作者
Shaoheng Liang
Jinzhuang Dou
Ramiz Iqbal
Ken Chen
机构
[1] Department of Bioinformatics and Computational Biology,Department of Computer Science
[2] Rice University,Ray and Stephanie Lane Computational Biology Department, School of Computer Science
[3] Carnegie Mellon University,undefined
来源
Communications Biology | / 7卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Clustering and visualization are essential parts of single-cell gene expression data analysis. The Euclidean distance used in most distance-based methods is not optimal. The batch effect, i.e., the variability among samples gathered from different times, tissues, and patients, introduces large between-group distance and obscures the true identities of cells. To solve this problem, we introduce Label-Aware Distance (Lad), a metric using temporal/spatial locality of the batch effect to control for such factors. We validate Lad on simulated data as well as apply it to a mouse retina development dataset and a lung dataset. We also found the utility of our approach in understanding the progression of the Coronavirus Disease 2019 (COVID-19). Lad provides better cell embedding than state-of-the-art batch correction methods on longitudinal datasets. It can be used in distance-based clustering and visualization methods to combine the power of multiple samples to help make biological findings.
引用
收藏
相关论文
共 50 条
  • [1] Label-aware distance mitigates temporal and spatial variability for clustering and visualization of single-cell gene expression data
    Liang, Shaoheng
    Dou, Jinzhuang
    Iqbal, Ramiz
    Chen, Ken
    COMMUNICATIONS BIOLOGY, 2024, 7 (01)
  • [2] Spatial reconstruction of single-cell gene expression data
    Rahul Satija
    Jeffrey A Farrell
    David Gennert
    Alexander F Schier
    Aviv Regev
    Nature Biotechnology, 2015, 33 : 495 - 502
  • [3] Spatial reconstruction of single-cell gene expression data
    Satija, Rahul
    Farrell, Jeffrey A.
    Gennert, David
    Schier, Alexander F.
    Regev, Aviv
    NATURE BIOTECHNOLOGY, 2015, 33 (05) : 495 - U206
  • [4] Differential variability analysis of single-cell gene expression data
    Liu, Jiayi
    Kreimer, Anat
    Li, Wei Vivian
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [5] Identification of spatial expression trends in single-cell gene expression data
    Daniel Edsgärd
    Per Johnsson
    Rickard Sandberg
    Nature Methods, 2018, 15 : 339 - 342
  • [6] Identification of spatial expression trends in single-cell gene expression data
    Edsgard, Daniel
    Johnsson, Per
    Sandberg, Rickard
    NATURE METHODS, 2018, 15 (05) : 339 - +
  • [7] ShinyCell: simple and sharable visualization of single-cell gene expression data
    Ouyang, John F.
    Kamaraj, Uma S.
    Cao, Elaine Y.
    Rackham, Owen J. L.
    BIOINFORMATICS, 2021, 37 (19) : 3374 - 3376
  • [8] Segmentation aware probabilistic phenotyping of single-cell spatial protein expression data
    Lee, Yuju
    Chen, Edward L. Y.
    Chan, Darren C. H.
    Dinesh, Anuroopa
    Afiuni-Zadeh, Somaieh
    Klamann, Conor
    Selega, Alina
    Mrkonjic, Miralem
    Jackson, Hartland W.
    Campbell, Kieran R.
    NATURE COMMUNICATIONS, 2025, 16 (01)
  • [9] scAce: an adaptive embedding and clustering method for single-cell gene expression data
    He, Xinwei
    Qian, Kun
    Wang, Ziqian
    Zeng, Shirou
    Li, Hongwei
    Li, Wei Vivian
    BIOINFORMATICS, 2023, 39 (09)
  • [10] GENE EXPRESSION Single-cell variability guided by microRNAs
    Garg, Salil
    Sharp, Phillip A.
    SCIENCE, 2016, 352 (6292) : 1390 - 1391