Label-aware distance mitigates temporal and spatial variability for clustering and visualization of single-cell gene expression data

被引:0
|
作者
Shaoheng Liang
Jinzhuang Dou
Ramiz Iqbal
Ken Chen
机构
[1] Department of Bioinformatics and Computational Biology,Department of Computer Science
[2] Rice University,Ray and Stephanie Lane Computational Biology Department, School of Computer Science
[3] Carnegie Mellon University,undefined
来源
Communications Biology | / 7卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Clustering and visualization are essential parts of single-cell gene expression data analysis. The Euclidean distance used in most distance-based methods is not optimal. The batch effect, i.e., the variability among samples gathered from different times, tissues, and patients, introduces large between-group distance and obscures the true identities of cells. To solve this problem, we introduce Label-Aware Distance (Lad), a metric using temporal/spatial locality of the batch effect to control for such factors. We validate Lad on simulated data as well as apply it to a mouse retina development dataset and a lung dataset. We also found the utility of our approach in understanding the progression of the Coronavirus Disease 2019 (COVID-19). Lad provides better cell embedding than state-of-the-art batch correction methods on longitudinal datasets. It can be used in distance-based clustering and visualization methods to combine the power of multiple samples to help make biological findings.
引用
收藏
相关论文
共 50 条
  • [31] Deciphering single-cell gene expression variability and its role in drug response
    Liu, Sizhe
    Chen, Liang
    HUMAN MOLECULAR GENETICS, 2024, 33 (23) : 2024 - 2034
  • [32] A comprehensive survey of dimensionality reduction and clustering methods for single-cell and spatial transcriptomics data
    Sun, Yidi
    Kong, Lingling
    Huang, Jiayi
    Deng, Hongyan
    Bian, Xinling
    Li, Xingfeng
    Cui, Feifei
    Dou, Lijun
    Cao, Chen
    Zou, Quan
    Zhang, Zilong
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2024,
  • [33] Putative cell type discovery from single-cell gene expression data
    Miao, Zhichao
    Moreno, Pablo
    Huang, Ni
    Papatheodorou, Irene
    Brazma, Alvis
    Teichmann, Sarah A.
    NATURE METHODS, 2020, 17 (06) : 621 - +
  • [34] Putative cell type discovery from single-cell gene expression data
    Zhichao Miao
    Pablo Moreno
    Ni Huang
    Irene Papatheodorou
    Alvis Brazma
    Sarah A. Teichmann
    Nature Methods, 2020, 17 : 621 - 628
  • [35] Temporal dynamics and transcriptional control using single-cell gene expression analysis
    Tsukasa Kouno
    Michiel de Hoon
    Jessica C Mar
    Yasuhiro Tomaru
    Mitsuoki Kawano
    Piero Carninci
    Harukazu Suzuki
    Yoshihide Hayashizaki
    Jay W Shin
    Genome Biology, 14
  • [36] Temporal dynamics and transcriptional control using single-cell gene expression analysis
    Kouno, Tsukasa
    de Hoon, Michiel
    Mar, Jessica C.
    Tomaru, Yasuhiro
    Kawano, Mitsuoki
    Carninci, Piero
    Suzuki, Harukazu
    Hayashizaki, Yoshihide
    Shin, Jay W.
    GENOME BIOLOGY, 2013, 14 (10):
  • [37] Bayesian Joint Modeling of Single-Cell Expression Data and Bulk Spatial Transcriptomic Data
    Yu, Jinge
    Wu, Qiuyu
    Luo, Xiangyu
    STATISTICS IN BIOSCIENCES, 2023, 15 (03) : 719 - 733
  • [38] Recovering Spatially-Varying Cell-Specific Gene Co-expression Networks for Single-Cell Spatial Expression Data
    Yu, Jinge
    Luo, Xiangyu
    FRONTIERS IN GENETICS, 2021, 12
  • [39] Bayesian Joint Modeling of Single-Cell Expression Data and Bulk Spatial Transcriptomic Data
    Jinge Yu
    Qiuyu Wu
    Xiangyu Luo
    Statistics in Biosciences, 2023, 15 : 719 - 733
  • [40] Multi-distance based spectral embedding fusion for clustering single-cell methylation data
    Tian, Qi
    Zou, Jianxiao
    Tang, Jianxiong
    Fan, Shicai
    2021 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2021, : 238 - 245