SSBER: removing batch effect for single-cell RNA sequencing data

被引:4
|
作者
Zhang, Yin [1 ,2 ]
Wang, Fei [1 ,2 ]
机构
[1] Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Data integration; Batch effect; The shared cell type; Supervised cell type assignment; SEQ; EXPRESSION;
D O I
10.1186/s12859-021-04165-w
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background With the continuous maturity of sequencing technology, different laboratories or different sequencing platforms have generated a large amount of single-cell transcriptome sequencing data for the same or different tissues. Due to batch effects and high dimensions of scRNA data, downstream analysis often faces challenges. Although a number of algorithms and tools have been proposed for removing batch effects, the current mainstream algorithms have faced the problem of data overcorrection when the cell type composition varies greatly between batches. Results In this paper, we propose a novel method named SSBER by utilizing biological prior knowledge to guide the correction, aiming to solve the problem of poor batch-effect correction when the cell type composition differs greatly between batches. Conclusions SSBER effectively solves the above problems and outperforms other algorithms when the cell type structure among batches or distribution of cell population varies considerably, or some similar cell types exist across batches.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Tutorial: guidelines for the computational analysis of single-cell RNA sequencing data
    Andrews, Tallulah S.
    Kiselev, Vladimir Yu
    McCarthy, Davis
    Hemberg, Martin
    NATURE PROTOCOLS, 2021, 16 (01) : 1 - 9
  • [42] Mixed Distribution Models Based on Single-Cell RNA Sequencing Data
    Min Wu
    Junhua Xu
    Tao Ding
    Jie Gao
    Interdisciplinary Sciences: Computational Life Sciences, 2021, 13 : 362 - 370
  • [43] Resolution of the curse of dimensionality in single-cell RNA sequencing data analysis
    Imoto, Yusuke
    Nakamura, Tomonori
    Escolar, Emerson G.
    Yoshiwaki, Michio
    Kojima, Yoji
    Yabuta, Yukihiro
    Katou, Yoshitaka
    Yamamoto, Takuya
    Hiraoka, Yasuaki
    Saitou, Mitinori
    LIFE SCIENCE ALLIANCE, 2022, 5 (12)
  • [44] Single-Cell RNA Sequencing Data Interpretation by Evolutionary Multiobjective Clustering
    Li, Xiangtao
    Wong, Ka-Chun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (05) : 1773 - 1784
  • [45] splatPop: simulating population scale single-cell RNA sequencing data
    Azodi, Christina B.
    Zappia, Luke
    Oshlack, Alicia
    McCarthy, Davis J.
    GENOME BIOLOGY, 2021, 22 (01)
  • [46] scds: computational annotation of doublets in single-cell RNA sequencing data
    Bais, Abha S.
    Kostka, Dennis
    BIOINFORMATICS, 2020, 36 (04) : 1150 - 1158
  • [47] Tutorial: guidelines for the computational analysis of single-cell RNA sequencing data
    Tallulah S. Andrews
    Vladimir Yu Kiselev
    Davis McCarthy
    Martin Hemberg
    Nature Protocols, 2021, 16 : 1 - 9
  • [48] Differential gene expression analysis in single-cell RNA sequencing data
    Wang, Tianyu
    Nabavi, Sheida
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 202 - 207
  • [49] Accelerated dimensionality reduction of single-cell RNA sequencing data with fastglmpca
    Weine, Eric
    Carbonetto, Peter
    Stephens, Matthew
    BIOINFORMATICS, 2024, 40 (08)
  • [50] Self-assembling manifolds in single-cell RNA sequencing data
    Tarashansky, Alexander J.
    Xue, Yuan
    Li, Pengyang
    Quake, Stephen R.
    Wang, Bo
    ELIFE, 2019, 8