Benchmarking of cell type deconvolution pipelines for transcriptomics data

被引:221
|
作者
Cobos, Francisco Avila [1 ,2 ,3 ]
Alquicira-Hernandez, Jose [3 ,4 ]
Powell, Joseph E. [3 ,4 ]
Mestdagh, Pieter [1 ,2 ]
De Preter, Katleen [1 ,2 ]
机构
[1] Univ Ghent, Ctr Med Genet Ghent, Dept Biomol Med, Ghent, Belgium
[2] Canc Res Inst Ghent CRIG, Ghent, Belgium
[3] Garvan Inst Med Res, Garvan Weizmann Ctr Cellular Genom, Sydney, NSW, Australia
[4] Univ Queensland, Inst Mol Biosci, Brisbane, Qld, Australia
基金
欧盟地平线“2020”;
关键词
NORMALIZATION; SIGNATURES;
D O I
10.1038/s41467-020-19015-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Many computational methods have been developed to infer cell type proportions from bulk transcriptomics data. However, an evaluation of the impact of data transformation, pre-processing, marker selection, cell type composition and choice of methodology on the deconvolution results is still lacking. Using five single-cell RNA-sequencing (scRNA-seq) datasets, we generate pseudo-bulk mixtures to evaluate the combined impact of these factors. Both bulk deconvolution methodologies and those that use scRNA-seq data as reference perform best when applied to data in linear scale and the choice of normalization has a dramatic impact on some, but not all methods. Overall, methods that use scRNA-seq data have comparable performance to the best performing bulk methods whereas semi-supervised approaches show higher error values. Moreover, failure to include cell types in the reference that are present in a mixture leads to substantially worse results, regardless of the previous choices. Altogether, we evaluate the combined impact of factors affecting the deconvolution task across different datasets and propose general guidelines to maximize its performance. Inferring cell type proportions from transcriptomics data is affected by data transformation, normalization, choice of method and the markers used. Here, the authors use single-cell RNAseq datasets to evaluate the impact of these factors and propose guidelines to maximise deconvolution performance.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A hitchhiker's guide to single-cell transcriptomics and data analysis pipelines
    Nayak, Richa
    Hasija, Yasha
    GENOMICS, 2021, 113 (02) : 606 - 619
  • [22] Benchmarking variational AutoEncoders on cancer transcriptomics data
    Eltager, Mostafa
    Abdelaal, Tamim
    Charrout, Mohammed
    Mahfouz, Ahmed
    Reinders, Marcel J. T.
    Makrodimitris, Stavros
    PLOS ONE, 2023, 18 (10):
  • [23] SpaDecon: cell-type deconvolution in spatial transcriptomics with semi-supervised learning
    Coleman, Kyle
    Hu, Jian
    Schroeder, Amelia
    Lee, Edward B.
    Li, Mingyao
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [24] SpaDecon: cell-type deconvolution in spatial transcriptomics with semi-supervised learning
    Kyle Coleman
    Jian Hu
    Amelia Schroeder
    Edward B. Lee
    Mingyao Li
    Communications Biology, 6
  • [25] Reference-free cell type deconvolution of multi-cellular pixel-resolution spatially resolved transcriptomics data
    Brendan F. Miller
    Feiyang Huang
    Lyla Atta
    Arpan Sahoo
    Jean Fan
    Nature Communications, 13
  • [26] Reference-free cell type deconvolution of multi-cellular pixel-resolution spatially resolved transcriptomics data
    Miller, Brendan F.
    Huang, Feiyang
    Atta, Lyla
    Sahoo, Arpan
    Fan, Jean
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [27] Heterogeneous pseudobulk simulation enables realistic benchmarking of cell-type deconvolution methods
    Hu, Mengying
    Chikina, Maria
    GENOME BIOLOGY, 2024, 25 (01):
  • [28] Flexible analysis of spatial transcriptomics data (FAST): a deconvolution approach
    Zhang, Meng
    Parker, Joel
    An, Lingling
    Liu, Yiwen
    Sun, Xiaoxiao
    BMC BIOINFORMATICS, 2025, 26 (01):
  • [29] Benchmarking spatial clustering methods with spatially resolved transcriptomics data
    Zhiyuan Yuan
    Fangyuan Zhao
    Senlin Lin
    Yu Zhao
    Jianhua Yao
    Yan Cui
    Xiao-Yong Zhang
    Yi Zhao
    Nature Methods, 2024, 21 : 712 - 722
  • [30] Benchmarking spatial clustering methods with spatially resolved transcriptomics data
    Yuan, Zhiyuan
    Zhao, Fangyuan
    Lin, Senlin
    Zhao, Yu
    Yao, Jianhua
    Cui, Yan
    Zhang, Xiao-Yong
    Zhao, Yi
    NATURE METHODS, 2024, 21 (04) : 712 - 722