SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data

Cited by: 68
Authors
Zafar, Hamim [1, 2]
Navin, Nicholas [3]
Chen, Ken [2]
Nakhleh, Luay [1]
Affiliations
[1] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Genet, Houston, TX 77030 USA
Funding
U.S. National Science Foundation;
Keywords
INTRATUMOR HETEROGENEITY; CANCER; EVOLUTION; SELECTION; HISTORY; MODEL;
DOI
10.1101/gr.243121.118
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Accumulation and selection of somatic mutations in a Darwinian framework result in intra-tumor heterogeneity (ITH) that poses significant challenges to the diagnosis and clinical therapy of cancer. Identification of the tumor cell populations (clones) and reconstruction of their evolutionary relationship can elucidate this heterogeneity. Recently developed single-cell DNA sequencing (SCS) technologies promise to resolve ITH to a single-cell level. However, technical errors in SCS data sets, including false-positives (FP) and false-negatives (FN) due to allelic dropout, and cell doublets, significantly complicate these tasks. Here, we propose a nonparametric Bayesian method that reconstructs the clonal populations as clusters of single cells, genotypes of each clone, and the evolutionary relationship between the clones. It employs a tree-structured Chinese restaurant process as the prior on the number and composition of clonal populations. The evolution of the clonal populations is modeled by a clonal phylogeny and a finite-site model of evolution to account for potential mutation recurrence and losses. We probabilistically account for FP and FN errors, and cell doublets are modeled by employing a Beta-binomial distribution. We develop a Gibbs sampling algorithm comprising partial reversible-jump and partial Metropolis-Hastings updates to explore the joint posterior space of all parameters. The performance of our method on synthetic and experimental data sets suggests that joint reconstruction of tumor clones and clonal phylogeny under a finite-site model of evolution leads to more accurate inferences. Our method is the first to enable this joint reconstruction in a fully Bayesian framework, thus providing measures of support of the inferences it makes.
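As a rough illustration of what it means to account probabilistically for FP and FN errors, the sketch below scores one cell's observed binary mutation calls against candidate clone genotypes. It is a simplified assumption, not the method's actual likelihood: the abstract does not give the formula, and doublets, missing entries, and the phylogeny are all left out. The names `log_likelihood`, `best_clone`, `alpha`, and `beta` are hypothetical.

```python
import math

def log_likelihood(observed, genotype, alpha, beta):
    """Log-probability of a cell's observed binary calls given a true genotype.

    Assumed error model (illustrative only):
      alpha = P(observed = 1 | true = 0), the false-positive rate
      beta  = P(observed = 0 | true = 1), the false-negative rate
    Sites are treated as independent.
    """
    total = 0.0
    for d, g in zip(observed, genotype):
        if g == 0:
            p = alpha if d == 1 else 1.0 - alpha
        else:
            p = beta if d == 0 else 1.0 - beta
        total += math.log(p)
    return total

def best_clone(observed, clone_genotypes, alpha, beta):
    """Index of the clone genotype with the highest log-likelihood."""
    scores = [log_likelihood(observed, g, alpha, beta) for g in clone_genotypes]
    return max(range(len(scores)), key=scores.__getitem__)

# Hypothetical toy data: three clone genotypes over four mutation sites.
clones = [[1, 0, 0, 0], [1, 1, 0, 0], [1, 1, 1, 1]]
cell = [1, 1, 0, 1]
print(best_clone(cell, clones, alpha=0.01, beta=0.2))
```

With a false-negative rate much higher than the false-positive rate, as is typical when allelic dropout dominates, a missing call is penalized far less than a spurious one, so the cell is matched to the clone that explains its observed mutations through a single dropout.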
Pages: 1847-1859
Page count: 13