SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data

被引:68
|
作者
Zafar, Hamim [1 ,2 ]
Navin, Nicholas [3 ]
Chen, Ken [2 ]
Nakhleh, Luay [1 ]
机构
[1] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Genet, Houston, TX 77030 USA
基金
美国国家科学基金会;
关键词
INTRATUMOR HETEROGENEITY; CANCER; EVOLUTION; SELECTION; HISTORY; MODEL;
D O I
10.1101/gr.243121.118
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Accumulation and selection of somatic mutations in a Darwinian framework result in intra-tumor heterogeneity (ITH) that poses significant challenges to the diagnosis and clinical therapy of cancer. Identification of the tumor cell populations (clones) and reconstruction of their evolutionary relationship can elucidate this heterogeneity. Recently developed single-cell DNA sequencing (SCS) technologies promise to resolve ITH to a single-cell level. However, technical errors in SCS data sets, including false-positives (FP) and false-negatives (FN) due to allelic dropout, and cell doublets, significantly complicate these tasks. Here, we propose a nonparametric Bayesian method that reconstructs the clonal populations as clusters of single cells, genotypes of each clone, and the evolutionary relationship between the clones. It employs a tree-structured Chinese restaurant process as the prior on the number and composition of clonal populations. The evolution of the clonal populations is modeled by a clonal phylogeny and a finite-site model of evolution to account for potential mutation recurrence and losses. We probabilistically account for FP and FN errors, and cell doublets are modeled by employing a Beta-binomial distribution. We develop a Gibbs sampling algorithm comprising partial reversible-jump and partial Metropolis-Hastings updates to explore the joint posterior space of all parameters. The performance of our method on synthetic and experimental data sets suggests that joint reconstruction of tumor clones and clonal phylogeny under a finite-site model of evolution leads to more accurate inferences. Our method is the first to enable this joint reconstruction in a fully Bayesian framework, thus providing measures of support of the inferences it makes.
引用
收藏
页码:1847 / 1859
页数:13
相关论文
共 50 条
  • [41] Preparation of a Single-Cell Suspension from Tumor Biopsy Samples for Single-Cell RNA Sequencing
    A. A. Frolova
    T. S. Gerashchenko
    M. R. Patysheva
    A. A. Fedorov
    M. M. Tsyganov
    U. A. Bokova
    O. D. Bragina
    M. A. Vostrikova
    E. Yu. Garbukov
    N. V. Cherdyntseva
    Bulletin of Experimental Biology and Medicine, 2023, 175 : 519 - 523
  • [42] Preparation of a Single-Cell Suspension from Tumor Biopsy Samples for Single-Cell RNA Sequencing
    Frolova, A. A.
    Gerashchenko, T. S.
    Patysheva, M. R.
    Fedorov, A. A.
    Tsyganov, M. M.
    Bokova, U. A.
    Bragina, O. D.
    Vostrikova, M. A.
    Garbukov, E. Yu.
    Cherdyntseva, N. V.
    BULLETIN OF EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 175 (04) : 519 - 523
  • [43] SCARLET: Single-Cell Tumor Phylogeny Inference with Copy-Number Constrained Mutation Losses
    Satas, Gryte
    Zaccaria, Simone
    Mon, Geoffrey
    Raphael, Benjamin J.
    CELL SYSTEMS, 2020, 10 (04) : 323 - +
  • [44] PhISCS: a combinatorial approach for subperfect tumor phylogeny reconstruction via integrative use of single-cell and bulk sequencing data
    Malikic, Salem
    Mehrabadi, Farid Rashidi
    Ciccolella, Simone
    Rahman, Md. Khaledur
    Ricketts, Camir
    Haghshenas, Ehsan
    Seidman, Daniel
    Hach, Faraz
    Hajirasouliha, Iman
    Sahinalp, S. Cenk
    GENOME RESEARCH, 2019, 29 (11) : 1860 - 1877
  • [45] Reference-free inference of tumor phylogenies from single-cell sequencing data (vol 16, S7, 2015)
    Subramanian, Ayshwarya
    Schwartz, Russell
    BMC GENOMICS, 2016, 17
  • [46] SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data
    Yu, Zhenhua
    Du, Fang
    Song, Lijuan
    FRONTIERS IN GENETICS, 2022, 13
  • [47] Inference of Population Structure from Time-Series Genotype Data
    Joseph, Tyler A.
    Pe'er, Itsik
    AMERICAN JOURNAL OF HUMAN GENETICS, 2019, 105 (02) : 317 - 333
  • [48] splatPop: simulating population scale single-cell RNA sequencing data
    Christina B. Azodi
    Luke Zappia
    Alicia Oshlack
    Davis J. McCarthy
    Genome Biology, 22
  • [49] splatPop: simulating population scale single-cell RNA sequencing data
    Azodi, Christina B.
    Zappia, Luke
    Oshlack, Alicia
    McCarthy, Davis J.
    GENOME BIOLOGY, 2021, 22 (01)
  • [50] BiTSC2: Bayesian inference of tumor clonal tree by joint analysis of single-cell SNV and CNA data
    Chen, Ziwei
    Gong, Fuzhou
    Wan, Lin
    Ma, Liang
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (03)