Inferring single-cell copy number profiles through cross-cell segmentation of read counts

被引:2
|
作者
Liu, Furui [1 ]
Shi, Fangyuan [1 ,2 ]
Yu, Zhenhua [1 ,2 ]
机构
[1] Ningxia Univ, Sch Informat Engn, Yinchuan 750021, Peoples R China
[2] Ningxia Univ, Collaborat Innovat Ctr Ningxia Big Data & Artifici, Cofounded Ningxia Municipal & Minist Educ, Yinchuan 750021, Peoples R China
关键词
Single-cell DNA sequencing; Copy number alteration; Autoencoder; Mixture model; TUMOR EVOLUTION;
D O I
10.1186/s12864-023-09901-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundCopy number alteration (CNA) is one of the major genomic variations that frequently occur in cancers, and accurate inference of CNAs is essential for unmasking intra-tumor heterogeneity (ITH) and tumor evolutionary history. Single-cell DNA sequencing (scDNA-seq) makes it convenient to profile CNAs at single-cell resolution, and thus aids in better characterization of ITH. Despite that several computational methods have been proposed to decipher single-cell CNAs, their performance is limited in either breakpoint detection or copy number estimation due to the high dimensionality and noisy nature of read counts data.ResultsBy treating breakpoint detection as a process to segment high dimensional read count sequence, we develop a novel method called DeepCNA for cross-cell segmentation of read count sequence and per-cell inference of CNAs. To cope with the difficulty of segmentation, an autoencoder (AE) network is employed in DeepCNA to project the original data into a low-dimensional space, where the breakpoints can be efficiently detected along each latent dimension and further merged to obtain the final breakpoints. Unlike the existing methods that manually calculate certain statistics of read counts to find breakpoints, the AE model makes it convenient to automatically learn the representations. Based on the inferred breakpoints, we employ a mixture model to predict copy numbers of segments for each cell, and leverage expectation-maximization algorithm to efficiently estimate cell ploidy by exploring the most abundant copy number state. Benchmarking results on simulated and real data demonstrate our method is able to accurately infer breakpoints as well as absolute copy numbers and surpasses the existing methods under different test conditions. DeepCNA can be accessed at: https://github.com/zhyu-lab/deepcna.ConclusionsProfiling single-cell CNAs based on deep learning is becoming a new paradigm of scDNA-seq data analysis, and DeepCNA is an enhancement to the current arsenal of computational methods for investigating cancer genomics.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Cracking the pattern of tumor evolution based on single-cell copy number alterations
    Wang, Ying
    Zhang, Min
    Shi, Jian
    Zhu, Yue
    Wang, Xin
    Zhang, Shaojun
    Wang, Fang
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [32] MEDALT: single-cell copy number lineage tracing enabling gene discovery
    Wang, Fang
    Wang, Qihan
    Mohanty, Vakul
    Liang, Shaoheng
    Dou, Jinzhuang
    Han, Jincheng
    Minussi, Darlan Conterno
    Gao, Ruli
    Ding, Li
    Navin, Nicholas
    Chen, Ken
    GENOME BIOLOGY, 2021, 22 (01)
  • [33] MEDALT: single-cell copy number lineage tracing enabling gene discovery
    Fang Wang
    Qihan Wang
    Vakul Mohanty
    Shaoheng Liang
    Jinzhuang Dou
    Jincheng Han
    Darlan Conterno Minussi
    Ruli Gao
    Li Ding
    Nicholas Navin
    Ken Chen
    Genome Biology, 22
  • [34] SCOPE: A normalization and copy number estimation method for single-cell DNA sequencing
    Wang, Rujin
    Lin, Danyu
    Jiang, Yuchao
    CANCER RESEARCH, 2019, 79 (13)
  • [35] Sequential monitoring of single-cell copy number variation in metastatic prostate cancer
    Kuhn, Peter
    Dago, Angel E.
    Stepansky, Asya
    Carlsson, Anders
    Felch, Natalie
    Luttgen, Madelyn
    Kolatkar, Anand
    Hicks, James
    Gross, Mitcheil E.
    CANCER RESEARCH, 2013, 73 (08)
  • [36] Tumor Copy Number Deconvolution Integrating Bulk and Single-Cell Sequencing Data
    Lei, Haoyun
    Lyu, Bochuan
    Gertz, E. Michael
    Schaffer, Alejandro A.
    Shi, Xulian
    Wu, Kui
    Li, Guibo
    Xu, Liqin
    Hou, Yong
    Dean, Michael
    Schwartz, Russell
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (04) : 565 - 598
  • [37] Transgene copy number distribution profiles in recombinant CHO cell lines revealed by single cell analyses
    He, Luhong
    Winterrowd, Christal
    Kadura, Ibrahim
    Frye, Christopher
    BIOTECHNOLOGY AND BIOENGINEERING, 2012, 109 (07) : 1713 - 1722
  • [38] NestedBD: Bayesian inference of phylogenetic trees from single-cell copy number profiles under a birth-death model
    Liu, Yushu
    Edrisi, Mohammadamin
    Yan, Zhi
    Ogilvie, Huw A.
    Nakhleh, Luay
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2024, 19 (01)
  • [39] Inferring cell communication using single-cell calcium spatiotemporal dynamics
    Taghdiri, Nika
    King, Kevin R.
    STAR PROTOCOLS, 2022, 3 (03):
  • [40] Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes
    Gao, Ruli
    Bai, Shanshan
    Henderson, Ying C.
    Lin, Yiyun
    Schalck, Aislyn
    Yan, Yun
    Kumar, Tapsi
    Hu, Min
    Sei, Emi
    Davis, Alexander
    Wang, Fang
    Shaitelman, Simona F.
    Wang, Jennifer Rui
    Chen, Ken
    Moulder, Stacy
    Lai, Stephen Y.
    Navin, Nicholas E.
    NATURE BIOTECHNOLOGY, 2021, 39 (05) : 599 - 608