In-depth comparison of somatic point mutation callers based on different tumor next-generation sequencing depth data

被引:77
|
作者
Cai, Lei [1 ,2 ]
Yuan, Wei [1 ]
Zhang, Zhou [1 ,3 ]
He, Lin [1 ,4 ]
Chou, Kuo-Chen [2 ,5 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Key Lab Psychot Disorders 13dz2260500, Key Lab Genet Dev & Neuropsychiat Disorders, Bio X Inst,Minist Educ, Shanghai 200030, Peoples R China
[2] Gordon Life Sci Inst, Boston, MA 02478 USA
[3] Shanghai Jiao Tong Univ, Sch Med, Inst Biliary Tract Dis, Xinhua Hosp, Shanghai 200092, Peoples R China
[4] Zhejiang Univ, Sch Med, Womens Hosp, Hangzhou 310006, Zhejiang, Peoples R China
[5] King Abdulaziz Univ, CEGMR, Jeddah 21589, Saudi Arabia
来源
SCIENTIFIC REPORTS | 2016年 / 6卷
关键词
CANCER GENOMES; SNV DETECTION; WHOLE-EXOME; WEB SERVER; IDENTIFICATION; VARIANTS; MODES; DISCOVERY; PACKAGE; PSEKNC;
D O I
10.1038/srep36540
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Four popular somatic single nucleotide variant (SNV) calling methods (Varscan, SomaticSniper, Strelka and MuTect2) were carefully evaluated on the real whole exome sequencing (WES, depth of -50X) and ultra-deep targeted sequencing (UDT-Seq, depth of similar to 370X) data. The four tools returned poor consensus on candidates (only 20% of calls were with multiple hits by the callers). For both WES and UDT-Seq, MuTect2 and Strelka obtained the largest proportion of COSMIC entries as well as the lowest rate of dbSNP presence and high-alternative-alleles-in-control calls, demonstrating their superior sensitivity and accuracy. Combining different callers does increase reliability of candidates, but narrows the list down to very limited range of tumor read depth and variant allele frequency. Calling SNV on UDT-Seq data, which were of much higher read-depth, discovered additional true-positive variations, despite an even more tremendous growth in false positive predictions. Our findings not only provide valuable benchmark for state-of-the-art SNV calling methods, but also shed light on the access to more accurate SNV identification in the future.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] In-depth comparison of somatic point mutation callers based on different tumor next-generation sequencing depth data
    Lei Cai
    Wei Yuan
    Zhou Zhang
    Lin He
    Kuo-Chen Chou
    Scientific Reports, 6
  • [2] Variant Callers for Next-Generation Sequencing Data: A Comparison Study
    Liu, Xiangtao
    Han, Shizhong
    Wang, Zuoheng
    Gelernter, Joel
    Yang, Bao-Zhu
    PLOS ONE, 2013, 8 (09):
  • [3] Detecting somatic point mutations in cancer genome sequencing data: a comparison of mutation callers
    Qingguo Wang
    Peilin Jia
    Fei Li
    Haiquan Chen
    Hongbin Ji
    Donald Hucks
    Kimberly Brown Dahlman
    William Pao
    Zhongming Zhao
    Genome Medicine, 5
  • [4] Detecting somatic point mutations in cancer genome sequencing data: a comparison of mutation callers
    Wang, Qingguo
    Jia, Peilin
    Li, Fei
    Chen, Haiquan
    Ji, Hongbin
    Hucks, Donald
    Dahlman, Kimberly Brown
    Pao, William
    Zhao, Zhongming
    GENOME MEDICINE, 2013, 5
  • [5] Comprehensive Benchmark of Somatic Mutation Callers on Cell-Free DNA Next-Generation Sequencing Data in Cancer Liquid Biopsy
    Carrie, Hanae
    Sim, Ngak Leng
    Wong, Pui Mun
    Gan, Anna
    Lau, Yi Ting
    Poon, Polly
    Thangaraju, Saranya
    Tan, Iain
    Yap, Yoon Sim
    Wong, Limsoon
    Skanderup, Anders
    CANCER RESEARCH, 2024, 84 (08)
  • [6] Impact of the next-generation sequencing data depth on various biological result inferences
    Hou Rui
    Yang ZhenXing
    Li MingHui
    Xiao HuaSheng
    SCIENCE CHINA-LIFE SCIENCES, 2013, 56 (02) : 104 - 109
  • [7] Impact of the next-generation sequencing data depth on various biological result inferences
    Rui Hou
    ZhenXing Yang
    MingHui Li
    HuaSheng Xiao
    Science China Life Sciences, 2013, 56 : 104 - 109
  • [8] Impact of the next-generation sequencing data depth on various biological result inferences
    HOU Rui
    YANG ZhenXing
    LI MingHui
    XIAO HuaSheng
    Science China(Life Sciences), 2013, 56 (02) : 104 - 109
  • [9] Impact of the next-generation sequencing data depth on various biological result inferences
    HOU Rui
    YANG ZhenXing
    LI MingHui
    XIAO HuaSheng
    Science China(Life Sciences) , 2013, (02) : 104 - 109
  • [10] Inferring Population Structure and Admixture Proportions in Low Depth Next-Generation Sequencing Data
    Meisner, Jonas
    Albrechtsen, Anders
    HUMAN HEREDITY, 2017, 83 (01) : 17 - 18