Quantitative Analysis of Pseudogene-Associated Errors During Germline Variant Calling

被引:0
|
作者
Podvalnyi, Artem [1 ,2 ]
Kopernik, Arina [1 ]
Sayganova, Mariia [1 ]
Woroncow, Mary [3 ]
Zobkova, Gauhar [4 ]
Smirnova, Anna [4 ]
Esibov, Anton [1 ]
Deviatkin, Andrey [1 ]
Volchkov, Pavel [1 ,3 ]
Albert, Eugene [1 ,3 ]
机构
[1] Fed Res Ctr Innovator & Emerging Biomed & Pharmace, Moscow 125315, Russia
[2] HSE Univ, Fac Comp Sci, Moscow 101000, Russia
[3] Lomonosov Moscow State Univ, Fac Fundamental Med, Moscow 119991, Russia
[4] Evogen LLC, Moscow 115191, Russia
关键词
processed pseudogenes; SNPs; ACMG;
D O I
10.3390/ijms26010363
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A pseudogene is a non-functional copy of a protein-coding gene. Processed pseudogenes, which are created by the reverse transcription of mRNA and subsequent integration of the resulting cDNA into the genome, being a major pseudogene class, represent a significant challenge in genome analysis due to their high sequence similarity to the parent genes and their frequent absence in the reference genome. This homology can lead to errors in variant identification, as sequences derived from processed pseudogenes can be incorrectly assigned to parental genes, complicating correct variant calling. In this study, we quantified the occurrence of variant calling errors associated with pseudogenes, generated by the most popular germline variant callers, namely GATK-HC, DRAGEN, and DeepVariant, when analysing 30x human whole-genome sequencing data (n = 13,307). The results show that the presence of pseudogenes can interfere with variant calling, leading to false positive identifications of potentially clinically relevant variants. Compared to other approaches, DeepVariant was the most effective in correcting these errors.
引用
收藏
页数:10
相关论文
共 29 条
  • [21] Quantitative analysis of septin Cdc10 & Cdc3-associated proteome during stress response in the fungal pathogen Cryptococcus neoformans
    Barrera, Stephani Martinez
    Hatchell, Emma
    Byrum, Stephanie D.
    Mackintosh, Samuel G.
    Kozubowski, Lukasz
    PLOS ONE, 2024, 19 (12):
  • [22] Detection of apoptosis-associated microRNA in human apheresis platelets during storage by quantitative real-time polymerase chain reaction analysis
    Yu, Shifang
    Deng, Gang
    Qian, Dingliang
    Xie, Zuoting
    Sun, He
    Huang, Dandan
    Li, Qiang
    BLOOD TRANSFUSION, 2014, 12 (04) : 541 - 547
  • [23] Analysis of quantitative trait loci and candidate gene exploration associated with cold tolerance in rice (Oryza sativa L.) during the seedling stage
    Jo, Sumin
    Jang, Seong-Gyu
    Lee, Sais-Beul
    Lee, Ji-Yoon
    Cho, Jun-Hyeon
    Kang, Ju-Won
    Kwon, Yeongho
    Lee, So-Myeong
    Park, Dong-Soo
    Kwon, Soon-Wook
    Lee, Jong-Hee
    FRONTIERS IN PLANT SCIENCE, 2025, 15
  • [24] RETROSPECTIVE ANALYSIS OF THE CHARACTERISTICS OF THE POST-COVID PERIOD IN PATIENTS WITH TYPE 2 DIABETES, INFECTED DURING DIFFERENT VARIANT-ASSOCIATED PERIODS OF COVID-19
    Altynbekova, Saule A.
    Fadeev, Valentin V.
    Abilayuly, Zhangentkhan
    Bolshakova, Svetlana V.
    DIABETES MELLITUS, 2024, 27 (05): : 441 - 450
  • [25] Individual risk factors associated with SARS-CoV-2 infection during Alpha variant in high-income countries: a systematic review and meta-analysis
    Moniz, Marta
    Pereira, Sofia
    Soares, Patricia
    Aguiar, Pedro
    Donato, Helena
    Leite, Andreia
    FRONTIERS IN PUBLIC HEALTH, 2024, 12
  • [26] Expression of botulinum neurotoxins A and E, and associated non-toxin genes, during the transition phase and stability at high temperature: analysis by quantitative reverse transcription-PCR
    Couesnon, A
    Raffestin, S
    Popoff, MR
    MICROBIOLOGY-SGM, 2006, 152 : 759 - 770
  • [27] Quantitative Analysis of Dynamic Protein Interactions during Transcription Reveals a Role for Casein Kinase II in Polymerase-associated Factor (PAF) Complex Phosphorylation and Regulation of Histone H2B Monoubiquitylation
    Bedard, Lynn Glowczewski
    Dronamraju, Raghuvar
    Kerschner, Jenny L.
    Hunter, Gerald O.
    Axley, Elizabeth DeVlieger
    Boyd, Asha K.
    Strahl, Brian D.
    Mosley, Amber L.
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2016, 291 (26) : 13410 - 13420
  • [28] Acute Intravenous Synaptamine Complex Variant KB220™ "Normalizes" Neurological Dysregulation in Patients During Protracted Abstinence From Alcohol and Opiates as Observed Using Quantitative Electroencephalographic and Genetic Analysis for Reward Polymorphisms: Part 1, Pilot Study with 2 Case Reports
    Miller, David K.
    Bowirrat, Abdalla
    Manka, Matthew
    Miller, Merlene
    Stokes, Stanley
    Manka, Debra
    Allen, Cameron
    Gant, Charles
    Downs, B. William
    Smolen, Andrew
    Stevens, Emily
    Yeldandi, Swetha
    Blum, Kenneth
    POSTGRADUATE MEDICINE, 2010, 122 (06) : 188 - 213
  • [29] Hemoglobin fairfax, a tandem duplication of beta 90-94 inserted between beta 94-beta 95 (proved by DNA sequence analysis); a previously undescribed unstable hemoglobin variant associated with severe hemolytic anemia and stroke during early childhood.
    Greenberg, J
    Fairbanks, VF
    Hoyer, J
    Thibodeau, SN
    Fairfax, VA
    BLOOD, 1997, 90 (10) : 2808 - 2808