Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations

被引:48
|
作者
Martin, Alicia R. [1 ,2 ,3 ]
Atkinson, Elizabeth G. [1 ,2 ,3 ]
Chapman, Sinead B. [2 ]
Stevenson, Anne [2 ,4 ]
Stroud, Rocky E. [2 ,4 ]
Abebe, Tamrat [5 ]
Akena, Dickens [6 ]
Alemayehu, Melkam [7 ]
Ashaba, Fred K. [8 ]
Atwoli, Lukoye [9 ]
Bowers, Tera [10 ]
Chibnik, Lori B. [2 ,4 ,11 ]
Daly, Mark J. [1 ,2 ,3 ,12 ]
DeSmet, Timothy [10 ]
Dodge, Sheila [10 ]
Fekadu, Abebaw [7 ,13 ]
Ferriera, Steven [10 ]
Gelaye, Bizu [4 ]
Gichuru, Stella [14 ]
Injera, Wilfred E. [15 ]
James, Roxanne [16 ]
Kariuki, Symon M. [17 ,18 ]
Kigen, Gabriel [19 ]
Koenen, Karestan C. [2 ,4 ]
Kwobah, Edith [14 ]
Kyebuzibwa, Joseph [6 ]
Majara, Lerato [16 ,20 ]
Musinguzi, Henry [8 ]
Mwema, Rehema M. [17 ]
Neale, Benjamin M. [1 ,2 ,3 ]
Newman, Carter P. [2 ,4 ]
Newton, Charles R. J. C. [17 ,18 ]
Pickrell, Joseph K. [21 ]
Ramesar, Raj [22 ]
Shiferaw, Welelta [5 ]
Stein, Dan J. [16 ,23 ,24 ]
Teferra, Solomon [7 ]
van der Merwe, Celia [1 ,2 ,3 ,16 ]
Zingela, Zukiswa [25 ]
机构
[1] Massachusetts Gen Hosp, Analyt & Translat Genet Unit, Boston, MA 02114 USA
[2] Broad Inst Harvard & MIT, Stanley Ctr Psychiat Res, Cambridge, MA 02142 USA
[3] Broad Inst Harvard & MIT, Program Med & Populat Genet, Cambridge, MA 02142 USA
[4] Harvard TH Chan Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
[5] Addis Ababa Univ, Coll Hlth Sci, Sch Med, Dept Microbiol Immunol & Parasitol, Addis Ababa, Ethiopia
[6] Makerere Univ, Coll Hlth Sci, Sch Med, Dept Psychiat, Kampala, Uganda
[7] Addis Ababa Univ, Coll Hlth Sci, Sch Med, Dept Psychiat, Addis Ababa, Ethiopia
[8] Makerere Univ, Coll Hlth Sci, Dept Immunol & Mol Biol, Kampala, Uganda
[9] Moi Univ, Sch Med, Dept Mental Hlth, Coll Hlth Sci, Eldoret, Kenya
[10] Broad Inst MIT & Harvard, Broad Genom, 320 Charles St, Cambridge, MA 02141 USA
[11] Massachusetts Gen Hosp, Dept Neurol, Boston, MA 02114 USA
[12] Inst Mol Med Finland, Helsinki 00014, Finland
[13] Addis Ababa Univ, Ctr Innovat Drug Dev & Therapeut Trials Africa, Addis Ababa, Ethiopia
[14] Moi Teaching & Referral Hosp, Dept Mental Hlth, Eldoret, Kenya
[15] Moi Univ, Sch Med, Dept Immunol, Coll Hlth Sci, Eldoret, Kenya
[16] Univ Cape Town, Dept Psychiat & Mental Hlth, Cape Town, South Africa
[17] KEMRI Wellcome Trust Res Programme Coast, Neurosci Unit, Clin Dept, Kilifi, Kenya
[18] Univ Oxford, Dept Psychiat, Oxford OX3 7JX, England
[19] Moi Univ, Sch Med, Dept Pharmacol & Toxicol, Coll Hlth Sci, Eldoret, Kenya
[20] Univ Cape Town, Fac Hlth Sci, Inst Infect Dis & Mol Med, SA MRC Human Genet Res Unit,Div Human Genet, ZA-7925 Observatory, South Africa
[21] Gencove Inc, New York, NY 10016 USA
[22] Univ Cape Town, Inst Infect Dis & Mol Med, Dept Pathol, Div Human Genet,SA MRC Genom & Precis Med Res Uni, Cape Town, South Africa
[23] Univ Cape Town, SA MRC Unit Risk & Resilience Mental Disorders, Cape Town, South Africa
[24] Neuroscience Inst, Cape Town, South Africa
[25] Walter Sisulu Univ, Dept Psychiat & Human Behav Sci, Mthatha, South Africa
基金
英国医学研究理事会; 美国国家卫生研究院;
关键词
GENOTYPE-IMPUTATION; GENETIC ARCHITECTURE; GENOME; ASSOCIATION;
D O I
10.1016/j.ajhg.2021.03.012
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genetic studies in underrepresented populations identify disproportionate numbers of novel associations. However, most genetic studies use genotyping arrays and sequenced reference panels that best capture variation most common in European ancestry populations. To compare data generation strategies best suited for underrepresented populations, we sequenced the whole genomes of 91 individuals to high coverage as part of the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study with participants from Ethiopia, Kenya, South Africa, and Uganda. We used a downsampling approach to evaluate the quality of two cost-effective data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole-genome sequencing data. We show that low-coverage sequencing at a depth of >= 4x captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1x) performed comparably to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation; 43 sequencing detects 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, effectively identify novel variation particularly in underrepresented populations, and present opportunities to enhance variant discovery at a cost similar to traditional approaches.
引用
收藏
页码:656 / 668
页数:13
相关论文
共 31 条
  • [21] Genotyping by low-coverage whole-genome sequencing in intercross pedigrees from outbred founders: a cost-efficient approach
    Zan, Yanjun
    Payen, Thibaut
    Lillie, Mette
    Honaker, Christa F.
    Siegel, Paul B.
    Carlborg, Orjan
    GENETICS SELECTION EVOLUTION, 2019, 51 (01)
  • [22] Copy number variation of urine exfoliated cells by low-coverage whole genome sequencing for diagnosis of prostate adenocarcinoma: a prospective cohort study
    Youyan Guan
    Xiaobing Wang
    Kaopeng Guan
    Dong Wang
    Xingang Bi
    Zhendong Xiao
    Zejun Xiao
    Xingli Shan
    Linjun Hu
    Jianhui Ma
    Changling Li
    Yong Zhang
    Jianzhong Shou
    Baiyun Wang
    Ziliang Qian
    Nianzeng Xing
    BMC Medical Genomics, 15
  • [23] dpGMM: A Dirichlet Process Gaussian Mixture Model for Copy Number Variation Detection in Low-Coverage Whole-Genome Sequencing Data
    Li, Yaoyao
    Zhang, Junying
    Yuan, Xiguo
    Li, Junping
    IEEE ACCESS, 2020, 8 : 27973 - 27985
  • [24] Copy number variation of urine exfoliated cells by low-coverage whole genome sequencing for diagnosis of prostate adenocarcinoma: a prospective cohort study
    Guan, Youyan
    Wang, Xiaobing
    Guan, Kaopeng
    Wang, Dong
    Bi, Xingang
    Xiao, Zhendong
    Xiao, Zejun
    Shan, Xingli
    Hu, Linjun
    Ma, Jianhui
    Li, Changling
    Zhang, Yong
    Shou, Jianzhong
    Wang, Baiyun
    Qian, Ziliang
    Xing, Nianzeng
    BMC MEDICAL GENOMICS, 2022, 15 (SUPPL 2)
  • [25] High-throughput and cost-effective genotyping by low-coverage whole genome sequencing with genotype imputation in Pacific oyster, Crassostrea gigas
    Yang, Ben
    Li, Yongjing
    Li, Qi
    Liu, Shikai
    AQUACULTURE, 2024, 591
  • [26] Copy Number Variation in MUC5AC and Susceptibility to Allergic Rhinitis: A Low-Coverage Whole-Genome Sequencing and Validation Cohort Study
    Wang, Yan
    Li, Linge
    Yang, Yuping
    Feng, Juan
    Wang, Lingling
    Zhang, Hua
    GENETIC TESTING AND MOLECULAR BIOMARKERS, 2020, 24 (04) : 173 - 180
  • [27] Noninvasive Detection of Urothelial Carcinoma by Cost-effective Low-coverage Whole-genome Sequencing from Urine-Exfoliated Cell DNA
    Zeng, Shuxiong
    Ying, Yidie
    Xing, Naidong
    Wang, Baiyun
    Qian, Ziliang
    Zhou, Zunlin
    Zhang, Zhensheng
    Xu, Weidong
    Wang, Huiqing
    Dai, Lihe
    Gao, Li
    Zhou, Tie
    Ji, Jiatao
    Xu, Chuanliang
    CLINICAL CANCER RESEARCH, 2020, 26 (21) : 5646 - 5654
  • [28] Re: Noninvasive Detection of Urothelial Carcinoma by Cost-Effective Low-Coverage Whole-Genome Sequencing from Urine-Exfoliated Cell DNA
    不详
    JOURNAL OF UROLOGY, 2021, 206 (04): : 1061 - 1061
  • [29] Low-coverage sequencing in a deep intercross of the Virginia body weight lines provides insight to the polygenic genetic architecture of growth: novel loci revealed by increased power and improved genome-coverage
    Ronneburg, T.
    Zan, Y.
    Honaker, C. F.
    Siegel, P. B.
    Carlborg, O.
    POULTRY SCIENCE, 2023, 102 (05)
  • [30] Non-invasive detection of urothelial carcinoma (UC) by cost-effective low-coverage whole genome sequencing from urine exfoliated cells DNA.
    Zeng, Shuxiong
    Nai, Dongxing
    Ye, Yingdie
    Ji, Jiatao
    Zhou, Zunlin
    Wang, Baiyun
    Qian Ziliang
    Xu, Changliang
    JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (15)