On combining family- and population-based sequencing data

Authors
Yuriko Katsumata
David W. Fardo
Affiliation
[1] University of Kentucky College of Public Health, Department of Biostatistics
Keywords
Combine Data; Exome Sequencing; Genetic Analysis Workshop; Inflated Type; Variant Call Format
DOI
10.1186/s12919-016-0026-9
Abstract
Several group-based statistical approaches have been proposed to detect the effects of variation within a gene, separately for population-based and family-based designs. However, unified tests that combine gene-phenotype associations from these two study designs are not yet well established. In this study, we investigated how to combine population-based and family-based sequencing data efficiently, using the Genetic Analysis Workshop 19 (GAW19) data set to evaluate best practices. Because one design employed whole genome sequencing and the other whole exome sequencing, we examined only the variants present in both data sets. We used the family-based sequence kernel association test (famSKAT) to analyze the family-based and population-based data sets separately, as well as a combined data set, and compared these analyses against meta-analysis. Using the combined data, we showed that famSKAT has high power to detect associations between diastolic and/or systolic blood pressure and genes that have causal variants with large effect sizes, such as MAP4, TNN, and CGN. However, when power differed considerably between the family-based and population-based data, famSKAT with the combined data had lower power than famSKAT with the population-based data alone. The famSKAT test statistic for the combined data can be influenced by sample imbalance between the two designs. This underscores the importance of foresight in study design because, in this situation, the much smaller sample size of the family-based data essentially serves to dilute the signal. We observed inflated type I error rates in our simulation study, largely when using population-based data, which might result from principal components failing to account completely for population admixture in this cohort.
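
The abstract does not say which meta-analysis method served as the comparator. As a rough illustration only, the sketch below assumes a sample-size-weighted Stouffer's Z, one common way to combine gene-level p-values from two independent studies. The function name `stouffer_weighted` and all numbers are hypothetical and are not taken from the GAW19 analysis. It also shows, in the meta-analysis setting, how a study with no signal can pull a combined result away from a strong single-study result.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def stouffer_weighted(p_values, sample_sizes):
    """Combine one-sided p-values with sample-size-weighted Stouffer's Z.

    Assumption: the studies are independent and each p-value lies
    strictly between 0 and 1. Weights are sqrt(n), a common convention.
    """
    if len(p_values) != len(sample_sizes):
        raise ValueError("need one sample size per p-value")
    weights = [sqrt(n) for n in sample_sizes]
    # Convert each p-value to a Z score (upper-tail convention).
    z_scores = [_STD_NORMAL.inv_cdf(1.0 - p) for p in p_values]
    combined_z = sum(w * z for w, z in zip(weights, z_scores)) / sqrt(
        sum(w * w for w in weights)
    )
    return 1.0 - _STD_NORMAL.cdf(combined_z)


# Hypothetical gene-level results: a strong population-based signal
# and an uninformative family-based result from a smaller sample.
p_population, n_population = 1e-4, 1800
p_family, n_family = 0.5, 400

p_combined = stouffer_weighted(
    [p_population, p_family], [n_population, n_family]
)
print(f"population only: {p_population:.2e}, combined: {p_combined:.2e}")
```

With these made-up inputs the combined p-value is larger than the population-only p-value, because the family-based Z score of zero still contributes to the denominator. This is an analogy in a simpler setting, not a description of how the famSKAT statistic behaves on pooled genotype data.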