Multiple Imputation of Missing Phenotype Data for QTL Mapping

被引:9
|
作者
Bobb, Jennifer F. [1 ]
Scharfstein, Daniel O. [1 ]
Daniels, Michael J. [2 ]
Collins, Francis S. [3 ]
Kelada, Samir [3 ]
机构
[1] Johns Hopkins Bloomberg Sch Publ Hlth, Baltimore, MD 21205 USA
[2] Univ Florida, Gainesville, FL 32611 USA
[3] NHGRI, NIH, Bethesda, MD USA
基金
美国国家卫生研究院;
关键词
multiple imputation; missing data; quantitative trait loci; QUANTITATIVE TRAIT LOCI; AIRWAY RESPONSIVENESS; COLLABORATIVE CROSS; SYSTEMS GENETICS; INCOMPLETE DATA; MOUSE; MICE; PLETHYSMOGRAPHY; FRAMEWORK; RESOURCE;
D O I
10.2202/1544-6115.1676
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Missing phenotype data can be a major hurdle to mapping quantitative trait loci (QTL). Though in many cases experiments may be designed to minimize the occurrence of missing data, it is often unavoidable in practice; thus, statistical methods to account for missing data are needed. In this paper we describe an approach for conjoining multiple imputation and QTL mapping. Methods are applied to map genes associated with increased breathing effort in mice after lung inflammation due to allergen challenge in developing lines of the Collaborative Cross, a new mouse genetics resource. Missing data poses a particular challenge in this study because the desired phenotype summary to be mapped is a function of incompletely observed dose-response curves. Comparison of the multiple imputation approach to two naive approaches for handling missing data suggest that these simpler methods may yield poor results: ignoring missing data through a complete case analysis may lead to incorrect conclusions, while using a last observation carried forward procedure, which does not account for uncertainty in the imputed values, may lead to anti-conservative inference. The proposed approach is widely applicable to other studies with missing phenotype data.
引用
收藏
页数:29
相关论文
共 50 条
  • [31] Multiple imputation: a mature approach to dealing with missing data
    S. Chevret
    S. Seaman
    M. Resche-Rigon
    Intensive Care Medicine, 2015, 41 : 348 - 350
  • [32] Imputation Methods for Multiple Regression with Missing Heteroscedastic Data
    Asif, Muhammad
    Samart, Klairung
    THAILAND STATISTICIAN, 2022, 20 (01): : 1 - 15
  • [33] Multiple imputation of missing genotype data for unrelated individuals
    Souverein, OW
    Zwinderman, AH
    Tanck, MWT
    ANNALS OF HUMAN GENETICS, 2006, 70 : 372 - 381
  • [34] Missing data and multiple imputation in clinical epidemiological research
    Pedersen, Alma B.
    Mikkelsen, Ellen M.
    Cronin-Fenton, Deirdre
    Kristensen, Nickolaj R.
    Tra My Pham
    Pedersen, Lars
    Petersen, Irene
    CLINICAL EPIDEMIOLOGY, 2017, 9 : 157 - 165
  • [35] Handling missing data in nursing research with multiple imputation
    Kneipp, SM
    McIntosh, M
    NURSING RESEARCH, 2001, 50 (06) : 384 - 389
  • [36] A nonparametric multiple imputation approach for missing categorical data
    Muhan Zhou
    Yulei He
    Mandi Yu
    Chiu-Hsieh Hsu
    BMC Medical Research Methodology, 17
  • [37] Multiple imputation of binary multilevel missing not at random data
    Hammon, Angelina
    Zinn, Sabine
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2020, 69 (03) : 547 - 564
  • [38] Multiple Imputation Ensembles (MIE) for Dealing with Missing Data
    Aleryani A.
    Wang W.
    de la Iglesia B.
    SN Computer Science, 2020, 1 (3)
  • [39] A practical guide to multiple imputation of missing data in nephrology
    Blazek, Katrina
    van Zwieten, Anita
    Saglimbene, Valeria
    Teixeira-Pinto, Armando
    KIDNEY INTERNATIONAL, 2021, 99 (01) : 68 - 74
  • [40] Augmented Stochastic Multiple Imputation Model for Airport Pavement Missing Data Imputation
    Farhan, J.
    Fwa, T. F.
    TRANSPORTATION RESEARCH RECORD, 2014, (2449) : 96 - 104