Robust linear regression methods in association studies

被引:44
|
作者
Lourenco, V. M. [1 ]
Pires, A. M. [2 ,3 ]
Kirst, M. [4 ]
机构
[1] Univ Nova Lisboa, Fac Ciencias & Tecnol, Dept Math, P-2829516 Caparica, Portugal
[2] Univ Tecn Lisboa, Inst Super Tecn,Dept Math, P-1049001 Lisbon, Portugal
[3] Univ Tecn Lisboa, CEMAT, Inst Super Tecn, P-1049001 Lisbon, Portugal
[4] Univ Florida, Genet Inst, Plant Mol & Cellular Biol Program, Sch Forest Resources & Conservat, Gainesville, FL 32611 USA
关键词
SINGLE-NUCLEOTIDE POLYMORPHISMS; MAYS SSP PARVIGLUMIS; LINKAGE DISEQUILIBRIUM; STRUCTURED POPULATIONS; QUANTITATIVE TRAITS; GENETIC-ASSOCIATION; CANDIDATE GENES; STRATIFICATION; STATISTICS; INFERENCE;
D O I
10.1093/bioinformatics/btr006
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: It is well known that data deficiencies, such as coding/rounding errors, outliers or missing values, may lead to misleading results for many statistical methods. Robust statistical methods are designed to accommodate certain types of those deficiencies, allowing for reliable results under various conditions. We analyze the case of statistical tests to detect associations between genomic individual variations (SNP) and quantitative traits when deviations from the normality assumption are observed. We consider the classical analysis of variance tests for the parameters of the appropriate linear model and a robust version of those tests based on M-regression. We then compare their empirical power and level using simulated data with several degrees of contamination. Results: Data normality is nothing but a mathematical convenience. In practice, experiments usually yield data with non-conforming observations. In the presence of this type of data, classical least squares statistical methods perform poorly, giving biased estimates, raising the number of spurious associations and often failing to detect true ones. We show through a simulation study and a real data example, that the robust methodology can be more powerful and thus more adequate for association studies than the classical approach.
引用
收藏
页码:815 / 821
页数:7
相关论文
共 50 条
  • [21] Robust methods for population stratification in genome wide association studies
    Li Liu
    Donghui Zhang
    Hong Liu
    Christopher Arendt
    BMC Bioinformatics, 14
  • [22] Robust methods for population stratification in genome wide association studies
    Liu, Li
    Zhang, Donghui
    Liu, Hong
    Arendt, Christopher
    BMC BIOINFORMATICS, 2013, 14
  • [23] ROBUST LINEAR LEAST SQUARES REGRESSION
    Audibert, Jean-Yves
    Catoni, Olivier
    ANNALS OF STATISTICS, 2011, 39 (05): : 2766 - 2794
  • [24] Robust Mixture of Linear Regression Models
    Bashir, Shaheena
    Carter, E. M.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2012, 41 (18) : 3371 - 3388
  • [25] On robust linear regression with incomplete data
    Atkinson, AC
    Cheng, TC
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2000, 33 (04) : 361 - 380
  • [26] Weighting games in robust linear regression
    Markatou, M
    JOURNAL OF MULTIVARIATE ANALYSIS, 1999, 70 (01) : 118 - 135
  • [27] Robust linear regression: A review and comparison
    Yu, Chun
    Yao, Weixin
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (08) : 6261 - 6282
  • [28] Robust linear and support vector regression
    Mangasarian, OL
    Musicant, DR
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (09) : 950 - 955
  • [29] Linear and robust Gaussian regression filters
    Seewig, J.
    7th International Symposium on Measurement Technology and Intelligent Instruments, 2005, 13 : 254 - 257
  • [30] Robust estimation in restricted linear regression
    Toka, Onur
    Cetin, Meral
    Arslan, Olcay
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (03) : 1015 - 1029