Robust linear regression methods in association studies

被引:44
|
作者
Lourenco, V. M. [1 ]
Pires, A. M. [2 ,3 ]
Kirst, M. [4 ]
机构
[1] Univ Nova Lisboa, Fac Ciencias & Tecnol, Dept Math, P-2829516 Caparica, Portugal
[2] Univ Tecn Lisboa, Inst Super Tecn,Dept Math, P-1049001 Lisbon, Portugal
[3] Univ Tecn Lisboa, CEMAT, Inst Super Tecn, P-1049001 Lisbon, Portugal
[4] Univ Florida, Genet Inst, Plant Mol & Cellular Biol Program, Sch Forest Resources & Conservat, Gainesville, FL 32611 USA
关键词
SINGLE-NUCLEOTIDE POLYMORPHISMS; MAYS SSP PARVIGLUMIS; LINKAGE DISEQUILIBRIUM; STRUCTURED POPULATIONS; QUANTITATIVE TRAITS; GENETIC-ASSOCIATION; CANDIDATE GENES; STRATIFICATION; STATISTICS; INFERENCE;
D O I
10.1093/bioinformatics/btr006
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: It is well known that data deficiencies, such as coding/rounding errors, outliers or missing values, may lead to misleading results for many statistical methods. Robust statistical methods are designed to accommodate certain types of those deficiencies, allowing for reliable results under various conditions. We analyze the case of statistical tests to detect associations between genomic individual variations (SNP) and quantitative traits when deviations from the normality assumption are observed. We consider the classical analysis of variance tests for the parameters of the appropriate linear model and a robust version of those tests based on M-regression. We then compare their empirical power and level using simulated data with several degrees of contamination. Results: Data normality is nothing but a mathematical convenience. In practice, experiments usually yield data with non-conforming observations. In the presence of this type of data, classical least squares statistical methods perform poorly, giving biased estimates, raising the number of spurious associations and often failing to detect true ones. We show through a simulation study and a real data example, that the robust methodology can be more powerful and thus more adequate for association studies than the classical approach.
引用
收藏
页码:815 / 821
页数:7
相关论文
共 50 条
  • [41] Robustification of Linear Regression and Its Application in Genome-Wide Association Studies
    Alamin, Md
    Sultana, Most Humaira
    Xu, Haiming
    Mollah, Md Nurul Haque
    FRONTIERS IN GENETICS, 2020, 11
  • [42] Robust linear regression with broad distributions of errors
    Postnikov, Eugene B.
    Sokolov, Igor M.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2015, 434 : 257 - 267
  • [43] Online signal extraction by robust linear regression
    Gather, U
    Schettlinger, K
    Fried, R
    COMPUTATIONAL STATISTICS, 2006, 21 (01) : 33 - 51
  • [44] ROBUST LINEAR REGRESSION ANALYSIS - THE GREEDY WAY
    Papageorgiou, George
    Bouboulis, Pantelis
    Theodoridis, Sergios
    Themelis, Konstantinos
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 16 - 20
  • [45] FINITE ALGORITHMS FOR ROBUST LINEAR-REGRESSION
    MADSEN, K
    NIELSEN, HB
    BIT, 1990, 30 (04): : 682 - 699
  • [46] ADAPTIVE ESTIMATORS FOR ROBUST LINEAR-REGRESSION
    PRENTICE, RL
    ANDERSON, JR
    BIOMETRICS, 1978, 34 (01) : 153 - 153
  • [47] On robust linear regression of hepatic extraction of insulin
    Aboukalam, MAF
    BIOMETRICAL JOURNAL, 1997, 39 (02) : 171 - 181
  • [48] Robust clusterwise linear regression through trimming
    Garcia-Escudero, L. A.
    Gordaliza, A.
    Mayo-Iscar, A.
    San Martin, R.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) : 3057 - 3069
  • [49] Robust Bayesian analysis of the linear regression model
    Chaturvedi, A
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1996, 50 (02) : 175 - 186
  • [50] ROBUST CRITERION FOR VARIABLE SELECTION IN LINEAR REGRESSION
    Patil, A. B.
    Kashid, D. N.
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2009, 5 (02): : 509 - 521