Reflections on univariate and multivariate analysis of metabolomics data

Cited by: 440
Authors
Saccenti, Edoardo [1, 2]
Hoefsloot, Huub C. J. [1, 2]
Smilde, Age K. [1, 2]
Westerhuis, Johan A. [1, 2]
Hendriks, Margriet M. W. B. [2, 3]
Affiliations
[1] Univ Amsterdam, Swammerdam Inst Life Sci, Biosyst Data Anal Grp, NL-1098 XH Amsterdam, Netherlands
[2] Netherlands Metabol Ctr, NL-2333 CL Leiden, Netherlands
[3] Leiden Acad Ctr Drug Res, NL-2333 CL Leiden, Netherlands
Keywords
Univariate analysis; Multivariate analysis; Hypothesis testing; Multiple test correction; Overfitting; Consistency at large; NMR-BASED METABOLOMICS; STATISTICAL VALIDATION; DISCRIMINANT-ANALYSIS; SHRUNKEN CENTROIDS; POWERFUL APPROACH; FEATURE-SELECTION; HIGHER CRITICISM; GENE-EXPRESSION; DATA SETS; CLASSIFICATION
DOI
10.1007/s11306-013-0598-6
Chinese Library Classification
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Metabolomics experiments usually result in a large quantity of data. Univariate and multivariate analysis techniques are routinely used to extract relevant information from the data with the aim of providing biological knowledge on the problem studied. Despite the fact that statistical tools like the t test, analysis of variance, principal component analysis, and partial least squares discriminant analysis constitute the backbone of the statistical part of the vast majority of metabolomics papers, it seems that many basic but rather fundamental questions are still often asked, like: Why do the results of univariate and multivariate analyses differ? Why apply univariate methods if you have already applied a multivariate method? Why if I do not see something univariately I see something multivariately? In the present paper we address some aspects of univariate and multivariate analysis, with the scope of clarifying in simple terms the main differences between the two approaches. Applications of the t test, analysis of variance, principal component analysis and partial least squares discriminant analysis will be shown on both real and simulated metabolomics data examples to provide an overview on fundamental aspects of univariate and multivariate methods.
Pages: 361-374
Number of pages: 14
Source: Metabolomics, 2014, 10: 361-374