So you think you can PLS-DA?

Cited by: 220
Authors
Ruiz-Perez, Daniel [1]
Guan, Haibin [1]
Madhivanan, Purnima [2]
Mathee, Kalai [3]
Narasimhan, Giri [1]
Affiliations
[1] Florida Int Univ, Bioinformat Res Grp BioRG, 11200 SW 8th St, Miami, FL 33199 USA
[2] Florida Int Univ, Dept Epidemiol, 11200 SW 8th St, Miami, FL 24105 USA
[3] Florida Int Univ, Herbert Wertheim Coll Med, 11200 SW 8th St, Miami, FL 24105 USA
Keywords
PLS-DA; PCA; Feature selection; Dimensionality reduction; Bioinformatics; Partial least-squares; Vaginal microbiome; Health
DOI
10.1186/s12859-019-3310-7
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Background: Partial Least-Squares Discriminant Analysis (PLS-DA) is a popular machine learning tool that is gaining increasing attention as a useful feature selector and classifier. In an effort to understand its strengths and weaknesses, we performed a series of experiments with synthetic data and compared its performance to its close relative from which it was initially invented, namely Principal Component Analysis (PCA).
Results: We demonstrate that even though PCA ignores the information regarding the class labels of the samples, this unsupervised tool can be remarkably effective as a feature selector. In some cases, it outperforms PLS-DA, which is made aware of the class labels in its input. Our experiments range from looking at the signal-to-noise ratio in the feature selection task, to considering many practical distributions and models encountered when analyzing bioinformatics and clinical data. Other methods were also evaluated. Finally, we analyzed an interesting data set from 396 vaginal microbiome samples where the ground truth for the feature selection was available. All the 3D figures shown in this paper as well as the supplementary ones can be viewed interactively at http://biorg.cs.fiu.edu/plsda.
Conclusions: Our results highlighted the strengths and weaknesses of PLS-DA in comparison with PCA for different underlying data models.
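The contrast drawn in the abstract, a label-aware selector versus a label-blind one, can be illustrated with a small sketch. This is not the authors' experimental setup: the synthetic data model, the function names (`make_data`, `pls_first_weights`, `pca_first_loadings`, `top_features`), and all parameter values are assumptions made for illustration. PLS-DA is reduced here to the first weight vector of a single-response PLS with a 0/1-coded label (proportional to X^T y on centered data), and PCA to its first loading vector computed by power iteration. Features are ranked by the absolute value of their weight.

```python
import random


def make_data(seed=0, n=200, p=20, n_signal=3, shift=2.0):
    """Hypothetical data: two balanced classes; only the first
    n_signal features have class-dependent means (-shift / +shift)."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        mean = shift if label == 1 else -shift
        row = [rng.gauss(mean if j < n_signal else 0.0, 1.0) for j in range(p)]
        X.append(row)
        y.append(label)
    return X, y


def center_columns(X):
    """Subtract each column's mean."""
    n, p = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(p)]
    return [[row[j] - means[j] for j in range(p)] for row in X]


def normalize(v):
    """Scale a vector to unit Euclidean length."""
    norm = sum(a * a for a in v) ** 0.5
    return [a / norm for a in v]


def xt_times(Xc, s):
    """Return X^T s for a data matrix Xc and a length-n vector s."""
    p = len(Xc[0])
    return [sum(row[j] * s_i for row, s_i in zip(Xc, s)) for j in range(p)]


def pls_first_weights(Xc, y):
    """First PLS weight vector: proportional to X^T (y - mean(y)). Uses labels."""
    ybar = sum(y) / len(y)
    return normalize(xt_times(Xc, [v - ybar for v in y]))


def pca_first_loadings(Xc, iters=100):
    """First PCA loading vector via power iteration on X^T X. Ignores labels."""
    p = len(Xc[0])
    v = normalize([1.0] * p)
    for _ in range(iters):
        scores = [sum(a * b for a, b in zip(row, v)) for row in Xc]
        v = normalize(xt_times(Xc, scores))
    return v


def top_features(weights, k):
    """Indices of the k features with the largest absolute weight, sorted."""
    ranked = sorted(range(len(weights)), key=lambda j: abs(weights[j]), reverse=True)
    return sorted(ranked[:k])


if __name__ == "__main__":
    X, y = make_data()
    Xc = center_columns(X)
    print("PLS top features:", top_features(pls_first_weights(Xc, y), 3))
    print("PCA top features:", top_features(pca_first_loadings(Xc), 3))
```

In this assumed setup the class-dependent features also carry most of the variance, which is why an unsupervised ranking can recover them. Lowering `shift` relative to the noise would be expected to hurt the PCA ranking first, since it has no access to the labels.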
Pages: 10
Related papers
50 in total
  • [1] So you think you can PLS-DA?
    Ruiz-Perez, Daniel
    Guan, Haibin
    Madhivanan, Purnima
    Mathee, Kalai
    Narasimhan, Giri
    2018 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2018,
  • [2] So you think you can PLS-DA?
    Daniel Ruiz-Perez
    Haibin Guan
    Purnima Madhivanan
    Kalai Mathee
    Giri Narasimhan
    BMC Bioinformatics, 21
  • [3] So You Think You Can Dance
    Hahn, Thomas
    TANZ, 2023, (8-9): : 14 - 16
  • [4] So you think you can dance?
    Kemmerer, Richard A.
    TWENTY-THIRD ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, PROCEEDINGS, 2007, : 4 - 15
  • [5] So you think you can edit?
    Budde, Priya Prakash
    MOLECULAR BIOLOGY OF THE CELL, 2014, 25 (17) : 2539 - 2541
  • [6] SO YOU THINK YOU CAN SUPERVISE?
    Jackson, Christopher D.
    Burroughs-Ray, Desiree
    Dunlap, Natalie E.
    JOURNAL OF GENERAL INTERNAL MEDICINE, 2020, 35 (SUPPL 1) : S767 - S767
  • [7] So you think you can dance?
    Jenkinson, Paul M.
    Fotopoulou, Aikaterini
    PSYCHOLOGIST, 2010, 23 (10) : 810 - 813
  • [8] So You Think You Can Dance
    Desantis, Marissa
    DANCE MAGAZINE, 2021, 95 (06): : 32 - 33
  • [9] 'So you think you can dance'
    Macel, Emily
    DANCE MAGAZINE, 2008, 82 (09): : 48 - +
  • [10] So You Think You Can Dance
    Graham, Elyse
    AMERICAN SCHOLAR, 2018, 87 (01): : 16 - 16