An empirical comparison of dimensionality reduction methods for classifying gene and protein expression datasets

被引:0
|
作者
Lee, George [1 ]
Rodriguez, Carlos [2 ]
Madabhushi, Arlant [1 ]
机构
[1] Rutgers State Univ, Dept Biomed Engn, Piscataway, NJ 08854 USA
[2] Univ Puerto Rico, Mayaguez, PR 00681 USA
关键词
dimensionality reduction; bioinformatics; gene expression; proteomics; classification; prostate cancer; lung cancer; ovarian cancer; principal component analysis; linear discriminant analysis; multidimensional scaling; graph embedding; Isomap; locally linear embedding;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The recent explosion in availability of gene and protein expression data for cancer detection has necessitated the development of sophisticated machine learning tools for high dimensional data analysis. Previous attempts at gene expression analysis have typically used a linear dimensionality reduction method such as Principal Components Analysis (PCA). Linear dimensionality reduction methods do not however account for the inherent nonlinearity within the data. The motivation behind this work is to demonstrate that nonlinear dimensionality reduction methods are more adept at capturing the nonlinearity within the data compared to linear methods, and hence would result in better classification and potentially aid in the visualization and identification of new data classes. Consequently, in this paper, we empirically compare the performance of 3 commonly used linear versus 3 nonlinear dimensionality reduction techniques from the perspective of (a) distinguishing objects belonging to cancer and non-cancer classes and (b) new class discovery in high dimensional gene and protein expression studies for different types of cancer. Quantitative evaluation using a support vector machine and a decision tree classifier revealed statistically significant improvement in classification accuracy by using nonlinear dimensionality reduction methods compared to linear methods.
引用
收藏
页码:170 / +
页数:3
相关论文
共 50 条
  • [1] A Hybrid Heuristic Dimensionality Reduction Methods for Classifying Malaria Vector Gene Expression Data
    Arowolo, Micheal O.
    Adebiyi, Marion Olubunmi
    Adebiyi, Ayodele Ariyo
    Okesola, Olatunji Julius
    IEEE ACCESS, 2020, 8 : 182422 - 182430
  • [2] Investigating the efficacy of nonlinear dimensionality reduction schemes in classifying gene and protein expression studies
    Lee, George
    Rodriguez, Carlos
    Madabhushi, Anant
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2008, 5 (03) : 368 - 384
  • [3] Empirical comparison between autoencoders and traditional dimensionality reduction methods
    Fournier, Quentin
    Aloise, Daniel
    2019 IEEE SECOND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2019, : 211 - 214
  • [4] Comparing the dimensionality reduction methods in gene expression databases
    Borges, Helyane Bronoski
    Nievola, Julio Cesar
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (12) : 10780 - 10795
  • [5] Ferroelectric Memristive Networks for Dimensionality Reduction: A Process for Effectively Classifying Cancer Datasets
    Raj, P. Michael Preetam
    Louis, V. Jeffry
    Chatterjee, Sumit Kumar
    Kanungo, Sayan
    Kundu, Souvik
    INTEGRATED FERROELECTRICS, 2019, 201 (01) : 126 - 141
  • [6] Effective Data Dimensionality Reduction Workflow for High-Dimensional Gene Expression Datasets
    Das, Utsha
    Srizon, Azmain Yakin
    Hasan, Md Al Mehedi
    Rahman, Julia
    Ben Islam, Md Khaled
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 182 - 185
  • [7] Gene Expression Programming Ensemble for Classifying Big Datasets
    Jedrzejowicz, Joanna
    Jedrzejowicz, Piotr
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2017, PT II, 2017, 10449 : 3 - 12
  • [8] An Empirical Study of Dimensionality Reduction Methods for Biometric Recognition
    Kerdprasop, Nittaya
    Chanklan, Ratiporn
    Hirunyawanakul, Anusara
    Kerdprasop, Kittisak
    2014 7TH INTERNATIONAL CONFERENCE ON SECURITY TECHNOLOGY (SECTECH), 2014, : 26 - 29
  • [9] Comparison Of Linear Dimensionality Reduction Methods On Classification Methods
    Yildiz, Eray
    Sevim, Yusuf
    2016 NATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND BIOMEDICAL ENGINEERING (ELECO), 2016, : 161 - 164
  • [10] Comparison of feature extraction methods in dimensionality reduction
    Wu, Jee-cheng
    Chang, Chiao-Po
    Tsuei, Gwo-Chyang
    CANADIAN JOURNAL OF REMOTE SENSING, 2010, 36 (06): : 645 - 649