DIMENSIONALITY REDUCTION OF HIGH-DIMENSIONAL DATA WITH A NONLINEAR PRINCIPAL COMPONENT ALIGNED GENERATIVE TOPOGRAPHIC MAPPING

Cited by: 2
Authors
Griebel, M. [1 ]
Hullmann, A. [1 ]
Affiliations
[1] Univ Bonn, Inst Numer Simulat, D-53115 Bonn, Germany
Source
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2014 / Vol. 36 / No. 03
Keywords
dimensionality reduction; generative topographic mapping; principal component analysis; density estimation; additive model; classification; EM ALGORITHM;
DOI
10.1137/130931382
Chinese Library Classification (CLC) number
O29 [Applied Mathematics];
Subject classification code
070104;
Abstract
Most high-dimensional real-life data exhibit some dependencies such that data points do not populate the whole data space but lie approximately on a lower-dimensional manifold. A major problem in many data mining applications is the detection of such a manifold and the expression of the given data in terms of a moderate number of latent variables. We present a method which is derived from the generative topographic mapping (GTM) and can be seen as a nonlinear generalization of the principal component analysis (PCA). It can detect certain nonlinearities in the data but does not suffer from the curse of dimensionality with respect to the latent space dimension as the original GTM and thus allows for higher embedding dimensions. We provide experiments that show that our approach leads to an improved data reconstruction compared to the purely linear PCA and that it can furthermore be used for classification.
Pages: A1027 - A1047
Number of pages: 21