DIMENSIONALITY REDUCTION OF HIGH-DIMENSIONAL DATA WITH A NONLINEAR PRINCIPAL COMPONENT ALIGNED GENERATIVE TOPOGRAPHIC MAPPING

被引:2
|
作者
Griebel, M. [1 ]
Hullmann, A. [1 ]
机构
[1] Univ Bonn, Inst Numer Simulat, D-53115 Bonn, Germany
来源
SIAM JOURNAL ON SCIENTIFIC COMPUTING | 2014年 / 36卷 / 03期
关键词
dimensionality reduction; generative topographic mapping; principal component analysis; density estimation; additive model; classification; EM ALGORITHM;
D O I
10.1137/130931382
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Most high-dimensional real-life data exhibit some dependencies such that data points do not populate the whole data space but lie approximately on a lower-dimensional manifold. A major problem in many data mining applications is the detection of such a manifold and the expression of the given data in terms of a moderate number of latent variables. We present a method which is derived from the generative topographic mapping (GTM) and can be seen as a nonlinear generalization of the principal component analysis (PCA). It can detect certain nonlinearities in the data but does not suffer from the curse of dimensionality with respect to the latent space dimension as the original GTM and thus allows for higher embedding dimensions. We provide experiments that show that our approach leads to an improved data reconstruction compared to the purely linear PCA and that it can furthermore be used for classification.
引用
收藏
页码:A1027 / A1047
页数:21
相关论文
共 50 条
  • [1] A sparse grid based method for generative dimensionality reduction of high-dimensional data
    Bohn, Bastian
    Garcke, Jochen
    Griebel, Michael
    JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 309 : 1 - 17
  • [2] Distance-preserving projection of high-dimensional data for nonlinear dimensionality reduction
    Yang, L
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) : 1243 - 1246
  • [3] Dimensionality reduction for visualizing high-dimensional biological data
    Malepathirana, Tamasha
    Senanayake, Damith
    Vidanaarachchi, Rajith
    Gautam, Vini
    Halgamuge, Saman
    BIOSYSTEMS, 2022, 220
  • [4] Dimensionality Reduction for Registration of High-Dimensional Data Sets
    Xu, Min
    Chen, Hao
    Varshney, Pramod K.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (08) : 3041 - 3049
  • [5] Adaptive Dimensionality Reduction Method for High-dimensional Data
    Duan, Shuyong
    Yang, Jianhua
    Han, Xu
    Liu, Guirong
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2024, 60 (17): : 283 - 296
  • [6] Principal component analysis for sparse high-dimensional data
    Raiko, Tapani
    Ilin, Alexander
    Karhunen, Juha
    NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 566 - 575
  • [7] Robust locally nonlinear embedding (RLNE) for dimensionality reduction of high-dimensional data with noise
    Xu, Yichen
    Li, Eric
    NEUROCOMPUTING, 2024, 596
  • [8] Efficient indexing of high-dimensional data through dimensionality reduction
    Goh, CH
    Lim, A
    Ooi, BC
    Tan, KL
    DATA & KNOWLEDGE ENGINEERING, 2000, 32 (02) : 115 - 130
  • [9] Multilevel Functional Principal Component Analysis for High-Dimensional Data
    Zipunnikov, Vadim
    Caffo, Brian
    Yousem, David M.
    Davatzikos, Christos
    Schwartz, Brian S.
    Crainiceanu, Ciprian
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (04) : 852 - 873
  • [10] Linear versus nonlinear dimensionality reduction of high-dimensional dynamical systems
    Smaoui, N
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2004, 25 (06): : 2107 - 2125