Online landmark replacement for out-of-sample dimensionality reduction methods

被引:0
|
作者
Thongprayoon, Chanon [1 ]
Masuda, Naoki [1 ,2 ,3 ]
机构
[1] SUNY Buffalo, Dept Math, Buffalo, NY 14068 USA
[2] SUNY Buffalo, Inst Artificial Intelligence & Data Sci, Buffalo, NY 14068 USA
[3] Kobe Univ, Ctr Computat Social Sci, Kobe, Japan
基金
日本科学技术振兴机构;
关键词
time-series analysis; dimensionality reduction; geometric graph; temporal networks; ANOMALY DETECTION;
D O I
10.1098/rspa.2023.0966
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A strategy to assist visualization and analysis of large and complex datasets is dimensionality reduction, with which one maps each data point into a low-dimensional manifold. However, various dimensionality reduction techniques are computationally infeasible for large data. Out-of-sample techniques aim to resolve this difficulty; they only apply the dimensionality reduction technique on a small portion of data, referred to as landmarks, and determine the embedding coordinates of the other points using landmarks as references. Out-of-sample techniques have been applied to online settings, or when data arrive as time series. However, existing online out-of-sample techniques use either all the previous data points as landmarks or the fixed set of landmarks and therefore are potentially not good at capturing the geometry of the entire dataset when the time series is non-stationary. To address this problem, we propose an online landmark replacement algorithm for out-of-sample techniques using geometric graphs and the minimal dominating set on them. We mathematically analyse some properties of the proposed algorithm, particularly focusing on the case of landmark multi-dimensional scaling as the out-of-sample technique, and test its performance on synthetic and empirical time-series data.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Out-of-Sample Extension for Dimensionality Reduction of Noisy Time Series
    Dadkhahi, Hamid
    Duarte, Marco F.
    Marlin, Benjamin M.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) : 5435 - 5446
  • [2] Out-of-Sample Error Estimation: the Blessing of High Dimensionality
    Oneto, Luca
    Ghio, Alessandro
    Ridella, Sandro
    Ortiz, Jorge Luis Reyes
    Anguita, Davide
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 637 - 644
  • [3] A Sparse-Grid-Based Out-of-Sample Extension for Dimensionality Reduction and Clustering with Laplacian Eigenmaps
    Peherstorfer, Benjamin
    Pflueger, Dirk
    Bungartz, Hans-Joachim
    AI 2011: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7106 : 112 - 121
  • [4] A graph regularized dimension reduction method for out-of-sample data
    Tang, Mengfan
    Nie, Feiping
    Jain, Ramesh
    NEUROCOMPUTING, 2017, 225 : 58 - 63
  • [5] Robust out-of-sample inference
    McCracken, MW
    JOURNAL OF ECONOMETRICS, 2000, 99 (02) : 195 - 223
  • [6] Out-of-Sample Extensions for Non-Parametric Kernel Methods
    Pan, Binbin
    Chen, Wen-Sheng
    Chen, Bo
    Xu, Chen
    Lai, Jianhuang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (02) : 334 - 345
  • [7] Landmark diffusion maps (L-dMaps): Accelerated manifold learning out-of-sample extension
    Long, Andrew W.
    Ferguson, Andrew L.
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2019, 47 (01) : 190 - 211
  • [8] Connecting the out-of-sample and pre-image problems in kernel methods
    Arias, Pablo
    Randall, Gregory
    Sapiro, Guillermo
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 524 - +
  • [9] Out-of-Sample Extension of the Fuzzy Transform
    Patane, Giuseppe
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (03) : 1424 - 1434
  • [10] The out-of-sample performance of carry trades
    Hsu, Po-Hsuan
    Taylor, Mark P.
    Wang, Zigan
    Li, Yan
    JOURNAL OF INTERNATIONAL MONEY AND FINANCE, 2024, 143