Local nonlinear dimensionality reduction via preserving the geometric structure of data

被引:7
|
作者
Wang, Xiang [1 ,2 ]
Zhu, Junxing [1 ]
Xu, Zichen [3 ]
Ren, Kaijun [1 ,2 ]
Liu, Xinwang [2 ]
Wang, Fengyun [4 ]
机构
[1] Natl Univ Def Technol, Coll Meteorol & Oceanog, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Coll Comp Sci & Technol, Changsha 410073, Hunan, Peoples R China
[3] Nanchang Univ, Coll Comp Sci & Technol, Nanchang 330000, Jiangxi, Peoples R China
[4] Univ Leeds, Sch Comp, Leeds LS2 9JT, West Yorkshire, England
关键词
Dimensionality reduction; Embedding learning; Geometric preservation; Random walk;
D O I
10.1016/j.patcog.2023.109663
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction has many applications in data visualization and machine learning. Existing methods can be classified into global ones and local ones. The global methods usually learn the linear relationship in data, while the local ones learn the manifold intrinsic geometry structure, which has a significant impact on pattern recognition. However, most of existing local methods obtain an embedding with eigenvalue or singular value decomposition, where the computational complexities are very high in a large amount of high-dimensional data. In this paper, we propose a local nonlinear dimensionality reduction method named Vec2vec , which employs a neural network with only one hidden layer to reduce the computational complexity. We first build a neighborhood similarity graph from the input matrix, and then define the context of data points with the random walk properties in the graph. Finally, we train the neural network with the context of data points to learn the embedding of the matrix. We conduct extensive experiments of data classification and clustering on nine image and text datasets to evaluate the performance of our method. Experimental results show that Vec2vec is better than several state-of-the-art dimensionality reduction methods, except that it is equivalent to UMAP on data clustering tasks in the statistical hypothesis tests, but Vec2vec needs less computational time than UMAP in high-dimensional data. Furthermore, we propose a more lightweight method named Approximate Vec2vec (AVec2vec) with little performance degradation, which employs an approximate method to build the neighborhood similarity graph. AVec2vec is still better than some state-of-the-art local dimensionality reduction methods and competitive with UMAP on data classification and clustering tasks in the statistical hypothesis tests.& COPY; 2023 Published by Elsevier Ltd.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Dimensionality reduction via preserving local information
    Wang, Shangguang
    Ding, Chuntao
    Hsu, Ching-Hsien
    Yang, Fangchun
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 108 : 967 - 975
  • [2] Nonlinear Dimensionality Reduction by Local Orthogonality Preserving Alignment
    Tong Lin
    Yao Liu
    Bo Wang
    Li-Wei Wang
    Hong-Bin Zha
    Journal of Computer Science and Technology, 2016, 31 : 512 - 524
  • [3] Nonlinear Dimensionality Reduction by Local Orthogonality Preserving Alignment
    Lin, Tong
    Liu, Yao
    Wang, Bo
    Wang, Li-Wei
    Zha, Hong-Bin
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2016, 31 (03) : 512 - 524
  • [4] Unsupervised Dimensionality Reduction for Hyperspectral Imagery via Local Geometric Structure Feature Learning
    Shi, Guangyao
    Huang, Hong
    Wang, Lihua
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (08) : 1425 - 1429
  • [5] Local Geometric Structure Feature for Dimensionality Reduction of Hyperspectral Imagery
    Luo, Fulin
    Huang, Hong
    Duan, Yule
    Liu, Jiamin
    Liao, Yinghua
    REMOTE SENSING, 2017, 9 (08)
  • [6] A Local Similarity-Preserving Framework for Nonlinear Dimensionality Reduction with Neural Networks
    Wang, Xiang
    Li, Xiaoyong
    Zhu, Junxing
    Xu, Zichen
    Ren, Kaijun
    Zhang, Weiming
    Liu, Xinwang
    Yu, Kui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 376 - 391
  • [7] DIMENSIONALITY REDUCTION OF HYPERSPECTRAL IMAGES WITH LOCAL GEOMETRIC STRUCTURE FISHER ANALYSIS
    Luo, Fulin
    Huang, Hong
    Yang, Yaqiong
    Lv, ZhiYong
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 52 - 55
  • [8] A global geometric framework for nonlinear dimensionality reduction
    Tenenbaum, JB
    de Silva, V
    Langford, JC
    SCIENCE, 2000, 290 (5500) : 2319 - +
  • [9] Unsupervised approach for structure preserving dimensionality reduction
    Saxena, Amit
    Kothari, Megha
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 315 - +
  • [10] Preserving Global and Local Structures for Supervised Dimensionality Reduction
    Song, Yinglei
    Li, Yongzhong
    Qu, Junfeng
    2015 THE 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICS ENGINEERING (ICAME 2015), 2015, 28