A Dimensionality Reduction and Reconstruction Method for Data with Multiple Connected Components

被引:0
|
作者
Yao, Yuqin [1 ]
Gao, Yang [2 ]
Long, Zhiguo [2 ]
Meng, Hua [1 ]
Sioutis, Michael [3 ]
机构
[1] Southwest Jiaotong Univ, Sch Math, Chengdu, Peoples R China
[2] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu, Peoples R China
[3] Univ Bamberg, Fac Informat Syst & Appl Comp Sci, Bavaria, Germany
关键词
LE; Dimensionality reduction; Manifold learning; Topological connectivity;
D O I
10.1109/BDAI56143.2022.9862787
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the literature on dimensionality reduction, including Spectral Clustering and Laplacian Eigenmaps, one of the core ideas is to reconstruct data based on similarities between data points, which makes the choice of similarity matrices a key factor on the performance of a dimensionality reduction model. Traditional methods like K-nearest neighbor, is an element of-neighbor, and Gaussian Kernel for constructing similarity matrices based on data distribution characteristics have been extensively studied. However, these methods usually focus on only a specific level of the data when considering the similarity between data points, which might result in a great flaw in data reconstruction when data possess hierarchical and multiple groups structure. Specifically, such methods can only characterize the similarity between data within a group, but ignore the similarity between different groups. To overcome this deficiency, this paper proposes a hierarchical way of similarity matrix construction, by introducing strong, weak, and intra- and inter-cluster similarities to describe relations between multiple levels. The proposed method can better adapt to complex data with multiple connected components, and the effectiveness of it is verified in a series of experiments on synthetic and real-world datasets.
引用
收藏
页码:87 / 92
页数:6
相关论文
共 50 条
  • [41] Nonlinear dimensionality reduction in climate data
    Gámez, AJ
    Zhou, CS
    Timmermann, A
    Kurths, J
    NONLINEAR PROCESSES IN GEOPHYSICS, 2004, 11 (03) : 393 - 398
  • [42] Dimensionality Reduction Approach for Genotypic Data
    Al-Husain, Luluah
    Hafez, Alaaeldin M.
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2015, : 202 - 206
  • [43] Denoising and dimensionality reduction of genomic data
    Capobianco, E
    FLUCTUATIONS AND NOISE IN BIOLOGICAL, BIOPHYSICAL, AND BIOMEDICAL SYSTEMS III, 2005, 5841 : 69 - 80
  • [44] Data visualization by nonlinear dimensionality reduction
    Gisbrecht, Andrej
    Hammer, Barbara
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 5 (02) : 51 - 73
  • [45] Data Dimensionality Reduction in Anthropometrical Investigations
    Kordecki, Henryk
    Knapik-Kordecka, Maria
    Karmowski, Mikolaj
    Gworys, Bohdan
    Karmowski, Andrzej
    ADVANCES IN CLINICAL AND EXPERIMENTAL MEDICINE, 2012, 21 (05): : 601 - 606
  • [46] Dimensionality Reduction to Dynamically Reduce Data
    Sanderson, Dominic
    Malin, Ben
    Kalganova, Tatiana
    Ott, Richard
    2022 IEEE PHYSICAL ASSURANCE AND INSPECTION OF ELECTRONICS (PAINE), 2022, : 141 - 145
  • [47] PCA Dimensionality Reduction for Categorical Data
    Denisiuk, Aleksander
    COMPUTATIONAL SCIENCE, ICCS 2024, PT III, 2024, 14834 : 179 - 186
  • [48] Dimensionality reduction for mass Spectrometry data
    Liu, Yihui
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 203 - 213
  • [49] Dimensionality reduction for mass spectrometry data
    Liu, Yihui
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2007, 4632 : 203 - 213
  • [50] Supervised dimensionality reduction for big data
    Joshua T. Vogelstein
    Eric W. Bridgeford
    Minh Tang
    Da Zheng
    Christopher Douville
    Randal Burns
    Mauro Maggioni
    Nature Communications, 12