Estimation of Locally Relevant Subspace in High-dimensional Data

被引:6
|
作者
Thudumu, Srikanth [1 ]
Branch, Philip [1 ]
Jin, Jiong [1 ]
Singh, Jugdutt [2 ]
机构
[1] Swinburne Univ Technol, Melbourne, Vic, Australia
[2] Sarawak State Govt, Kuching, Malaysia
关键词
High-dimensionality problem; Subspace methods; Outlier Detection; Locally Relevant subspace; The curse of dimensionality problem; OUTLIER DETECTION;
D O I
10.1145/3373017.3373032
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
High-dimensional data is becoming more and more available due to the advent of big data and IoT. Having more dimensions makes data analysis cumbersome increasing the sparsity of data points due to the problem called "curse of dimensionality". To address this problem, global dimensionality reduction techniques are used; however, these techniques are ineffective in revealing hidden outliers from the high-dimensional space. This is due to the behaviour of outliers being hidden in the subspace where they belong; hence, a locally relevant subspace is needed to reveal the hidden outliers. In this paper, we present a technique that identifies a locally relevant subspace and associated low-dimensional subspaces by deriving a final correlation score. To verify the effectiveness of the technique in determining the generalised locally relevant subspace, we evaluate the results with a benchmark data set. Our comparative analysis shows that the technique derived the locally relevant subspace that consists of relevant dimensions presented in benchmark data set.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] A Survey on High-Dimensional Subspace Clustering
    Qu, Wentao
    Xiu, Xianchao
    Chen, Huangyue
    Kong, Lingchen
    MATHEMATICS, 2023, 11 (02)
  • [32] Subspace Clustering for High-Dimensional Data Using Cluster Structure Similarity
    Fatehi, Kavan
    Rezvani, Mohsen
    Fateh, Mansoor
    Pajoohan, Mohammad-Reza
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2018, 14 (03) : 38 - 55
  • [33] Dimension Reconstruction for Visual Exploration of Subspace Clusters in High-dimensional Data
    Zhou, Fangfang
    Li, Juncai
    Huang, Wei
    Zhao, Ying
    Yuan, Xiaoru
    Liang, Xing
    Shi, Yang
    2016 IEEE PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS), 2016, : 128 - 135
  • [34] An entropy weighting mixture model for subspace clustering of high-dimensional data
    Peng, Liuqing
    Zhang, Junying
    PATTERN RECOGNITION LETTERS, 2011, 32 (08) : 1154 - 1161
  • [35] Dynamic Sparse Subspace Clustering for Evolving High-Dimensional Data Streams
    Sui, Jinping
    Liu, Zhen
    Liu, Li
    Jung, Alexander
    Li, Xiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) : 4173 - 4186
  • [36] Class-specific subspace discriminant analysis for high-dimensional data
    Bouveyron, Charles
    Girard, Stephane
    Schmid, Cordelia
    SUBSPACE, LATENT STRUCTURE AND FEATURE SELECTION, 2006, 3940 : 139 - 150
  • [37] A Compressed PCA Subspace Method for Anomaly Detection in High-Dimensional Data
    Ding, Qi
    Kolaczyk, Eric D.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2013, 59 (11) : 7419 - 7433
  • [38] Accelerating Density-Based Subspace Clustering in High-Dimensional Data
    Prinzbach, Juergen
    Lauer, Tobias
    Kiefer, Nicolas
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 474 - 481
  • [39] Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
    Cai, Xiaosha
    Huang, Dong
    Wang, Chang-Dong
    Kwoh, Chee-Keong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 330 - 342
  • [40] Local-Density Subspace Distributed Clustering for High-Dimensional Data
    Geng, Yangli-ao
    Li, Qingyong
    Liang, Mingfei
    Chi, Chong-Yung
    Tan, Juan
    Huang, Heng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (08) : 1799 - 1814