Selecting the Top-k Discriminative Features Using Principal Component Analysis

被引:0
|
作者
Kane, Aminata [1 ]
Shiri, Nematollaah [1 ]
机构
[1] Concordia Univ, Comp Sci & Software Engn, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
feature selection; principal component analysis; multivariate time series; INFORMATION;
D O I
10.1109/ICDMW.2016.95
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is important for dimensionality reduction, analysis, and pattern discovery applications. We consider multivariate time series data and propose an unsupervised learning technique to identify the top-k discriminative features. The proposed technique uses statistics drawn from the Principal Component Analysis (PCA) of the input data to leverage the relative importance of the principal components along with the coefficients within the principal directions of the data to uncover the ranking of the features. We conduct numerous experiments using various benchmark datasets to study the performance of the proposed technique in terms of the discriminant power of the selected features and its ability to minimize the original data reconstruction error. Compared to major existing techniques, our results indicate increased accuracy and efficiency. We also show that our technique yields improved classification accuracy.
引用
收藏
页码:639 / 646
页数:8
相关论文
共 50 条
  • [1] On learning discriminative embeddings for optimized top-k matching
    Ghosh, Soumyadeep
    Vatsa, Mayank
    Singh, Richa
    Ratha, Nalini
    PATTERN RECOGNITION, 2025, 162
  • [2] DC-Top-k: A Novel Top-k Selecting Algorithm and Its Parallelization
    Xue, Zhengyuan
    Li, Ruixuan
    Zhang, Heng
    Gu, Xiwu
    Xu, Zhiyong
    PROCEEDINGS 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - ICPP 2016, 2016, : 370 - 379
  • [3] Robust Discriminative Principal Component Analysis
    Xu, Xiangxi
    Lai, Zhihui
    Chen, Yudong
    Kong, Heng
    BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 231 - 238
  • [4] A Fuzzy Framework For Selecting Top-k Web Service Compositions
    Benouaret, Karim
    Benslimane, Djamal
    Hadjali, Allel
    APPLIED COMPUTING REVIEW, 2011, 11 (03): : 32 - 40
  • [5] Selecting Top-k Data Science Models by Example Dataset
    Wang, Mengying
    Guan, Sheng
    Ma, Hanchao
    Bian, Yiyang
    Che, Haolai
    Daundkar, Abhishek
    Sehirlioglu, Alp
    Wu, Yinghui
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2686 - 2695
  • [6] Top-k multi-class SVM using multiple features
    Yan, Caixia
    Luo, Minnan
    Liu, Huan
    Li, Zhihui
    Zheng, Qinghua
    INFORMATION SCIENCES, 2018, 432 : 479 - 494
  • [7] Optimal algorithms for selecting top-k combinations of attributes: theory and applications
    Lin, Chunbin
    Lu, Jiaheng
    Wei, Zhewei
    Wang, Jianguo
    Xiao, Xiaokui
    VLDB JOURNAL, 2018, 27 (01): : 27 - 52
  • [8] Optimal algorithms for selecting top-k combinations of attributes: theory and applications
    Chunbin Lin
    Jiaheng Lu
    Zhewei Wei
    Jianguo Wang
    Xiaokui Xiao
    The VLDB Journal, 2018, 27 : 27 - 52
  • [9] Using Principal Component Analysis for Selecting Network Behavioral Anomaly Metrics
    Gregorio-de Souza, Ian
    Berk, Vincent
    Barsamian, Alex
    SENSORS, AND COMMAND, CONTROL, COMMUNICATIONS, AND INTELLIGENCE (C3I) TECHNOLOGIES FOR HOMELAND SECURITY AND HOMELAND DEFENSE IX, 2010, 7666
  • [10] Dynamic Re-ordering in Mining Top-k Productive Discriminative Patterns
    Kameya, Yoshitaka
    Ito, Ken'ya
    2017 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2017, : 172 - 177