A Hybrid Feature Extraction Selection Approach for High-Dimensional Non-Gaussian Data Clustering

被引:99
|
作者
Boutemedjet, Sabri [1 ]
Bouguila, Nizar [2 ]
Ziou, Djemel [1 ]
机构
[1] Univ Sherbrooke, Dept Informat, Sherbrooke, PQ J1K 2R1, Canada
[2] Concordia Univ, Concordia Inst Informat Engn CIISE, Montreal, PQ H3G 1T7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Unsupervised learning; mixture models; feature selection; dimensionality reduction; generalized Dirichlet mixture; EM; MML; information theory; object image categorization; STATISTICAL PATTERN-RECOGNITION; DIRICHLET MIXTURE MODEL; UNSUPERVISED SELECTION;
D O I
10.1109/TPAMI.2008.155
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an unsupervised approach for feature selection and extraction in mixtures of generalized Dirichlet (GD) distributions. Our method defines a new mixture model that is able to extract independent and non-Gaussian features without loss of accuracy. The proposed model is learned using the Expectation-Maximization algorithm by minimizing the message length of the data set. Experimental results show the merits of the proposed methodology in the categorization of object images.
引用
收藏
页码:1429 / 1443
页数:15
相关论文
共 50 条
  • [1] Unsupervised Hybrid Feature Extraction Selection for High-Dimensional Non-Gaussian Data Clustering with Variational Inference
    Fan, Wentao
    Bouguila, Nizar
    Ziou, Djemel
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (07) : 1670 - 1685
  • [2] Model-based approach for high-dimensional non-Gaussian visual data clustering and feature weighting
    Elguebaly, Tarek
    Bouguila, Nizar
    Digital Signal Processing: A Review Journal, 2015, 40 (01): : 63 - 79
  • [3] Model-based approach for high-dimensional non-Gaussian visual data clustering and feature weighting
    Elguebaly, Tarek
    Bouguila, Nizar
    DIGITAL SIGNAL PROCESSING, 2015, 40 : 63 - 79
  • [4] Clustering high-dimensional data via feature selection
    Liu, Tianqi
    Lu, Yu
    Zhu, Biqing
    Zhao, Hongyu
    BIOMETRICS, 2023, 79 (02) : 940 - 950
  • [5] Simultaneous Non-gaussian Data Clustering, Feature Selection and Outliers Rejection
    Bouguila, Nizar
    Ziou, Djemel
    Boutemedjet, Sabri
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, 2011, 6744 : 364 - 369
  • [6] Hybrid Feature Selection for High-Dimensional Manufacturing Data
    Sun, Yajuan
    Yu, Jianlin
    Li, Xiang
    Wu, Ji Yan
    Lu, Wen Feng
    2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
  • [7] A hybrid feature selection method for high-dimensional data
    Taheri, Nooshin
    Nezamabadi-pour, Hossein
    2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 141 - 145
  • [8] A hybrid feature selection scheme for high-dimensional data
    Ganjei, Mohammad Ahmadi
    Boostani, Reza
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 113
  • [9] On online high-dimensional spherical data clustering and feature selection
    Amayri, Ola
    Bouguila, Nizar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (04) : 1386 - 1398
  • [10] A hybrid feature selection approach based on ensemble method for high-dimensional data
    Rouhi, Amirreza
    Nezamabadi-pour, Hossein
    2017 2ND CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC), 2017, : 16 - 20