Unsupervised feature selection via multi-step markov probability relationship

被引:1
|
作者
Min, Yan [1 ]
Ye, Mao [1 ]
Tian, Liang [1 ]
Jian, Yulin [1 ]
Zhu, Ce [2 ]
Yang, Shangming [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 611731, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Unsupervised feature selection; Data structure preserving; Multi-step Markov transition probability; Machine learning; DIMENSIONALITY REDUCTION; RECOGNITION;
D O I
10.1016/j.neucom.2021.04.073
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is a widely used dimension reduction technique to select feature subsets because of its interpretability. Many methods have been proposed and achieved good results, in which the relationships between adjacent data points are mainly concerned. But the possible associations between data pairs that are not adjacent are always neglected. Different from previous methods, we propose a novel and very simple approach for unsupervised feature selection, named MMFS (Multi-step Markov Probability Relationship for Feature Selection). The idea is using multi-step Markov transition probability to describe the relation between any data pair. Two ways from the positive and negative viewpoints are employed respectively to keep the data structure after feature selection. From the positive viewpoint, the maximum transition probability that can be reached in a certain number of steps is used to describe the relation between two points. Then, the features which can keep the compact data structure are selected. From the viewpoint of negative, the minimum transition probability that can be reached in a certain number of steps is used to describe the relation between two points. On the contrary, the features that least maintain the loose data structure are selected. The two ways can also be combined. Thus three algorithms are proposed. Our main contributions are a novel feature section approach which uses multi-step transition probability to characterize the data structure, and three algorithms proposed from the positive and negative aspects for keeping data structure and select the features to preserve such structure. The performance of our approach is compared with the state-of-the-art methods on eight real-world data sets, and the experimental results show that the proposed MMFS is effective in unsupervised feature selection. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:241 / 253
页数:13
相关论文
共 50 条
  • [1] Multi-Step Iterative Algorithm for Feature Selection on Dynamic Documents
    Bafna, Prafulla Bharat
    Shirwaikar, Shailaja
    Pramod, Dhanya
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2016, 6 (02) : 24 - 40
  • [2] Robust unsupervised feature selection via data relationship learning
    Huang, Pei
    Kong, Zhaoming
    Xie, Mengying
    Yang, Xiaowei
    PATTERN RECOGNITION, 2023, 142
  • [3] Temporal Feature Selection for Multi-Step Ahead Reheater Temperature Prediction
    Gui, Ning
    Lou, Jieli
    Qiu, Zhifeng
    Gui, Weihua
    PROCESSES, 2019, 7 (07) : 1 - 12
  • [4] MULTI-STEP ONLINE UNSUPERVISED DOMAIN ADAPTATION
    Moon, J. H.
    Das, Debasnnt
    Lee, C. S. George
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4172 - 4176
  • [5] Isotropy properties of the multi-step Markov symbolic sequences
    Apostolov, S. S.
    Mayzelis, Z. A.
    Usatenko, O. V.
    Yampol'skii, V. A.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2007, 376 : 165 - 172
  • [6] On multi-step MLE-process for Markov sequences
    Yu. A. Kutoyants
    A. Motrunich
    Metrika, 2016, 79 : 705 - 724
  • [7] On multi-step MLE-process for Markov sequences
    Kutoyants, Yu. A.
    Motrunich, A.
    METRIKA, 2016, 79 (06) : 705 - 724
  • [8] Unsupervised Feature Selection via Hypergraph Embedding
    Zhang, Zhihong
    Ren, Peng
    Hancock, Edwin R.
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [9] Multi-Step Unsupervised Domain Adaptation in Image and Feature Space for Synthetic Aperture Radar Image Terrain Classification
    Ren, Zhongle
    Du, Zhe
    Zhang, Yu
    Sha, Feng
    Li, Weibin
    Hou, Biao
    REMOTE SENSING, 2024, 16 (11)
  • [10] Fuzzy Markov predictor in multi-step electric load forecasting
    Teixeira, MA
    Zaverucha, G
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 3065 - 3070