A Data Feature Extraction Method Based on the NOTEARS Causal Inference Algorithm

被引:3
|
作者
Wang, Hairui [1 ]
Li, Junming [1 ]
Zhu, Guifu [2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650504, Peoples R China
[2] Kunming Univ Sci & Technol, Informat Technol Construct Management Ctr, Kunming 650504, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 14期
基金
中国国家自然科学基金;
关键词
causal inference; relevance; feature extraction; compare; FEATURE-SELECTION; REGRESSION; CLASSIFICATION;
D O I
10.3390/app13148438
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Extracting effective features from high-dimensional datasets is crucial for determining the accuracy of regression and classification models. Model predictions based on causality are known for their robustness. Thus, this paper introduces causality into feature selection and utilizes Feature Selection based on NOTEARS causal discovery (FSNT) for effective feature extraction. This method transforms the structural learning algorithm into a numerical optimization problem, enabling the rapid identification of the globally optimal causality diagram between features and the target variable. To assess the effectiveness of the FSNT algorithm, this paper evaluates its performance by employing 10 regression algorithms and 8 classification algorithms for regression and classification predictions on six real datasets from diverse fields. These results are then compared with three mainstream feature selection algorithms. The results indicate a significant average decline of 54.02% in regression prediction achieved by the FSNT algorithm. Furthermore, the algorithm exhibits exceptional performance in classification prediction, leading to an enhancement in the precision value. These findings highlight the effectiveness of FSNT in eliminating redundant features and significantly improving the accuracy of model predictions.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A DAG-NOTEARS-based Data Mining Method for Faulty Samples
    Tie, WeiSong
    Li, Kang
    Ye, Hao
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 82 - 87
  • [2] A fast bootstrap algorithm for causal inference with large data
    Kosko, Matthew
    Wang, Lin
    Santacatterina, Michele
    STATISTICS IN MEDICINE, 2024, 43 (15) : 2894 - 2927
  • [3] A polygonal line algorithm based nonlinear feature extraction method
    Zhang, F
    FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 281 - 288
  • [4] Face feature extraction method based on part of labeled data
    Cui, Peng
    Zhang, Ru-Bo
    Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2012, 23 (03): : 554 - 560
  • [5] Feature inference and the causal structure of categories
    Rehder, B
    Burnett, RC
    COGNITIVE PSYCHOLOGY, 2005, 50 (03) : 264 - 314
  • [6] Neuroevolutionary Feature Representations for Causal Inference
    Burkhart, Michael C.
    Ruiz, Gabriel
    COMPUTATIONAL SCIENCE, ICCS 2022, PT II, 2022, : 3 - 10
  • [7] A Sparse Feature Extraction Method Based on Improved Quantum Evolutionary Algorithm
    Yu F.-J.
    Liu Y.-C.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2020, 40 (05): : 512 - 518
  • [8] A Feature Extraction Method of Image Based on Dimentional Transformation and SVD Algorithm
    Jin, Ran
    Dong, Zhuojun
    CEA'09: PROCEEDINGS OF THE 3RD WSEAS INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS, 2009, : 101 - +
  • [9] Planar Feature Extraction and Fitting Method Based on Density Clustering Algorithm
    Zhang, Min
    Luo, Minzhou
    Xu, Xiaobin
    Tan, Zhiying
    Yang, Hao
    Li, Zhihao
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 583 - 587
  • [10] Feature extraction method of football fouls based on deep learning algorithm
    Ma W.
    Lv Y.
    International Journal of Information and Communication Technology, 2023, 22 (04) : 404 - 421