A novel ensemble causal feature selection approach with mutual information and group fusion strategy for multi-label data

被引:0
|
作者
Zheng, Yifeng [1 ,2 ]
Zeng, Xianlong [1 ,2 ]
Zhang, Wenjie [1 ,2 ]
Wei, Baoya [1 ,2 ]
Ren, Weishuo [1 ,2 ]
Qing, Depeng [1 ,2 ]
机构
[1] Minnan Normal Univ, Sch Comp Sci, Zhangzhou, Peoples R China
[2] Fujian Prov Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou, Peoples R China
关键词
Multi-label learning; Feature selection; Causal relationship; Mutual information; Group fusion strategy;
D O I
10.1108/IJICC-04-2024-0144
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
PurposeAs intelligent technology advances, practical applications often involve data with multiple labels. Therefore, multi-label feature selection methods have attracted much attention to extract valuable information. However, current methods tend to lack interpretability when evaluating the relationship between different types of variables without considering the potential causal relationship.Design/methodology/approachTo address the above problems, we propose an ensemble causal feature selection method based on mutual information and group fusion strategy (CMIFS) for multi-label data. First, the causal relationship between labels and features is analyzed by local causal structure learning, respectively, to obtain a causal feature set. Second, we eliminate false positive features from the obtained feature set using mutual information to improve the feature subset reliability. Eventually, we employ a group fusion strategy to fuse the obtained feature subsets from multiple data sub-space to enhance the stability of the results.FindingsExperimental comparisons are performed on six datasets to validate that our proposal can enhance the interpretation and robustness of the model compared with other methods in different metrics. Furthermore, the statistical analyses further validate the effectiveness of our approach.Originality/valueThe present study makes a noteworthy contribution to proposing a causal feature selection approach based on mutual information to obtain an approximate optimal feature subset for multi-label data. Additionally, our proposal adopts the group fusion strategy to guarantee the robustness of the obtained feature subset.
引用
收藏
页码:671 / 704
页数:34
相关论文
共 50 条
  • [1] Multi-label causal feature selection based on neighbourhood mutual information
    Wang, Jie
    Lin, Yaojin
    Li, Longzhu
    Wang, Yun-an
    Xu, Meiyan
    Chen, Jinkun
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (11) : 3509 - 3522
  • [2] Multi-label causal feature selection based on neighbourhood mutual information
    Jie Wang
    Yaojin Lin
    Longzhu Li
    Yun-an Wang
    Meiyan Xu
    Jinkun Chen
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 3509 - 3522
  • [3] Multi-Label Feature Selection with Conditional Mutual Information
    Wang, Xiujuan
    Zhou, Yuchen
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [4] Approximating mutual information for multi-label feature selection
    Lee, J.
    Lim, H.
    Kim, D. -W.
    ELECTRONICS LETTERS, 2012, 48 (15) : 929 - 930
  • [5] Convex Optimization Approach for Multi-label Feature Selection based on Mutual Information
    Lim, Hyunki
    Kim, Dae-Won
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1512 - 1517
  • [6] Multi-Label Causal Feature Selection
    Wu, Xingyu
    Jiang, Bingbing
    Yu, Kui
    Chen, Huanhuan
    Miao, Chunyan
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6430 - 6437
  • [7] Multi-label feature selection based on neighborhood mutual information
    Lin, Yaojin
    Hu, Qinghua
    Liu, Jinghua
    Chen, Jinkun
    Duan, Jie
    APPLIED SOFT COMPUTING, 2016, 38 : 244 - 256
  • [8] Granular multi-label feature selection based on mutual information
    Li, Feng
    Miao, Duoqian
    Pedrycz, Witold
    PATTERN RECOGNITION, 2017, 67 : 410 - 423
  • [9] Feature-specific mutual information variation for multi-label feature selection
    Hu, Liang
    Gao, Lingbo
    Li, Yonghao
    Zhang, Ping
    Gao, Wanfu
    INFORMATION SCIENCES, 2022, 593 : 449 - 471
  • [10] Multi-label feature selection based on minimizing feature redundancy of mutual information
    Zhou, Gaozhi
    Li, Runxin
    Shang, Zhenhong
    Li, Xiaowu
    Jia, Lianyin
    NEUROCOMPUTING, 2024, 607