A novel ensemble causal feature selection approach with mutual information and group fusion strategy for multi-label data

Authors
Zheng, Yifeng [1, 2]
Zeng, Xianlong [1, 2]
Zhang, Wenjie [1, 2]
Wei, Baoya [1, 2]
Ren, Weishuo [1, 2]
Qing, Depeng [1, 2]
Affiliations
[1] Minnan Normal Univ, Sch Comp Sci, Zhangzhou, Peoples R China
[2] Fujian Prov Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou, Peoples R China
Keywords
Multi-label learning; Feature selection; Causal relationship; Mutual information; Group fusion strategy
DOI
10.1108/IJICC-04-2024-0144
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Purpose: As intelligent technology advances, practical applications increasingly involve data with multiple labels, and multi-label feature selection methods have attracted much attention as a way to extract valuable information. However, current methods tend to lack interpretability because they evaluate the relationships between different types of variables without considering potential causal relationships.

Design/methodology/approach: To address this problem, we propose an ensemble causal feature selection method based on mutual information and a group fusion strategy (CMIFS) for multi-label data. First, the causal relationships between labels and features are analyzed by local causal structure learning to obtain a causal feature set. Second, we eliminate false-positive features from the obtained feature set using mutual information to improve the reliability of the feature subset. Finally, we employ a group fusion strategy to fuse the feature subsets obtained from multiple data sub-spaces to enhance the stability of the results.

Findings: Experimental comparisons on six datasets show that our proposal enhances the interpretability and robustness of the model compared with other methods across different metrics. Statistical analyses further validate the effectiveness of our approach.

Originality/value: This study contributes a causal feature selection approach based on mutual information that obtains an approximately optimal feature subset for multi-label data. Additionally, our proposal adopts the group fusion strategy to guarantee the robustness of the obtained feature subset.
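The second and third stages described in the abstract (mutual-information filtering and group fusion) can be illustrated with a minimal Python sketch. This is not the authors' CMIFS implementation. The local causal structure learning stage is not reproduced, so the candidate feature set is assumed to be given. The function names, the summed-MI score, the bootstrap sampling used to stand in for "data sub-spaces", the threshold, and the majority-vote rule are all assumptions made for illustration, and the data are assumed to be discrete.

```python
import math
import random
from collections import Counter


def mutual_information(xs, ys):
    """Empirical mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x, y) * log(p(x, y) / (p(x) * p(y))), written in terms of counts
        mi += (c / n) * math.log(c * n / (px[x] * py[y]))
    return mi


def filter_by_mi(X, Y, candidates, threshold):
    """Keep candidates whose MI summed over all label columns exceeds threshold.

    Assumption: summed MI is a stand-in score, not the paper's criterion.
    """
    n_labels = len(Y[0])
    kept = []
    for j in candidates:
        col = [row[j] for row in X]
        score = sum(
            mutual_information(col, [y[k] for y in Y]) for k in range(n_labels)
        )
        if score > threshold:
            kept.append(j)
    return kept


def group_fusion(X, Y, candidates, n_groups=5, threshold=0.05, min_votes=3, seed=0):
    """Filter on several bootstrap samples and keep features by vote count.

    Assumption: bootstrap samples play the role of data sub-spaces, and a
    simple vote threshold plays the role of the fusion rule.
    """
    rng = random.Random(seed)
    n = len(X)
    votes = Counter()
    for _ in range(n_groups):
        idx = [rng.randrange(n) for _ in range(n)]
        Xs = [X[i] for i in idx]
        Ys = [Y[i] for i in idx]
        votes.update(filter_by_mi(Xs, Ys, candidates, threshold))
    return sorted(j for j, v in votes.items() if v >= min_votes)
```

In this sketch a feature that is constant across samples always scores zero mutual information and is therefore filtered out in every group, while a feature that copies one of the labels is retained as long as that label varies within the sample.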
Pages: 671-704
Page count: 34