A novel ensemble causal feature selection approach with mutual information and group fusion strategy for multi-label data

Cited by: 0
Authors
Zheng, Yifeng [1, 2]
Zeng, Xianlong [1, 2]
Zhang, Wenjie [1, 2]
Wei, Baoya [1, 2]
Ren, Weishuo [1, 2]
Qing, Depeng [1, 2]
Affiliations
[1] Minnan Normal Univ, Sch Comp Sci, Zhangzhou, Peoples R China
[2] Fujian Prov Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou, Peoples R China
Keywords
Multi-label learning; Feature selection; Causal relationship; Mutual information; Group fusion strategy
DOI
10.1108/IJICC-04-2024-0144
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Purpose: As intelligent technology advances, practical applications often involve data with multiple labels. Multi-label feature selection methods, which extract the most informative features from such data, have therefore attracted much attention. However, current methods tend to lack interpretability because they evaluate the relationships between different types of variables without considering potential causal relationships.

Design/methodology/approach: To address these problems, we propose an ensemble causal feature selection method based on mutual information and a group fusion strategy (CMIFS) for multi-label data. First, the causal relationships between labels and features are analyzed by local causal structure learning to obtain a causal feature set. Second, we use mutual information to eliminate false-positive features from this set, improving the reliability of the feature subset. Finally, we employ a group fusion strategy to fuse the feature subsets obtained from multiple data subspaces, enhancing the stability of the results.

Findings: Experimental comparisons on six datasets show that, across different metrics, our proposal enhances the interpretability and robustness of the model compared with other methods. Statistical analyses further validate the effectiveness of our approach.

Originality/value: This study contributes a causal feature selection approach based on mutual information that obtains an approximately optimal feature subset for multi-label data. Additionally, our proposal adopts a group fusion strategy to improve the robustness of the obtained feature subset.
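The second and third steps of the methodology (mutual-information filtering, then fusing subsets from several data groups) can be illustrated with a minimal sketch. This is not the authors' CMIFS implementation: the abstract gives no thresholds, fusion rule, or subspace construction, so the function names, the `threshold` value, the use of bootstrap samples as data groups, and the majority-vote fusion rule are all assumptions made for illustration. The causal step (local causal structure learning) is not implemented here; the candidate feature set is simply taken as an input. Features and labels are assumed to be discrete.

```python
import numpy as np


def mutual_information(x, y):
    """Empirical mutual information (in nats) between two discrete 1-D arrays."""
    _, xi = np.unique(np.ravel(x), return_inverse=True)
    _, yi = np.unique(np.ravel(y), return_inverse=True)
    # Build the joint frequency table, then normalize it to probabilities.
    joint = np.zeros((xi.max() + 1, yi.max() + 1))
    np.add.at(joint, (xi, yi), 1.0)
    joint /= joint.sum()
    px = joint.sum(axis=1, keepdims=True)   # marginal of x, shape (a, 1)
    py = joint.sum(axis=0, keepdims=True)   # marginal of y, shape (1, b)
    nz = joint > 0                          # skip empty cells (0 * log 0 = 0)
    return float(np.sum(joint[nz] * np.log(joint[nz] / (px @ py)[nz])))


def mi_filter(X, Y, candidates, threshold=0.01):
    """Keep candidates whose MI with at least one label exceeds `threshold`.

    `threshold` is a hypothetical cut-off, not a value from the paper.
    """
    kept = []
    for j in candidates:
        score = max(mutual_information(X[:, j], Y[:, k])
                    for k in range(Y.shape[1]))
        if score > threshold:
            kept.append(j)
    return kept


def group_fusion(subsets, min_votes):
    """Fuse feature subsets by voting (assumed rule): keep a feature if it
    appears in at least `min_votes` of the subsets."""
    votes = {}
    for subset in subsets:
        for j in set(subset):
            votes[j] = votes.get(j, 0) + 1
    return sorted(j for j, v in votes.items() if v >= min_votes)


def select_features(X, Y, candidates, n_groups=5, threshold=0.01, seed=0):
    """Filter the candidates on several bootstrap samples, then fuse by majority."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    subsets = []
    for _ in range(n_groups):
        idx = rng.choice(n, size=n, replace=True)  # one bootstrap data group
        subsets.append(mi_filter(X[idx], Y[idx], candidates, threshold))
    return group_fusion(subsets, min_votes=n_groups // 2 + 1)
```

Voting across groups is one simple way to make the selection less sensitive to any single sample of the data; the paper's actual fusion strategy may differ.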
Pages: 671 - 704
Page count: 34
Related papers
50 in total
  • [21] A Fast Feature Selection Method Based on Mutual Information in Multi-label Learning
    Sun, Zhenqiang
    Zhang, Jia
    Luo, Zhiming
    Cao, Donglin
    Li, Shaozi
    COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2018, 2019, 917 : 424 - 437
  • [22] Distributed multi-label feature selection using individual mutual information measures
    Gonzalez-Lopez, Jorge
    Ventura, Sebastian
    Cano, Alberto
    KNOWLEDGE-BASED SYSTEMS, 2020, 188 (188)
  • [23] A Multi-Label Feature Selection Based on Mutual Information and Ant Colony Optimization
    Hatami, Mohammad
    Mehrmohammadi, Pooya
    Moradi, Parham
    2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 1589 - 1594
  • [24] Dynamic mutual information-based feature selection for multi-label learning
    Kim, Kyung-Jun
    Jun, Chi-Hyuck
    INTELLIGENT DATA ANALYSIS, 2023, 27 (04) : 891 - 909
  • [25] Online Multi-label Group Feature Selection
    Liu, Jinghua
    Lin, Yaojin
    Wu, Shunxiang
    Wang, Chenxi
    KNOWLEDGE-BASED SYSTEMS, 2018, 143 : 42 - 57
  • [26] A Novel Online Multi-label Feature Selection Approach for Multi-dimensional Streaming Data
    Zhang, Zhanyun
    Luo, Chuan
    Li, Tianrui
    Chen, Hongmei
    Liu, Dun
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 159 - 171
  • [27] A Label Correlation Based Weighting Feature Selection Approach for Multi-label Data
    Liu, Lu
    Zhang, Jing
    Li, Peipei
    Zhang, Yuhong
    Hu, Xuegang
    WEB-AGE INFORMATION MANAGEMENT, PT II, 2016, 9659 : 369 - 379
  • [28] A Multi-Objective online streaming Multi-Label feature selection using mutual information
    Rafie, Azar
    Moradi, Parham
    Ghaderzadeh, Abdulbaghi
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 216
  • [29] Multi-label Online Streaming Feature Selection Based on Spectral Granulation and Mutual Information
    Wang, Huaming
    Yu, Dongming
    Li, Yuan
    Li, Zhixing
    Wang, Guoyin
    ROUGH SETS, IJCRS 2018, 2018, 11103 : 215 - 228
  • [30] MFC: Initialization method for multi-label feature selection based on conditional mutual information
    Lim, Hyunki
    Kim, Dae-Won
    NEUROCOMPUTING, 2020, 382 : 40 - 51