A novel ensemble causal feature selection approach with mutual information and group fusion strategy for multi-label data

Cited by: 0
Authors
Zheng, Yifeng [1, 2]
Zeng, Xianlong [1, 2]
Zhang, Wenjie [1, 2]
Wei, Baoya [1, 2]
Ren, Weishuo [1, 2]
Qing, Depeng [1, 2]
Affiliations
[1] Minnan Normal Univ, Sch Comp Sci, Zhangzhou, Peoples R China
[2] Fujian Prov Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou, Peoples R China
Keywords
Multi-label learning; Feature selection; Causal relationship; Mutual information; Group fusion strategy
DOI
10.1108/IJICC-04-2024-0144
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Purpose: As intelligent technology advances, practical applications increasingly involve data with multiple labels, so multi-label feature selection methods have attracted much attention as a way to extract valuable information. However, current methods evaluate the relationships between different types of variables without considering potential causal relationships, and therefore tend to lack interpretability.
Design/methodology/approach: To address these problems, we propose an ensemble causal feature selection method based on mutual information and a group fusion strategy (CMIFS) for multi-label data. First, local causal structure learning is used to analyze the causal relationships between labels and features, yielding a causal feature set. Second, mutual information is used to eliminate false-positive features from this set, improving the reliability of the feature subset. Finally, a group fusion strategy fuses the feature subsets obtained from multiple data subspaces to enhance the stability of the results.
Findings: Experimental comparisons on six datasets show that, across different metrics, the proposal enhances the interpretability and robustness of the model compared with other methods. Statistical analyses further validate the effectiveness of the approach.
Originality/value: This study contributes a causal feature selection approach based on mutual information that obtains an approximately optimal feature subset for multi-label data. In addition, the proposal adopts a group fusion strategy to improve the robustness of the obtained feature subset.
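The last two steps of the abstract (mutual-information filtering and fusion across data subspaces) can be illustrated with a minimal sketch. This is not the CMIFS algorithm from the paper: the local causal structure learning step is not implemented, a plain mutual-information threshold stands in for candidate selection, and majority voting over random row subsamples is an assumed fusion rule. All function names and parameters (`mutual_information`, `select_features`, `group_fusion`, `threshold`, `min_votes`, and so on) are hypothetical and chosen only for illustration.

```python
import math
import random
from collections import Counter


def mutual_information(xs, ys):
    """Empirical mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * math.log(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def select_features(X, Y, threshold):
    """Keep feature j if its MI with at least one label reaches the threshold.

    Assumption: this simple filter stands in for the paper's causal
    discovery plus false-positive removal; it is not the authors' procedure.
    """
    n_features, n_labels = len(X[0]), len(Y[0])
    kept = set()
    for j in range(n_features):
        feature = [row[j] for row in X]
        for k in range(n_labels):
            label = [row[k] for row in Y]
            if mutual_information(feature, label) >= threshold:
                kept.add(j)
                break
    return kept


def group_fusion(X, Y, n_groups=5, sample_frac=0.8, threshold=0.05,
                 min_votes=3, seed=0):
    """Fuse feature subsets from several row subsamples by voting.

    Assumption: majority voting is an illustrative fusion rule only.
    """
    rng = random.Random(seed)
    n = len(X)
    votes = Counter()
    for _ in range(n_groups):
        idx = rng.sample(range(n), max(1, int(sample_frac * n)))
        subset = select_features(
            [X[i] for i in idx], [Y[i] for i in idx], threshold
        )
        votes.update(subset)
    return sorted(j for j, v in votes.items() if v >= min_votes)


# Toy data: feature 0 copies label 0; feature 1 and label 1 are constant.
X = [[i % 2, 7] for i in range(40)]
Y = [[i % 2, 1] for i in range(40)]
selected = group_fusion(X, Y)
```

In this toy setup the constant feature carries zero mutual information with every label, so only the informative feature can collect votes. Voting across subsamples is one simple way to make the selected subset less sensitive to any single draw of the data.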
Pages: 671-704
Number of pages: 34
Related papers
50 in total
  • [31] Mutual information based multi-label feature selection via constrained convex optimization
    Sun, Zhenqiang
    Zhang, Jia
    Dai, Liang
    Li, Candong
    Zhou, Changen
    Xin, Jiliang
    Li, Shaozi
    NEUROCOMPUTING, 2019, 329 : 447 - 456
  • [32] MFSJMI: Multi-label feature selection considering join mutual information and interaction weight
    Zhang, Ping
    Liu, Guixia
    Song, Jiazhi
    PATTERN RECOGNITION, 2023, 138
  • [33] Label relaxation and shared information for multi-label feature selection
    Fan, Yuling
    Chen, Xu
    Luo, Shimu
    Liu, Peizhong
    Liu, Jinghua
    Chen, Baihua
    Tang, Jianeng
    INFORMATION SCIENCES, 2024, 671
  • [34] Embedded feature fusion for multi-view multi-label feature selection
    Hao, Pingting
    Gao, Wanfu
    Hu, Liang
    PATTERN RECOGNITION, 2025, 157
  • [35] A survey on multi-label feature selection from perspectives of label fusion
    Qian, Wenbin
    Huang, Jintao
    Xu, Fankang
    Shu, Wenhao
    Ding, Weiping
    INFORMATION FUSION, 2023, 100
  • [36] Multi-label feature selection by strongly relevant label gain and label mutual aid
    Dai, Jianhua
    Huang, Weiyi
    Zhang, Chucai
    Liu, Jie
    PATTERN RECOGNITION, 2024, 145
  • [37] Ensemble feature selection for multi-label text classification: An intelligent order statistics approach
    Miri, Mohsen
    Dowlatshahi, Mohammad Bagher
    Hashemi, Amin
    Rafsanjani, Marjan Kuchaki
    Gupta, Brij B.
    Alhalabi, W.
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 11319 - 11341
  • [38] Multi-label Feature Selection via Information Gain
    Li, Ling
    Liu, Huawen
    Ma, Zongjie
    Mo, Yuchang
    Duan, Zhengjie
    Zhou, Jiaqing
    Zhao, Jianmin
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2014, 2014, 8933 : 345 - 355
  • [39] Multi-Label Feature Selection using Correlation Information
    Braytee, Ali
    Liu, Wei
    Catchpoole, Daniel R.
    Kennedy, Paul J.
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1649 - 1656
  • [40] Multi-label feature selection via information gain
    Li, Ling
    Liu, Huawen
    Ma, Zongjie
    Mo, Yuchang
    Duan, Zhengjie
    Zhou, Jiaqing
    Zhao, Jianmin
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8933 : 345 - 355