Mutual information-based label distribution feature selection for multi-label learning

Cited by: 52
Authors
Qian, Wenbin [1, 2]
Huang, Jintao [3]
Wang, Yinglong [3]
Shu, Wenhao [4]
Affiliations
[1] Jiangxi Agr Univ, Sch Software, Nanchang 330045, Jiangxi, Peoples R China
[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China
[3] Jiangxi Agr Univ, Sch Comp & Informat Engn, Nanchang 330045, Jiangxi, Peoples R China
[4] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Jiangxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature selection; Multi-label data; Granular computing; Label enhancement; Mutual information; STREAMING FEATURE-SELECTION; ATTRIBUTE REDUCTION; MISSING LABELS; CLASSIFICATION; GRAPH; ACCELERATOR; ALGORITHM; DECISION; SPARSE;
DOI
10.1016/j.knosys.2020.105684
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature selection used for dimensionality reduction of the feature space plays an important role in multi-label learning where high-dimensional data are involved. Although most existing multi-label feature selection approaches can deal with the problem of label ambiguity which mainly focuses on the assumption of uniform distribution with logical labels, it cannot be applied to many practical applications where the significance of related label for every instance tends to be different. To deal with this issue, in this study, label distribution learning covered with a certain real number of labels is introduced to design a model for the labeling-significance. Nevertheless, multi-label feature selection is limited to handling only labels consisting of logical relations. In order to solve this problem, combining the random variable distribution with granular computing, we first propose a label enhancement algorithm to transform logical labels in multi-label data into label distribution with more supervised information, which can mine the hidden label significance from every instance. On this basis, to remove some redundant or irrelevant features in multi-label data, a label distribution feature selection algorithm using mutual information and label enhancement is developed. Finally, the experimental results show that the performance of the proposed method is superior to the other state-of-the-art approaches when dealing with multi-label data. (C) 2020 Elsevier B.V. All rights reserved.
Pages: 24
Related papers
50 in total
  • [21] Feature Selection for Multi-Label Learning
    Spolaor, Newton
    Monard, Maria Carolina
    Lee, Huei Diana
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 4401 - 4402
  • [22] Feature selection for multi-label classification using multivariate mutual information
    Lee, Jaesung
    Kim, Dae-Won
    PATTERN RECOGNITION LETTERS, 2013, 34 (03) : 349 - 357
  • [23] Multi-label Online Streaming Feature Selection Based on Spectral Granulation and Mutual Information
    Wang, Huaming
    Yu, Dongming
    Li, Yuan
    Li, Zhixing
    Wang, Guoyin
    ROUGH SETS, IJCRS 2018, 2018, 11103 : 215 - 228
  • [24] MFC: Initialization method for multi-label feature selection based on conditional mutual information
    Lim, Hyunki
    Kim, Dae-Won
    NEUROCOMPUTING, 2020, 382 : 40 - 51
  • [25] Mutual information based multi-label feature selection via constrained convex optimization
    Sun, Zhenqiang
    Zhang, Jia
    Dai, Liang
    Li, Candong
    Zhou, Changen
    Xin, Jiliang
    Li, Shaozi
    NEUROCOMPUTING, 2019, 329 : 447 - 456
  • [26] Label relaxation and shared information for multi-label feature selection
    Fan, Yuling
    Chen, Xu
    Luo, Shimu
    Liu, Peizhong
    Liu, Jinghua
    Chen, Baihua
    Tang, Jianeng
    INFORMATION SCIENCES, 2024, 671
  • [27] Alignment Based Feature Selection for Multi-label Learning
    Linlin Chen
    Degang Chen
    Neural Processing Letters, 2019, 50 : 2323 - 2344
  • [28] Alignment Based Feature Selection for Multi-label Learning
    Chen, Linlin
    Chen, Degang
    NEURAL PROCESSING LETTERS, 2019, 50 (03) : 2323 - 2344
  • [29] Multi-label feature selection by strongly relevant label gain and label mutual aid
    Dai, Jianhua
    Huang, Weiyi
    Zhang, Chucai
    Liu, Jie
    PATTERN RECOGNITION, 2024, 145
  • [30] Multi-label feature selection based on label distribution and neighborhood rough set
    Liu, Jinghua
    Lin, Yaojin
    Ding, Weiping
    Zhang, Hongbo
    Wang, Cheng
    Du, Jixiang
    NEUROCOMPUTING, 2023, 524 : 142 - 157