Granular-conditional-entropy-based attribute reduction for partially labeled data with proxy labels

被引:32
|
作者
Gao, Can [1 ,2 ]
Zhou, Jie [1 ,2 ]
Miao, Duoqian [3 ]
Yue, Xiaodong [4 ]
Wan, Jun [1 ,2 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, SZU Branch, Shenzhen 518060, Peoples R China
[3] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[4] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
基金
中国国家自然科学基金;
关键词
Rough sets; Semi-supervised attribute reduction; Conditional entropy; Information granularity; Proxy label; SUPERVISED FEATURE-SELECTION; ROUGH SET-THEORY; INFORMATION FUSION; DECISION;
D O I
10.1016/j.ins.2021.08.067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Attribute reduction is attracting considerable attention in the theory of rough sets, and thus many rough-set-based attribute reduction methods have been presented. However, most of them are specifically designed for either labeled or unlabeled data, whereas many real-world applications involve partial supervision. In this paper, we propose a rough-set based semi-supervised attribute reduction method for partially labeled data. Specifically, using prior class-distribution information, we first develop a simple yet effective strategy to produce proxy labels for unlabeled data. Then, the concept of information granularity is integrated into an information-theoretic measure, based on which, a novel granular conditional entropy measure is proposed, and its monotonicity is theoretically proved. Furthermore, a fast heuristic algorithm is provided to generate the optimal reduct of partially labeled data, which could accelerate the process of attribute reduction by removing irrelevant examples and simultaneously excluding redundant attributes. Extensive experiments conducted on UCI data sets demonstrate that the proposed semi-supervised attribute reduction method is promising and, in terms of classification performance, it even compares favorably with supervised methods on labeled and unlabeled data with true labels (Our code and experimental data are released at Mendeley Data https://doi.org/10. 17632/v3byhx2v8s.1). (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:111 / 128
页数:18
相关论文
共 50 条
  • [1] Attribute reduction for partially labeled data based on hypergraph models
    Xie, Xiaojun
    Qin, Xiaolin
    Huang, Guangmei
    Zhao, Wei
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1434 - 1439
  • [2] Co-training Based Attribute Reduction for Partially Labeled Data
    Zhang, Wei
    Miao, Duoqian
    Gao, Can
    Yue, Xiaodong
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, RSKT 2014, 2014, 8818 : 77 - 88
  • [3] Neighborhood attribute reduction approach to partially labeled data
    Keyu Liu
    Eric C. C. Tsang
    Jingjing Song
    Hualong Yu
    Xiangjian Chen
    Xibei Yang
    Granular Computing, 2020, 5 : 239 - 250
  • [4] Neighborhood attribute reduction approach to partially labeled data
    Liu, Keyu
    Tsang, Eric C. C.
    Song, Jingjing
    Yu, Hualong
    Chen, Xiangjian
    Yang, Xibei
    GRANULAR COMPUTING, 2020, 5 (02) : 239 - 250
  • [5] Outlier detection for partially labeled categorical data based on conditional information entropy
    Zhao, Zhengwei
    Wang, Rongrong
    Huang, Dan
    Li, Zhaowen
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2024, 164
  • [6] An Attribute Reduction Algorithm Based on Conditional Entropy and Frequency of Attributes
    Wang, Cuiru
    Ou, Fangfang
    INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL 1, PROCEEDINGS, 2008, : 752 - 756
  • [7] Semi-supervised attribute reduction for partially labeled categorical data based on predicted label
    Huang, Dan
    Zhang, Qinli
    Li, Zhaowen
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 154 : 242 - 261
  • [8] Attribute reduction via local conditional entropy
    Yibo Wang
    Xiangjian Chen
    Kai Dong
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 3619 - 3634
  • [9] Attribute reduction via local conditional entropy
    Wang, Yibo
    Chen, Xiangjian
    Dong, Kai
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (12) : 3619 - 3634
  • [10] Incremental attribute reduction algorithm based on neighborhood granulation conditional entropy
    Zhao X.-L.
    Yang Y.
    Kongzhi yu Juece/Control and Decision, 2019, 34 (10): : 2061 - 2072