Three-way selection random forest algorithm based on decision boundary entropy

被引:12
|
作者
Zhang, Chunying [1 ,2 ]
Ren, Jing [1 ]
Liu, Fengchun [3 ]
Li, Xiaoqi [1 ]
Liu, Shouyue [1 ]
机构
[1] North China Univ Sci & Technol, Coll Sci, Tangshan 063210, Hebei, Peoples R China
[2] Key Lab Data Sci & Applicat Hebei Prov, Tangshan 063210, Hebei, Peoples R China
[3] North China Univ Sci & Technol, Coll Qianan, Tangshan 063210, Hebei, Peoples R China
关键词
Random Forest; Attribute Selection; Decision Boundary Entropy; Significance of Attribute; Three-way Decision; ATTRIBUTES;
D O I
10.1007/s10489-021-03033-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at the problem of high probability of negative impact about redundant attributes in random forest algorithms, a Three-way Selection Random Forest algorithm based on decision boundary entropy (TSRF) is proposed without losing randomness and reducing the influence of redundant attributes on decision-making results. According to the characteristics of the attribute, the concept of decision boundary entropy is defined. Then a measuring method of attribute importance based on decision boundary entropy is proposed and set as an evaluation standard. Three-way decision is constructed and the attribute is divided into three candidate domains, namely positive domain, negative domain and boundary domain. In order to ensure the randomness of attributes, three-way attribute random selection rules based on attribute randomness are established and a certain number of attributes are randomly selected from the three candidate domains. Combine the samples selected by the bootstrap sampling method with attribute sets selected by three-way decision to produce training sample sets so that we can train the decision trees and generate forest. Six datasets are selected for the experiment. Two parameters of attribute randomness and three-way decision thresholds are analyzed to verify the theoretical conclusions respectively. The results show that the TSRF algorithm can meet the different requirements of different data sets by adjusting the parameters. The classification effect on the binary data is basically the same as the comparison algorithm, but TSRF has a significant improvement effect on the multi-class data compared with other algorithms. The proposed TSRF algorithm widens the idea for the measurement method of significance of attribute, innovates the random forest three-way selection integration method, and provides a better model framework for solving multi-classification problems.
引用
收藏
页码:13384 / 13397
页数:14
相关论文
共 50 条
  • [21] A novel attribute reduction algorithm based on granular sequential three-way decision
    Chen, Yuliang
    Cheng, Yunlong
    Luo, Binbin
    Shao, Yabin
    Zhao, Mingfu
    Zhang, Qinghua
    INFORMATION SCIENCES, 2025, 694
  • [22] Intuitionistic Fuzzy Three-Way Decision Model Based on the Three-Way Granular Computing Method
    Xin, Xianwei
    Song, Jihua
    Peng, Weiming
    SYMMETRY-BASEL, 2020, 12 (07):
  • [23] An extended three-way decision and its application in member selection
    Liu, Shuli
    Liu, Xinwang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 28 (05) : 2095 - 2106
  • [24] The geometry of three-way decision
    Yao, Yiyu
    APPLIED INTELLIGENCE, 2021, 51 (09) : 6298 - 6325
  • [25] The geometry of three-way decision
    Yiyu Yao
    Applied Intelligence, 2021, 51 : 6298 - 6325
  • [26] Three-way Learnability: A Learning Theoretic Perspective on Three-way Decision
    Campagner, Andrea
    Ciucci, Davide
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 243 - 246
  • [27] Three-way decisions based on semi-three-way decision spaces
    Hu, Bao Qing
    INFORMATION SCIENCES, 2017, 382 : 415 - 440
  • [28] Three-way decision based participants selection optimization model in sparse mobile crowdsensing
    Wang, Jian
    Zhao, Guosheng
    Ge, Huijie
    INFORMATION SCIENCES, 2023, 645
  • [29] Optimal scale combination selection for multi-scale decision tables based on three-way decision
    Cheng, Yunlong
    Zhang, Qinghua
    Wang, Guoyin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (02) : 281 - 301
  • [30] Optimal scale combination selection for multi-scale decision tables based on three-way decision
    Yunlong Cheng
    Qinghua Zhang
    Guoyin Wang
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 281 - 301