Three-way selection random forest algorithm based on decision boundary entropy

被引:12
|
作者
Zhang, Chunying [1 ,2 ]
Ren, Jing [1 ]
Liu, Fengchun [3 ]
Li, Xiaoqi [1 ]
Liu, Shouyue [1 ]
机构
[1] North China Univ Sci & Technol, Coll Sci, Tangshan 063210, Hebei, Peoples R China
[2] Key Lab Data Sci & Applicat Hebei Prov, Tangshan 063210, Hebei, Peoples R China
[3] North China Univ Sci & Technol, Coll Qianan, Tangshan 063210, Hebei, Peoples R China
关键词
Random Forest; Attribute Selection; Decision Boundary Entropy; Significance of Attribute; Three-way Decision; ATTRIBUTES;
D O I
10.1007/s10489-021-03033-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at the problem of high probability of negative impact about redundant attributes in random forest algorithms, a Three-way Selection Random Forest algorithm based on decision boundary entropy (TSRF) is proposed without losing randomness and reducing the influence of redundant attributes on decision-making results. According to the characteristics of the attribute, the concept of decision boundary entropy is defined. Then a measuring method of attribute importance based on decision boundary entropy is proposed and set as an evaluation standard. Three-way decision is constructed and the attribute is divided into three candidate domains, namely positive domain, negative domain and boundary domain. In order to ensure the randomness of attributes, three-way attribute random selection rules based on attribute randomness are established and a certain number of attributes are randomly selected from the three candidate domains. Combine the samples selected by the bootstrap sampling method with attribute sets selected by three-way decision to produce training sample sets so that we can train the decision trees and generate forest. Six datasets are selected for the experiment. Two parameters of attribute randomness and three-way decision thresholds are analyzed to verify the theoretical conclusions respectively. The results show that the TSRF algorithm can meet the different requirements of different data sets by adjusting the parameters. The classification effect on the binary data is basically the same as the comparison algorithm, but TSRF has a significant improvement effect on the multi-class data compared with other algorithms. The proposed TSRF algorithm widens the idea for the measurement method of significance of attribute, innovates the random forest three-way selection integration method, and provides a better model framework for solving multi-classification problems.
引用
收藏
页码:13384 / 13397
页数:14
相关论文
共 50 条
  • [41] A TOPSIS method based on sequential three-way decision
    Jin Qian
    Taotao Wang
    Haoying Jiang
    Ying Yu
    Duoqian Miao
    Applied Intelligence, 2023, 53 : 30661 - 30676
  • [42] A Novel Three-way Decision Based on Linguistic Evaluation
    Liu, Shuli
    Liu, Xinwang
    2015 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2015), 2015,
  • [43] Three-way decision spaces based on partially ordered sets and three-way decisions based on hesitant fuzzy sets
    Hu, Bao Qing
    KNOWLEDGE-BASED SYSTEMS, 2016, 91 : 16 - 31
  • [44] A Three-Way Decision Model Based on Intuitionistic Fuzzy Decision Systems
    Liu, Jiubing
    Zhou, Xianzhong
    Huang, Bing
    Li, Huaxiong
    ROUGH SETS, IJCRS 2017, PT II, 2017, 10314 : 249 - 263
  • [45] Three-way decisions based on bipolar-valued fuzzy sets over three-way decision spaces
    Hu, Bao Qing
    INFORMATION SCIENCES, 2024, 656
  • [46] Optimal scale selection and attribute reduction in multi-scale decision tables based on three-way decision
    Cheng, Yunlong
    Zhang, Qinghua
    Wang, Guoyin
    Hu, Bao Qing
    INFORMATION SCIENCES, 2020, 541 : 36 - 59
  • [47] A Novel TODIM Method-Based Three-Way Decision Model for Medical Treatment Selection
    Hu, Junhua
    Yang, Yao
    Chen, Xiaohong
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2018, 20 (04) : 1240 - 1255
  • [48] A Novel TODIM Method-Based Three-Way Decision Model for Medical Treatment Selection
    Junhua Hu
    Yao Yang
    Xiaohong Chen
    International Journal of Fuzzy Systems, 2018, 20 : 1240 - 1255
  • [49] Three-way decision on two universes
    Li, Xiaonan
    Sun, Qianqian
    Chen, Hongmei
    Yi, Huangjian
    INFORMATION SCIENCES, 2020, 515 : 263 - 279
  • [50] Three-way decision and granular computing
    Yao, Yiyu
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 103 : 107 - 123