Measuring Difficulty of Learning Using Ensemble Methods

被引:0
|
作者
Chen, Bowen [1 ]
Koh, Yun Sing [1 ]
Halstead, Ben [1 ]
机构
[1] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
来源
DATA MINING, AUSDM 2022 | 2022年 / 1741卷
关键词
Complexity measures; Boosting; Instance difficulty; CLASSIFICATION PROBLEMS; COMPLEXITY-MEASURES;
D O I
10.1007/978-981-19-8746-5_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Measuring the difficulty of each instance is a crucial metaknowledge extraction problem. Most studies on data complexity have focused on extracting the characteristics at a dataset level instead of the instance level while also requiring the complete label knowledge of the dataset, which can often be expensive to obtain. At the instance level, the most commonly used metrics to determine difficult to classify instances are dependant on the learning algorithm used (i.e., uncertainty), and are measurements of the entire system instead of only the dataset. Additionally, these metrics only provide information of misclassification in regard to the learning algorithm and not in respect of the composition of the instances within the dataset. We introduce and propose several novel instance difficulty measures in a semi-supervised boosted ensemble setting to identify difficult to classify instances based on their learning difficulty in relation to other instances within the dataset. The proposed difficulty measures measure both the fluctuations in labeling during the construction process of the ensemble and the amount of resources required for the correct label. This provides the degree of difficulty and gives further insight into the origin of classification difficulty at the instance level reflected by the scores of different difficulty measures.
引用
收藏
页码:28 / 42
页数:15
相关论文
共 50 条
  • [31] Testing a global null hypothesis using ensemble machine learning methods
    Han, Sunwoo
    Fong, Youyi
    Huang, Ying
    STATISTICS IN MEDICINE, 2022, 41 (13) : 2417 - 2426
  • [32] Comparison among Methods of Ensemble Learning
    Wan, Shaohua
    Yang, Hua
    2013 INTERNATIONAL SYMPOSIUM ON BIOMETRICS AND SECURITY TECHNOLOGIES (ISBAST), 2013, : 286 - 290
  • [33] Network representation learning with ensemble methods
    Zhang, Boyu
    Xiang, Ji
    Wang, Xin
    NEUROCOMPUTING, 2020, 380 : 141 - 149
  • [34] Research and Application on Ensemble Learning Methods
    Wang, Yuzhong
    PROCEEDINGS OF 2019 CHINESE INTELLIGENT AUTOMATION CONFERENCE, 2020, 586 : 145 - 155
  • [35] Ensemble Methods for Cooperative Robotic Learning
    Tolmidis, Avraam Th.
    Petrou, Loukas
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2017, 32 (05) : 502 - 525
  • [36] Ensemble Learning Methods for Dirty Data
    Liu, Ling
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 2 - 2
  • [37] Ensemble Learning Methods: An Empirical Study
    Upasana Sarmah
    Parthajit Borah
    Dhruba Kumar Bhattacharyya
    SN Computer Science, 5 (7)
  • [38] Hybrid and Ensemble Methods in Machine Learning
    Kazienko, Przemyslaw
    Lughofer, Edwin
    Trawinski, Bogdan
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2013, 19 (04) : 457 - 461
  • [39] Measuring the Stability of Feature Selection with Applications to Ensemble Methods
    Nogueira, Sarah
    Brown, Gavin
    MULTIPLE CLASSIFIER SYSTEMS (MCS 2015), 2015, 9132 : 135 - 146
  • [40] Measuring the Big Five Factors from Handwriting Using Ensemble Learning Model AvgMlSC
    Garoot, Afnan
    Suen, Ching Y.
    INTERTWINING GRAPHONOMICS WITH HUMAN MOVEMENTS, IGS 2021, 2022, 13424 : 159 - 173