Measuring Difficulty of Learning Using Ensemble Methods

被引：0

作者：

Chen, Bowen ^{[1
]}

Koh, Yun Sing ^{[1
]}

Halstead, Ben ^{[1
]}

机构：

[1] Univ Auckland, Sch Comp Sci, Auckland, New Zealand

来源：

DATA MINING, AUSDM 2022 | 2022年 / 1741卷

关键词：

Complexity measures; Boosting; Instance difficulty; CLASSIFICATION PROBLEMS; COMPLEXITY-MEASURES;

D O I：

10.1007/978-981-19-8746-5_3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Measuring the difficulty of each instance is a crucial metaknowledge extraction problem. Most studies on data complexity have focused on extracting the characteristics at a dataset level instead of the instance level while also requiring the complete label knowledge of the dataset, which can often be expensive to obtain. At the instance level, the most commonly used metrics to determine difficult to classify instances are dependant on the learning algorithm used (i.e., uncertainty), and are measurements of the entire system instead of only the dataset. Additionally, these metrics only provide information of misclassification in regard to the learning algorithm and not in respect of the composition of the instances within the dataset. We introduce and propose several novel instance difficulty measures in a semi-supervised boosted ensemble setting to identify difficult to classify instances based on their learning difficulty in relation to other instances within the dataset. The proposed difficulty measures measure both the fluctuations in labeling during the construction process of the ensemble and the amount of resources required for the correct label. This provides the degree of difficulty and gives further insight into the origin of classification difficulty at the instance level reflected by the scores of different difficulty measures.

引用

页码：28 / 42

页数：15

共 50 条

[31] Testing a global null hypothesis using ensemble machine learning methods
Han, Sunwoo
Fong, Youyi
Huang, Ying
STATISTICS IN MEDICINE, 2022, 41 (13) : 2417 - 2426
[32] Comparison among Methods of Ensemble Learning
Wan, Shaohua
Yang, Hua
2013 INTERNATIONAL SYMPOSIUM ON BIOMETRICS AND SECURITY TECHNOLOGIES (ISBAST), 2013, : 286 - 290
[33] Network representation learning with ensemble methods
Zhang, Boyu
Xiang, Ji
Wang, Xin
NEUROCOMPUTING, 2020, 380 : 141 - 149
[34] Research and Application on Ensemble Learning Methods
Wang, Yuzhong
PROCEEDINGS OF 2019 CHINESE INTELLIGENT AUTOMATION CONFERENCE, 2020, 586 : 145 - 155
[35] Ensemble Methods for Cooperative Robotic Learning
Tolmidis, Avraam Th.
Petrou, Loukas
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2017, 32 (05) : 502 - 525
[36] Ensemble Learning Methods for Dirty Data
Liu, Ling
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 2 - 2
[37] Ensemble Learning Methods: An Empirical Study
Upasana Sarmah
Parthajit Borah
Dhruba Kumar Bhattacharyya
SN Computer Science, 5 (7)
[38] Hybrid and Ensemble Methods in Machine Learning
Kazienko, Przemyslaw
Lughofer, Edwin
Trawinski, Bogdan
JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2013, 19 (04) : 457 - 461
[39] Measuring the Stability of Feature Selection with Applications to Ensemble Methods
Nogueira, Sarah
Brown, Gavin
MULTIPLE CLASSIFIER SYSTEMS (MCS 2015), 2015, 9132 : 135 - 146
[40] Measuring the Big Five Factors from Handwriting Using Ensemble Learning Model AvgMlSC
Garoot, Afnan
Suen, Ching Y.
INTERTWINING GRAPHONOMICS WITH HUMAN MOVEMENTS, IGS 2021, 2022, 13424 : 159 - 173

← 1 2 3 4 5 →