Cost-sensitive classifier chains: Selecting low-cost features in multi-label classification

被引:19
|
作者
Teisseyre, Pawel [1 ]
Zufferey, Damien [2 ]
Slomka, Marta [3 ]
机构
[1] Polish Acad Sci, Inst Comp Sci, Jana Kazimierza 5, PL-01248 Warsaw, Poland
[2] BAO Syst, Washington, DC USA
[3] Polish Acad Sci, Mossakowski Med Res Ctr, Warsaw, Poland
关键词
Multi-label classification; Cost-sensitive feature selection; Classifier chains; Logistic regression; Stability; Generalization error bounds;
D O I
10.1016/j.patcog.2018.09.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is one of the trending challenges in multi-label classification. In recent years a lot of methods have been proposed. However the existing approaches assume that all the features have the same cost. This assumption may be inappropriate when the acquisition of the feature values is costly. For example in medical diagnosis each diagnostic value extracted by a clinical test is associated with its own cost. In such cases it may be better to choose a model with an acceptable classification performance but a much lower cost. We propose a novel method which incorporates the feature cost information into the learning process. The method, named Cost-Sensitive Classifier Chains, combines classifier chains and penalized logistic regression with a modified elastic-net penalty which takes into account costs of the features. We prove the stability and provide a bound on generalization error of our algorithm. We also propose the adaptive version in which penalty factors are changing during fitting the consecutive models in the chain. The methods are applied on real datasets: MIMIC-II and Hepatitis for which the cost information is provided by experts. Moreover, we propose an experimental framework in which the features are observed with measurement errors and the costs depend on the quality of the features. The framework allows to compare the cost-sensitive methods on benchmark datasets for which the cost information is not provided. The proposed method can be recommended in a situation when one wants to balance low costs and high prediction performance. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:290 / 319
页数:30
相关论文
共 50 条
  • [31] Speeding Up Classifier Chains in Multi-label Classification
    Moyano, Jose M.
    Gibaja, Eva L.
    Ventura, Sebastian
    Cano, Alberto
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS 2019), 2019, : 29 - 37
  • [32] A Novel classifier - Weighted Features Cost-sensitive SVM
    Ding, Cheng
    Wu, Min
    2016 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2016, : 598 - 603
  • [33] Cost-sensitive feature selection on multi-label data via neighborhood granularity and label enhancement
    Xuandong Long
    Wenbin Qian
    Yinglong Wang
    Wenhao Shu
    Applied Intelligence, 2021, 51 : 2210 - 2232
  • [34] Cyclic Classifier Chain for Cost-Sensitive Multilabel Classification
    Lin, Yi-An
    Lin, Hsuan-Tien
    2017 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2017, : 11 - 20
  • [35] Cost-sensitive feature selection on multi-label data via neighborhood granularity and label enhancement
    Long, Xuandong
    Qian, Wenbin
    Wang, Yinglong
    Shu, Wenhao
    APPLIED INTELLIGENCE, 2021, 51 (04) : 2210 - 2232
  • [36] Multi-label classification by polytree-augmented classifier chains with label-dependent features
    Sun, Lu
    Kudo, Mineichi
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (03) : 1029 - 1049
  • [37] Multi-label classification by polytree-augmented classifier chains with label-dependent features
    Lu Sun
    Mineichi Kudo
    Pattern Analysis and Applications, 2019, 22 : 1029 - 1049
  • [38] An Adaptive Cost-sensitive Classifier
    Chen, Xiaolin
    Song, Enming
    Ma, Guangzhi
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 1, 2010, : 699 - 701
  • [39] Selecting label-dependent features for multi-label classification
    Qiao, Lishan
    Zhang, Limei
    Sun, Zhonggui
    Liu, Xueyan
    NEUROCOMPUTING, 2017, 259 : 112 - 118
  • [40] A Deep Model with Local Surrogate Loss for General Cost-Sensitive Multi-label Learning
    Hsieh, Cheng-Yu
    Lin, Yi-An
    Lin, Hsuan-Tien
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3239 - 3246