Using Credal-C4.5 with Binary Relevance for Multi-Label Classification

被引:7
作者
Moral-Garcia, Serafin [1 ]
Mantas, Carlos J. [1 ]
Castellano, Javier G. [1 ]
Abellan, Joaquin [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada, Spain
关键词
Multi-label classification; Binary Relevance; Credal C4.5; C4.5; imprecise probabilities; IMPRECISE PROBABILITIES; DECISION TREES; CLASSIFIERS; PREDICTION; ENSEMBLES;
D O I
10.3233/JIFS-18746
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Binary Relevance (BR) is a simple and direct approach to the Multi-Label Classification (MLC). It decomposes the multi-label problem into several binary problems, one per label. It uses an algorithm of traditional supervised classification in order to solve these binary problems. On the other hand, Credal C4.5 (CC4.5) is a modification of the classical C4.5. CC4.5 estimates the probability of the class variable by using imprecise probabilities. In the literature, this new classification algorithm has obtained better results than C4.5 when both have been applied on datasets with class noise. In MLC, since there are not just a class, but multiple labels are disposed, it is more probable that there is intrinsic noise than in traditional classification. From the previous reasons, in this work it is studied the performance of BR using Credal C4.5 as base classifier versus BR with C4.5. It is carried out an experimental study with several muti-label datasets and a considerable number of measures for MLC. This study shows that the performance of BR is improved when it uses CC4.5 as base classifier versus BR with C4.5. In consequence, it is probably suitable to apply imprecise probabilities in Decision Trees within the MLC field too.
引用
收藏
页码:6501 / 6512
页数:12
相关论文
共 28 条
[2]   Building classification trees using the total uncertainty criterion [J].
Abellán, J ;
Moral, S .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2003, 18 (12) :1215-1225
[3]   Improving experimental studies about ensembles of classifiers for bankruptcy prediction and credit scoring [J].
Abellan, Joaquin ;
Mantas, Carlos J. .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (08) :3825-3830
[5]  
Abellán J, 2009, LECT NOTES COMPUT SC, V5590, P446, DOI 10.1007/978-3-642-02906-6_39
[6]  
Alves RobertoTeixeira., 2010, Fuzzy Systems (FUZZ), 2010 IEEE International Conference on, P1, DOI DOI 10.1109/FUZZY.2010.5584298
[7]  
[Anonymous], 2005, Uncertainty and information: Foundations of generalized information theory
[8]   Hierarchical multi-label prediction of gene function [J].
Barutcuoglu, Z ;
Schapire, RE ;
Troyanskaya, OG .
BIOINFORMATICS, 2006, 22 (07) :830-836
[9]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[10]   Tips, guidelines and tools for managing multi-label datasets: The mldr.datasets R package and the Cometa data repository [J].
Charte, Francisco ;
Rivera, Antonio J. ;
Charte, David ;
del Jesus, Mara J. ;
Herrera, Francisco .
NEUROCOMPUTING, 2018, 289 :68-85