Improving Multiclass Text Classification with Error-Correcting Output Coding and Sub-class Partitions

Cited by: 0
Authors
Li, Baoli [1]
Vogel, Carl [1]
Affiliations
[1] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin, Ireland
Keywords
Text Classification; Error Correcting Output Coding; Binary Classification; ECOC
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Error-Correcting Output Coding (ECOC) is a general framework for multiclass text classification with a set of binary classifiers. It not only lets a binary classifier solve multi-class classification problems, but can also boost the performance of a multi-class classifier. When building each individual binary classifier in ECOC, the classes are randomly partitioned into two disjoint groups: positive and negative. However, training such a binary classifier neglects the sub-class distribution within the positive and negative groups, and using this information is expected to improve the binary classifier. We therefore design a simple binary classification strategy via multi-class categorization (2vM) that makes use of sub-class partition information and can lead to better performance than traditional binary classification. The proposed binary classification strategy is then applied to enhance ECOC. Experiments on document categorization and question classification show its effectiveness.
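The 2vM idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the nearest-centroid learner, the toy two-dimensional features, and all function names (`random_partition`, `train_centroids`, `predict_subclass`, `predict_2vm`) are assumptions made for illustration. The sketch randomly splits the classes into a positive and a negative group, as one ECOC binary problem would, then answers the binary question by first predicting the original sub-class and mapping it to its group.

```python
import random
from collections import defaultdict

def random_partition(classes, rng):
    """Randomly split classes into two non-empty disjoint groups (+1 / -1)."""
    classes = list(classes)
    while True:
        bits = {c: rng.choice([+1, -1]) for c in classes}
        if len(set(bits.values())) == 2:  # both groups must be non-empty
            return bits

def train_centroids(X, y):
    """Toy multi-class learner (an assumption): one mean vector per sub-class."""
    sums, counts = {}, defaultdict(int)
    for x, label in zip(X, y):
        if label not in sums:
            sums[label] = [0.0] * len(x)
        sums[label] = [s + v for s, v in zip(sums[label], x)]
        counts[label] += 1
    return {c: [s / counts[c] for s in sums[c]] for c in sums}

def predict_subclass(centroids, x):
    """Return the sub-class whose centroid is closest to x."""
    def sqdist(c):
        return sum((a - b) ** 2 for a, b in zip(centroids[c], x))
    return min(centroids, key=sqdist)

def predict_2vm(centroids, partition, x):
    """Binary decision via multi-class categorization:
    predict the sub-class, then report the group it belongs to."""
    return partition[predict_subclass(centroids, x)]

# Hypothetical toy data: three classes with 2-D feature vectors.
X = [[0, 0], [0, 1], [5, 5], [5, 6], [10, 0], [10, 1]]
y = ["sports", "sports", "politics", "politics", "science", "science"]

partition = random_partition(sorted(set(y)), random.Random(0))
centroids = train_centroids(X, y)
binary_predictions = [predict_2vm(centroids, partition, x) for x in X]
```

A conventional binary classifier would be trained directly on the merged +1 / -1 labels. The sketch instead keeps the sub-class labels during training, which is the information the abstract says is otherwise neglected.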
Pages: 4-15
Page count: 12