Estimating a one -class naive Bayes text classifier

被引:6
|
作者
Zhang, Yihong [1 ]
Jatowt, Adam [2 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Dept Multimedia Engn, Osaka 5650871, Japan
[2] Kyoto Univ, Grad Sch Informat, Dept Social Informat, Kyoto 6068501, Japan
关键词
Machine learning; naive Bayes; one class classifier;
D O I
10.3233/IDA-194669
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays more and more information extraction projects need to classify large amounts of text data. The common way to classify text is to build a supervised classifier trained on human-labeled positive and negative examples. In many cases, however, it is easy to label positive examples, but hard to label negative examples. In this paper, we address the problem of building a one-class classifier when only the positive examples are labeled. Previous works on building one-class classifier mostly use positive examples and unlabeled data. In this paper, we show that a configurable one-class classifier such as one-class naive Bayes can be optimized by examining the clustering quality of the classification on target data. We propose to use existing and new quality scores for determining clustering quality of the classification. Experimental analysis with real-world data show that our approach generally achieves high classification accuracy, and in some cases improves the accuracy by more than 10% compared to state-of-art baselines. © 2020 - IOS Press and the authors. All rights reserved.
引用
收藏
页码:567 / 579
页数:13
相关论文
共 50 条
  • [1] Naive Bayes text classifier
    Zhang, Haiyi
    Li, Di
    GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711
  • [2] Class dependent feature scaling method using naive Bayes classifier for text datamining
    Youn, Eunseog
    Jeong, Myong K.
    PATTERN RECOGNITION LETTERS, 2009, 30 (05) : 477 - 485
  • [3] OCPAD: One class Naive Bayes classifier for payload based anomaly detection
    Swarnkar, Mayank
    Hubballi, Neminath
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 64 : 330 - 339
  • [4] Modifying Naive Bayes classifier for multinomial text classification
    1600, Institute of Electrical and Electronics Engineers Inc., United States
  • [5] Modifying Naive Bayes Classifier for Multinomial Text Classification
    Sharma, Neha
    Singh, Manoj
    2016 INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2016,
  • [6] Removing smoothing from naive Bayes text classifier
    Zhu, WB
    Lin, YP
    Lin, M
    Chen, ZP
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 713 - 718
  • [7] One generalization of the naive Bayes to fuzzy sets and the design of the fuzzy naive Bayes classifier
    Zheng, JC
    Tang, YC
    ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING APPLICATIONS: A BIOINSPIRED APPROACH, PT 2, PROCEEDINGS, 2005, 3562 : 281 - 290
  • [8] A naive Bayes classifier for identifying Class II YSOs
    Wilson, Andrew J.
    Lakeland, Ben S.
    Wilson, Tom J.
    Naylor, Tim
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2023, 521 (01) : 354 - 388
  • [9] Complement-Class Harmonized Naive Bayes Classifier
    Alenazi, Fahad S.
    El Hindi, Khalil
    AsSadhan, Basil
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [10] Improving Naive Bayes text classifier with modified EM algorithm
    Kim, HJ
    Chang, JY
    FOUNDATIONS OF INTELLIGENT SYSTEMS, 2003, 2871 : 326 - 333