Estimating a one-class naive Bayes text classifier

Times cited: 6
Authors
Zhang, Yihong [1]
Jatowt, Adam [2]
Affiliations
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Dept Multimedia Engn, Osaka 5650871, Japan
[2] Kyoto Univ, Grad Sch Informat, Dept Social Informat, Kyoto 6068501, Japan
Keywords
Machine learning; naive Bayes; one-class classifier
DOI
10.3233/IDA-194669
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A growing number of information extraction projects need to classify large amounts of text data. The common way to classify text is to build a supervised classifier trained on human-labeled positive and negative examples. In many cases, however, it is easy to label positive examples but hard to label negative examples. In this paper, we address the problem of building a one-class classifier when only the positive examples are labeled. Previous work on building one-class classifiers mostly uses positive examples and unlabeled data. We show that a configurable one-class classifier such as one-class naive Bayes can be optimized by examining the clustering quality of the classification on target data. We propose to use existing and new quality scores for determining the clustering quality of the classification. Experimental analysis with real-world data shows that our approach generally achieves high classification accuracy, and in some cases improves the accuracy by more than 10% compared to state-of-the-art baselines. © 2020 - IOS Press and the authors. All rights reserved.
Pages: 567-579
Number of pages: 13