Estimating a one -class naive Bayes text classifier

被引:6
|
作者
Zhang, Yihong [1 ]
Jatowt, Adam [2 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, Dept Multimedia Engn, Osaka 5650871, Japan
[2] Kyoto Univ, Grad Sch Informat, Dept Social Informat, Kyoto 6068501, Japan
关键词
Machine learning; naive Bayes; one class classifier;
D O I
10.3233/IDA-194669
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays more and more information extraction projects need to classify large amounts of text data. The common way to classify text is to build a supervised classifier trained on human-labeled positive and negative examples. In many cases, however, it is easy to label positive examples, but hard to label negative examples. In this paper, we address the problem of building a one-class classifier when only the positive examples are labeled. Previous works on building one-class classifier mostly use positive examples and unlabeled data. In this paper, we show that a configurable one-class classifier such as one-class naive Bayes can be optimized by examining the clustering quality of the classification on target data. We propose to use existing and new quality scores for determining clustering quality of the classification. Experimental analysis with real-world data show that our approach generally achieves high classification accuracy, and in some cases improves the accuracy by more than 10% compared to state-of-art baselines. © 2020 - IOS Press and the authors. All rights reserved.
引用
收藏
页码:567 / 579
页数:13
相关论文
共 50 条
  • [21] Emotion Detection from Bangla Text Corpus Using Naive Bayes Classifier
    Azmin, Sara
    Dhar, Kingshuk
    2019 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2019,
  • [22] Improving a SVM Meta-classifier for Text Documents by using Naive Bayes
    Morariu, D.
    Cretulescu, R.
    Vintan, L.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2010, 5 (03) : 351 - 361
  • [23] Adapting Naive Bayes Model for Text Classification with One-of and Imbalanced Multi-Class Problems
    Almaleh, Ahood
    Aslam, Muhammad Ahtisham
    Saeedi, Kawther
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (09): : 84 - 90
  • [24] A FUZZY EXPONENTIAL NAIVE BAYES CLASSIFIER
    Moraes, R. M.
    Machado, L. S.
    UNCERTAINTY MODELLING IN KNOWLEDGE ENGINEERING AND DECISION MAKING, 2016, 10 : 207 - 212
  • [25] A Fuzzy Gamma Naive Bayes classifier
    de Moraes, Ronei Marcos
    de Melo Gomes Soares, Elaine Anita
    Machado, Liliane dos Santos
    DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 691 - 699
  • [26] Learning an optimal naive Bayes classifier
    Martinez-Arroyo, Miriam
    Sucar, L. Enrique
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 1236 - +
  • [27] The naive Bayes classifier for functional data
    Zhang, Yi-Chen
    Sakhanenko, Lyudmila
    STATISTICS & PROBABILITY LETTERS, 2019, 152 : 137 - 146
  • [28] Attribute Weighted Naive Bayes Classifier
    Foo, Lee-Kien
    Chua, Sook-Ling
    Ibrahim, Neveen
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 1945 - 1957
  • [29] Classification and Optimization Scheme for Text Data using Machine Learning Naive Bayes Classifier
    Venkatesh
    Ranjitha, K., V
    PROCEEDINGS OF 2018 IEEE WORLD SYMPOSIUM ON COMMUNICATION ENGINEERING (WSCE), 2018, : 33 - 36
  • [30] Robust approach for estimating probabilities in Naive-Bayes Classifier for gene expression data
    Chandra, B.
    Gupta, Manish
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 1293 - 1298