Adapting Naive Bayes Model for Text Classification with One-of and Imbalanced Multi-Class Problems

Authors
Almaleh, Ahood [1]
Aslam, Muhammad Ahtisham [1]
Saeedi, Kawther [1]
Affiliations
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah 21589, Saudi Arabia
Keywords
text classification; multi-class problems; text mining; machine learning
DOI
10.22937/IJCSNS.2020.20.09.11
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Text classification, an area of growing interest to research communities, assigns a text or part of a text to classes in order to extract useful information. Manual classification is expensive to scale and becomes increasingly unreliable as the number of documents grows, especially when there are more than two classes (multiclass classification). Automatic classification uses algorithms to categorize text documents into classes, which makes classification more reliable and efficient. This paper describes a process for using the Naive Bayes classifier for one-of, multiclass text classification, especially in cases where imbalanced classes are likely. The proposed process consists of several steps: data preprocessing, building the classification model, evaluating it, and predicting classes as the final classification result.
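The steps named in the abstract (preprocessing, model building, evaluation, prediction) can be illustrated with a minimal sketch. It assumes a plain multinomial Naive Bayes with Laplace smoothing and a one-of (single best class) decision; this record does not describe the paper's actual adaptation for imbalanced classes. The function names, the toy corpus, and the choice of macro-averaged recall as an imbalance-aware metric are all hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Preprocessing step: lowercase the text and keep alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def train(docs, labels, alpha=1.0):
    """Model building: multinomial Naive Bayes with Laplace smoothing (alpha)."""
    class_docs = Counter(labels)
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in zip(docs, labels):
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        vocab.update(tokens)
    model = {"log_prior": {}, "log_like": {}}
    for c, n_docs in class_docs.items():
        total = sum(word_counts[c].values()) + alpha * len(vocab)
        model["log_prior"][c] = math.log(n_docs / len(docs))
        model["log_like"][c] = {
            w: math.log((word_counts[c][w] + alpha) / total) for w in vocab
        }
    return model


def predict(model, text):
    """One-of decision: return the single class with the highest log-posterior.

    Tokens outside the training vocabulary are ignored.
    """
    tokens = tokenize(text)
    scores = {}
    for c, prior in model["log_prior"].items():
        like = model["log_like"][c]
        scores[c] = prior + sum(like[t] for t in tokens if t in like)
    return max(scores, key=scores.get)


def macro_recall(model, docs, labels):
    """Evaluation: average per-class recall, so rare classes count equally."""
    hits, totals = Counter(), Counter(labels)
    for text, label in zip(docs, labels):
        if predict(model, text) == label:
            hits[label] += 1
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


# Hypothetical, deliberately imbalanced toy corpus (4 / 2 / 1 documents).
docs = [
    "the team won the match",
    "the striker scored a goal",
    "the coach praised the team",
    "a late goal decided the match",
    "parliament passed the budget vote",
    "the minister lost the vote",
    "the new chip runs faster code",
]
labels = ["sports"] * 4 + ["politics"] * 2 + ["tech"]

model = train(docs, labels)
```

Calling `predict(model, "some new text")` returns exactly one label per document, which is the one-of setting. Macro-averaged recall is used here rather than accuracy because accuracy can look high on imbalanced data even when minority classes are mostly misclassified.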
Pages: 84-90
Page count: 7
Related papers
50 in total
  • [21] Multi-class imbalanced image classification using conditioned GANs
    Kumar, M. R. Pavan
    Jayagopal, Prabhu
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2021, 10 (03) : 143 - 153
  • [22] Selecting local ensembles for multi-class imbalanced data classification
    Krawczyk, Bartosz
    Cano, Alberto
    Wozniak, Michal
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [23] Undersampling with Support Vectors for Multi-Class Imbalanced Data Classification
    Krawczyk, Bartosz
    Bellinger, Colin
    Corizzo, Roberto
    Japkowicz, Nathalie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [24] Multi-class imbalanced image classification using conditioned GANs
    M R Pavan Kumar
    Prabhu Jayagopal
    International Journal of Multimedia Information Retrieval, 2021, 10 : 143 - 153
  • [25] Topic document model approach for naive Bayes text classification
    Kim, SB
    Rim, HC
    Kim, JD
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (05): : 1091 - 1094
  • [26] Binary classification trees for multi-class classification problems
    Lee, JS
    Oh, LS
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 770 - 774
  • [27] Performance Analysis and Classification of Class Imbalanced Dataset Using Complement Naive Bayes Approach
    Marapelli, Bhaskar
    Kadiyala, Sreedevi
    Potluri, Chandra Srinivas
    2023 ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES FOR HIGH PERFORMANCE APPLICATIONS, ACCTHPA, 2023,
  • [28] Adapting a Fuzzy Random Forest for Ordinal Multi-Class Classification
    Pascual-Fontanilles, Jordi
    Lhotska, Lenka
    Moreno, Antonio
    Valls, Aida
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2022, 356 : 181 - 190
  • [29] Efficient classifiers for multi-class classification problems
    Lin, Hung-Yi
    DECISION SUPPORT SYSTEMS, 2012, 53 (03) : 473 - 481
  • [30] A sequential model for multi-class classification
    Even-Zohar, Y
    Roth, D
    PROCEEDINGS OF THE 2001 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2001, : 10 - 19