Adapting Naive Bayes Model for Text Classification with One-of and Imbalanced Multi-Class Problems

被引:0
|
作者
Almaleh, Ahood [1 ]
Aslam, Muhammad Ahtisham [1 ]
Saeedi, Kawther [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah 21589, Saudi Arabia
关键词
text classification; multi-class problems; text mining; machine learning;
D O I
10.22937/IJCSNS.2020.20.09.11
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increasingly interested in research communities, the text classification area enables the text or part of the text to be classified into classes for extracting useful information. Expensive to scale, the manual classification tasks are becoming vulnerable to potential unreliability as documents in the world increase, especially if the classes number more than two (multiclass classification). As a classification technique based on algorithms, automatic classification facilitates the automatic categorization of text documents to classes, thus resulting in reliable and efficient classification. This paper aims to describe the process of using the Naive Bayes classifier for text classification with one-of and multiclass, especially in cases where the probability of imbalanced classes is higher. Our proposed process consists of a number of steps such as data preprocessing, classification model building, evaluating and predicting classes as final classification results.
引用
收藏
页码:84 / 90
页数:7
相关论文
共 50 条
  • [31] Chinese News Text Multi Classification Based on Naive Bayes Algorithm
    Wang, Fei
    Deng, Xin
    Hou, Lunqing
    ISCSIC'18: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, 2018,
  • [32] A review on over-sampling techniques in classification of multi-class imbalanced datasets: insights for medical problems
    Yang, Yuxuan
    Khorshidi, Hadi Akbarzadeh
    Aickelin, Uwe
    FRONTIERS IN DIGITAL HEALTH, 2024, 6
  • [33] Imbalanced Multi-class Classification of Structural Damage in a Wind Turbine Foundation
    Leon-Medina, Jersson X.
    Pares, Nuria
    Anaya, Maribel
    Tibaduiza, Diego
    Pozo, Francesc
    EUROPEAN WORKSHOP ON STRUCTURAL HEALTH MONITORING (EWSHM 2022), VOL 3, 2023, : 492 - 500
  • [34] Boosting methods for multi-class imbalanced data classification: an experimental review
    Jafar Tanha
    Yousef Abdi
    Negin Samadi
    Nazila Razzaghi
    Mohammad Asadpour
    Journal of Big Data, 7
  • [35] Boosting methods for multi-class imbalanced data classification: an experimental review
    Tanha, Jafar
    Abdi, Yousef
    Samadi, Negin
    Razzaghi, Nazila
    Asadpour, Mohammad
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [36] Improved multi-class classification approach for imbalanced big data on spark
    Tinku Singh
    Riya Khanna
    Manish Satakshi
    The Journal of Supercomputing, 2023, 79 : 6583 - 6611
  • [37] Improved multi-class classification approach for imbalanced big data on spark
    Singh, Tinku
    Khanna, Riya
    Satakshi
    Kumar, Manish
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (06): : 6583 - 6611
  • [38] A Novel and Effective Multi-Class Classification Method for Imbalanced Medical Transcriptions
    Bhardwaj, Priti
    Baliyan, Niyati
    IETE JOURNAL OF RESEARCH, 2024, 8 (6734-6744) : 6734 - 6744
  • [39] A new data complexity measure for multi-class imbalanced classification tasks
    Han, Mingming
    Guo, Husheng
    Wang, Wenjian
    PATTERN RECOGNITION, 2025, 157
  • [40] SAMME.C2 algorithm for imbalanced multi-class classification
    So, Banghee
    Valdez, Emiliano A.
    Soft Computing, 2024, 28 (17-18) : 9387 - 9404