Two feature weighting approaches for naive Bayes text classifiers

被引:79
|
作者
Zhang, Lungan [1 ]
Jiang, Liangxiao [1 ,2 ]
Li, Chaoqun [3 ]
Kong, Ganggang [1 ]
机构
[1] China Univ Geosci, Dept Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Dept Math, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Naive Bayes text classifiers; Feature weighting; Gain ratio; Decision tree;
D O I
10.1016/j.knosys.2016.02.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper works on feature weighting approaches for naive Bayes text classifiers. Almost all existing feature weighting approaches for naive Bayes text classifiers have some defects: limited improvement to classification performance of naive Bayes text classifiers or sacrificing the simplicity and execution time of the final models. In fact, feature weighting is not new for machine learning community, and many researchers have made fruitful efforts in the field of feature weighting. This paper reviews some simple and efficient feature weighting approaches designed for standard naive Bayes classifiers, and adapts them for naive Bayes text classifiers. As a result, this paper proposes two adaptive feature weighting approaches for naive Bayes text classifiers. Experimental results based on benchmark and real-world data show that, compared to their competitors, our feature weighting approaches show higher classification accuracy, yet at the same time maintain the simplicity and lower execution time of the final models. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:137 / 144
页数:8
相关论文
共 50 条
  • [31] Weakening Feature Independence of Naive Bayes Using Feature Weighting and Selection on Imbalanced Customer Review Data
    Cahya, Reiza Adi
    Bachtiar, Fitra A.
    2019 5TH INTERNATIONAL CONFERENCE ON SCIENCE ININFORMATION TECHNOLOGY (ICSITECH): EMBRACING INDUSTRY 4.0 - TOWARDS INNOVATION IN CYBER PHYSICAL SYSTEM, 2019, : 182 - 187
  • [32] Deep Feature Weighting Based on Genetic Algorithm and Naive Bayes for Twitter Sentiment Analysis
    Cahya, Reiza Adi
    Adimanggala, Dinda
    Supianto, Ahmad Afif
    PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET 2019), 2019, : 326 - 331
  • [33] Badminton match outcome prediction model using Naive Bayes and Feature Weighting technique
    Sharma, Manoj
    Monika
    Kumar, Naresh
    Kumar, Pardeep
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (08) : 8441 - 8455
  • [34] Feature weighting for naive Bayes using multi objective artificial bee colony algorithm
    Chaudhuri, Abhilasha
    Sahu, Tirath Prasad
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2021, 24 (01) : 74 - 88
  • [35] A correlation-based feature weighting filter for multi-label Naive Bayes
    Verma G.
    Sahu T.P.
    International Journal of Information Technology, 2024, 16 (1) : 611 - 619
  • [36] Naive Bayes text classifier
    Zhang, Haiyi
    Li, Di
    GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711
  • [37] Toward naive Bayes with attribute value weighting
    Yu, Liangjun
    Jiang, Liangxiao
    Wang, Dianhong
    Zhang, Lungan
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (10): : 5699 - 5713
  • [38] Naive Feature Selection: Sparsity in Naive Bayes
    Askari, Armin
    d'Aspremont, Alex
    El Ghaoui, Laurent
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1813 - 1821
  • [39] Learning Naive Bayes Classifiers with Incomplete Data
    Leng, Cuiping
    Wang, Shuangcheng
    Wang, Hui
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL IV, PROCEEDINGS, 2009, : 350 - +
  • [40] Error Estimation on Hybrid Naive Bayes Classifiers
    Chen, Yilan
    Wang, Huanbao
    ADVANCES IN COMPUTATIONAL ENVIRONMENT SCIENCE, 2012, 142 : 239 - 246