Two feature weighting approaches for naive Bayes text classifiers

Cited by: 79
Authors
Zhang, Lungan [1]
Jiang, Liangxiao [1, 2]
Li, Chaoqun [3]
Kong, Ganggang [1]
Affiliations
[1] China Univ Geosci, Dept Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Dept Math, Wuhan 430074, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Naive Bayes text classifiers; Feature weighting; Gain ratio; Decision tree;
DOI
10.1016/j.knosys.2016.02.017
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper addresses feature weighting approaches for naive Bayes text classifiers. Almost all existing feature weighting approaches for naive Bayes text classifiers have one of two defects: they yield only limited improvement in classification performance, or they sacrifice the simplicity and short execution time of the final models. Feature weighting is not new to the machine learning community, and many researchers have made fruitful efforts in this field. This paper reviews some simple and efficient feature weighting approaches designed for standard naive Bayes classifiers and adapts them for naive Bayes text classifiers. As a result, this paper proposes two adaptive feature weighting approaches for naive Bayes text classifiers. Experimental results on benchmark and real-world data show that, compared to their competitors, the proposed approaches achieve higher classification accuracy while maintaining the simplicity and low execution time of the final models. © 2016 Elsevier B.V. All rights reserved.
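To make the general idea concrete, the following is a minimal sketch of feature-weighted multinomial naive Bayes, in which each word's log-likelihood contribution is scaled by a per-word weight. It is not the formulation from the paper, which this record does not reproduce. The weighting scheme (gain ratio of word presence versus absence, rescaled to mean 1), the function names, and the toy corpus are all assumptions made for illustration.

```python
import math
from collections import Counter


def entropy(counts):
    """Shannon entropy (base 2) of a list of non-negative counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def gain_ratio_weights(docs, labels, vocab):
    """Assumed scheme: gain ratio of word presence/absence, rescaled to mean 1."""
    n = len(docs)
    class_entropy = entropy(list(Counter(labels).values()))
    ratios = {}
    for w in vocab:
        present = [y for d, y in zip(docs, labels) if w in d]
        absent = [y for d, y in zip(docs, labels) if w not in d]
        split_info = entropy([len(present), len(absent)])
        if split_info == 0:  # word occurs in every document or in none
            ratios[w] = 0.0
            continue
        cond = sum(len(part) / n * entropy(list(Counter(part).values()))
                   for part in (present, absent) if part)
        ratios[w] = (class_entropy - cond) / split_info
    mean = sum(ratios.values()) / len(ratios)
    return {w: (r / mean if mean > 0 else 1.0) for w, r in ratios.items()}


def train(docs, labels, vocab):
    """Multinomial naive Bayes with Laplace smoothing; returns log-probabilities."""
    classes = sorted(set(labels))
    prior = {c: math.log(labels.count(c) / len(labels)) for c in classes}
    counts = {c: Counter() for c in classes}
    for d, y in zip(docs, labels):
        counts[y].update(t for t in d if t in vocab)
    loglik = {}
    for c in classes:
        total = sum(counts[c].values()) + len(vocab)
        loglik[c] = {w: math.log((counts[c][w] + 1) / total) for w in vocab}
    return prior, loglik


def predict(doc, prior, loglik, weights):
    """Score each class as log P(c) + sum_w weight_w * tf_w * log P(w | c)."""
    tf = Counter(t for t in doc if t in weights)
    scores = {c: prior[c] + sum(weights[w] * f * loglik[c][w] for w, f in tf.items())
              for c in prior}
    return max(scores, key=scores.get)


# Toy corpus (hypothetical data, for illustration only).
docs = [["goal", "match", "team"], ["team", "win", "goal"],
        ["stock", "market", "price"], ["market", "price", "fall"]]
labels = ["sport", "sport", "finance", "finance"]
vocab = sorted({t for d in docs for t in d})

weights = gain_ratio_weights(docs, labels, vocab)
prior, loglik = train(docs, labels, vocab)
prediction = predict(["goal", "team", "price"], prior, loglik, weights)
```

In this sketch, words that separate the classes cleanly (such as "goal" in the toy corpus) receive larger weights than words that appear in only one document, so they influence the class score more. Setting every weight to 1 recovers ordinary multinomial naive Bayes.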
Pages: 137-144
Page count: 8
Related papers
50 in total
  • [41] Discrimination-based feature selection for multinomial naive Bayes text classification
    Zhu, Jingbo
    Wang, Huizhen
    Zhang, Xijuan
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 149 - +
  • [42] A Regularized Attribute Weighting Framework for Naive Bayes
    Wang, Shihe
    Ren, Jianfeng
    Bai, Ruibin
    IEEE ACCESS, 2020, 8 : 225639 - 225649
  • [43] Investigating the Statistical Assumptions of Naive Bayes Classifiers
    Kelly, Anthony
    Johnson, Marc Anthony
    2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [44] Toward naive Bayes with attribute value weighting
    Liangjun Yu
    Liangxiao Jiang
    Dianhong Wang
    Lungan Zhang
    Neural Computing and Applications, 2019, 31 : 5699 - 5713
  • [46] Naive bayes-correlation based feature weighting technique for sports match result prediction
    Sharma, Manoj
    Monika
    Kumar, Naresh
    Kumar, Pardeep
    EVOLUTIONARY INTELLIGENCE, 2022, 15 (03) : 2171 - 2186
  • [47] Comparative analysis of the impact of discretization on the classification with Naive Bayes and semi-Naive Bayes classifiers
    Mizianty, Marcin
    Kurgan, Lukasz
    Ogiela, Marek
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 823 - +
  • [48] An Improvement to Naive Bayes for Text Classification
    Zhang, Wei
    Gao, Feng
    CEIS 2011, 2011, 15
  • [49] Class dependent feature scaling method using naive Bayes classifier for text datamining
    Youn, Eunseog
    Jeong, Myong K.
    PATTERN RECOGNITION LETTERS, 2009, 30 (05) : 477 - 485
  • [50] Augmenting naive Bayes classifiers with statistical language models
    Peng, FC
    Schuurmans, D
    Wang, SJ
    INFORMATION RETRIEVAL, 2004, 7 (3-4) : 317 - 345