Text Classification Based on Naive Bayes Algorithm with Feature Selection

被引:0
|
作者
Chen, Zhenguo [1 ]
Shi, Guang [1 ]
Wang, Xiaoju [1 ]
机构
[1] N China Inst Sci & Technol, Dept Comp Sci & Technol, Beijing 101601, Peoples R China
基金
中国国家自然科学基金;
关键词
Text classification; Naive bayes; Feature selection;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Text Classification is the task to classify documents into predefined classes. It has become one of the key techniques for organizing information. Machine learning, a branch of artificial intelligence, has been used in text classification with better performance than rule based ones. But they mostly need lots of training samples in the processing, which not only brings heavy work for previous data collection, but also require a higher storage and computing resources during the processing. Naive Bayes is one of the most efficient and effective inductive learning algorithms and can get more accurate result in the large training sample set. To improve the performance, feature selection mechanisms are incorporated into naive bayes algorithm. Firstly, feature extraction techniques are applied to remove irrelevant and redundant features. After that, naive bayes classification algorithm is used to text classification. The experimental results have shown that this method keeps high classification accuracy.
引用
收藏
页码:4255 / 4260
页数:6
相关论文
共 50 条
  • [41] Principal Feature Selection Impact for Internet Traffic Classification Using Naive Bayes
    Paramita, Adi Suryaputra
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL SYSTEMS, TECHNOLOGY AND INFORMATION 2015 (ICESTI 2015), 2016, 365 : 475 - 480
  • [42] Integrating associative rule-based classification with Naive Bayes for text classification
    Hadi, Wa'el
    Al-Radaideh, Qasem A.
    Alhawari, Samer
    APPLIED SOFT COMPUTING, 2018, 69 : 344 - 356
  • [43] Adapting Hidden Naive Bayes for Text Classification
    Gan, Shengfeng
    Shao, Shiqi
    Chen, Long
    Yu, Liangjun
    Jiang, Liangxiao
    MATHEMATICS, 2021, 9 (19)
  • [44] Adapting naive Bayes tree for text classification
    Shasha Wang
    Liangxiao Jiang
    Chaoqun Li
    Knowledge and Information Systems, 2015, 44 : 77 - 89
  • [45] Iterative Feature Selection using Information Gain & Naive Bayes for Document Classification
    Rahman, Chowdhury Mofizur
    Afroze, Lameya
    Refath, Naznin Sultana
    Shawon, Nafin
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [46] A Naive Bayes approach for URL classification with supervised feature selection and rejection framework
    Rajalakshmi, R.
    Aravindan, Chandrabose
    COMPUTATIONAL INTELLIGENCE, 2018, 34 (01) : 363 - 396
  • [47] Bayesian Naive Bayes classifiers to text classification
    Xu, Shuo
    JOURNAL OF INFORMATION SCIENCE, 2018, 44 (01) : 48 - 59
  • [48] Naive Bayes for text classification with unbalanced classes
    Frank, Eibe
    Bouckaert, Remco R.
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2006, PROCEEDINGS, 2006, 4213 : 503 - 510
  • [49] Adapting naive Bayes tree for text classification
    Wang, Shasha
    Jiang, Liangxiao
    Li, Chaoqun
    KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 44 (01) : 77 - 89
  • [50] Acceleration of Naive-Bayes Algorithm on Multicore Processor for Massive Text Classification
    Zhou, Lijun
    Yu, Zhiyi
    Lin, Jie
    Zhu, Shikai
    Shi, Weijing
    Zhou, Haijie
    Song, Kunpeng
    Zeng, Xiaoyang
    2014 14TH INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS (ISIC), 2014, : 344 - 347