Application of an Improved CHI Feature Selection Algorithm

Cited by: 9
Authors
Cai, Liang-jing [1]
Lv, Shu [1]
Shi, Kai-bo [2]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Math Sci, Chengdu 611731, Sichuan, Peoples R China
[2] Chengdu Univ, Sch Elect Informat & Elect Engn, Chengdu 610106, Sichuan, Peoples R China
DOI
10.1155/2021/9963382
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Text classification is a core task in machine learning and is widely applied in information filtering, sentiment analysis, and text review. Improving classification accuracy has been the main research goal in this field in recent years. Feature selection plays an important role in text classification: it eliminates irrelevant features, reduces dimensionality, and improves classification accuracy. This paper therefore studies the CHI (chi-square) feature selection algorithm. The main work and contributions are as follows. First, the paper analyzes the flaws of the CHI algorithm, identifies the introduction of new parameters as the direction for improvement, and proposes a new algorithm based on variance and the coefficient of variation. Second, experiments are designed to verify the effectiveness of the new algorithm. In terms of language, the experiments cover two text classification systems, Chinese and English. In terms of classifiers, two are used: the KNN classifier and the Naive Bayes classifier. In terms of data, two distribution types are used: balanced and unbalanced datasets. Finally, three comparative experiments are conducted and the results of each are analyzed. In all three, the results obtained with the improved algorithm are significantly better than those obtained before the improvement.
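For orientation, the classic CHI score that the paper takes as its starting point can be sketched as follows, together with the two dispersion measures the abstract names. This is a minimal illustration, not the authors' improved algorithm: the abstract does not state how variance and the coefficient of variation enter the score, so the two quantities are only computed here, not combined. The function names and the document-count arguments `a`, `b`, `c`, `d` are assumptions made for this sketch.

```python
from math import sqrt


def chi_square(a, b, c, d):
    """Classic chi-square score of a term for one category.

    a: documents in the category that contain the term
    b: documents outside the category that contain the term
    c: documents in the category that lack the term
    d: documents outside the category that lack the term
    """
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        # Degenerate table (an empty row or column): no evidence either way.
        return 0.0
    return n * (a * d - c * b) ** 2 / denom


def variance(values):
    """Population variance of a list of numbers (0.0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def coefficient_of_variation(values):
    """Standard deviation divided by the mean (0.0 if the mean is zero)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    if mean == 0:
        return 0.0
    return sqrt(variance(values)) / mean
```

A term that appears in every document of a category and nowhere else gets the maximum score for that table, while a term spread evenly across categories scores zero. The dispersion helpers could be applied, for example, to a term's per-document frequencies within a category; that use is hypothetical and is not taken from the paper.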
Pages: 8