An Automated Text Classification Method: Using Improved Fuzzy Set Approach for Feature Selection

被引:0
|
作者
Abbasi, Bushra Zaheer [1 ]
Hussain, Shahid [1 ]
Faisal, Muhammad Imran [2 ]
机构
[1] COMSATS Univ Informat Technol, Dept Comp Sci, Islamabad, Pakistan
[2] Fed Urdu Univ, Islamabad, Pakistan
关键词
Classification; accuracy; feature selection; Fuzzy Set Theory; global feature selection; local feature selection;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A well representing feature set that has enough differentiated power plays an important role in the classification. The existing techniques for feature set selection are mostly statistical. They are not flexible to incorporate the human reasoning and the changing requirements and preferences of the real-life systems. They only make a decision between a feature inclusion or exclusion. The fuzziness of human reasoning and thinking are not considered at all that may improve the feature selection and hence the accuracy of the classifier. Also, the selection of overlapping features in case of Local Feature Selection (LFS) methods is an important issue that negatively impacts classification accuracy. For example, in case of Odd Ratio (OR), the selection may contain overlapping features of multiple classes. In this paper, a Fuzzy Set Theory (FST) based feature selection method has been proposed. The approach aims to tackle both above mentioned issues efficiently. The selected final feature set is used to train the well-known classification algorithms and the results are compared with Global Feature Selection (GFS) and LFS methods. The comparison shows that the proposed method has improved the accuracy of the classifiers and also extract comparatively small feature set that ultimately reduces the time complexity of the system.
引用
收藏
页码:666 / 670
页数:5
相关论文
共 50 条
  • [1] Feature selection using improved mutual information for text classification
    Novovicová, J
    Malík, A
    Pudil, P
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 1010 - 1017
  • [2] Fuzzy Set Similarity for Feature Selection in Classification
    Cross, Valerie
    Zmuda, Michael
    Paul, Rahul
    Hall, Lawrence
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [3] An improved method of feature selection based on concept attributes in text classification
    Liao, SS
    Jiang, MH
    ADVANCES IN NATURAL COMPUTATION, PT 1, PROCEEDINGS, 2005, 3610 : 1140 - 1149
  • [4] A new approach to feature selection in text classification
    Wang, Y
    Wang, XJ
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 3814 - 3819
  • [5] Efficient Method for Feature Selection in Text Classification
    Sun, Jian
    Zhang, Xiang
    Liao, Dan
    Chang, Victor
    2017 INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY (ICET), 2017,
  • [6] A new feature selection method for text classification
    Uchyigit, Gulden
    Clark, Keith
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (02) : 423 - 438
  • [7] Text feature selection method for hierarchical classification
    Zhu, Cui-Ling
    Ma, Jun
    Zhang, Dong-Mei
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2011, 24 (01): : 103 - 110
  • [8] Feature Selection Method of Text Tendency Classification
    Li, Yanling
    Dai, Guanzhong
    Li, Gang
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 34 - +
  • [9] An enhanced feature selection method for text classification
    Kang, Jinbeom
    Lee, Eunshil
    Hong, Kwanghee
    Park, Jeahyun
    Kim, Taehwan
    Park, Juyoung
    Choi, Joongmin
    Yang, Jaeyoung
    PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2006, : 36 - 41
  • [10] Grooming Detection using Fuzzy-Rough Feature Selection and Text Classification
    Zuo, Zheming
    Li, Jie
    Anderson, Philip
    Yang, Longzhi
    Naik, Nitin
    2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,