A modified multi objective heuristic for effective feature selection in text classification

被引:6
|
作者
Thiyagarajan, D. [1 ]
Shanthi, N. [2 ]
机构
[1] KS Rangasamy Coll Technol, Dept Comp Sci & Engn, Tiruchengode, India
[2] Nandha Engn Coll, Dept Comp Sci & Engn, Erode 638052, India
关键词
Text classification; Feature selection; Artificial fish swarm algorithm (AFSA); Classifiers; NAIVE BAYES;
D O I
10.1007/s10586-017-1150-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text categorization is the process of sorting text documents into one or more predefined categories or classes of similar documents. Differences in the results of such categorization arise from the feature set chosen to base the association of a given document with a given category. This process is challenging mainly because there can be large number of discriminating words which render many of the current algorithms unable to complete this. For most of these tasks there exist both relevant as well as irrelevant features. The objective here is to bring about a text classification on the basis of the features selected and also pre-processing to bring down the dimensionality and increase the accuracy of classification of the feature vector. Here the most commonly used methods are meta-heuristic algorithms in order to facilitate selection. Artificial fish swarm algorithm (AFSA) takes the underlying intelligence of the behaviour of fish swarming to combat the problems of optimization as well as the combinatorial problems. This method has been greatly successful in diverse applications but does suffer from certain limitations like not having multiplicity. Therefore, a modification has been proposed to AFSA which is MAFSA that has a crossover in its operation in order to bring about an improvement in the text classification selection. SVM or Support Vector Machine, Adaboost classifiers and naive bayes are all used here. MAFSA has proved itself to be superior to AFSA in terms of precision and also the selected feature numbers.
引用
收藏
页码:10625 / 10635
页数:11
相关论文
共 50 条
  • [21] Feature Selection for Ordinal Text Classification
    Baccianella, Stefano
    Esuli, Andrea
    Sebastiani, Fabrizio
    NEURAL COMPUTATION, 2014, 26 (03) : 557 - 591
  • [22] Feature Selection Methods for Text Classification
    Dasgupta, Anirban
    Drineas, Petros
    Harb, Boulos
    Josifovski, Vanja
    Mahoney, Michael W.
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 230 - +
  • [23] A heuristic for feature selection for the classification with neural nets
    Feldbusch, F
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 173 - 178
  • [24] A Comprehensive Survey on Effective Feature Selection Approaches for Text Sentiment Classification Process
    Rajpoot, Abha Kiran
    Nand, Parma
    Abidi, Ali Imam
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 971 - 977
  • [25] A systematic literature review on meta-heuristic based feature selection techniques for text classification
    Al-shalif S.A.
    Senan N.
    Saeed F.
    Ghaban W.
    Ibrahim N.
    Aamir M.
    Sharif W.
    PeerJ Computer Science, 2024, 10 : 1 - 45
  • [26] A systematic literature review on meta-heuristic based feature selection techniques for text classification
    Al-shalif, Sarah Abdulkarem
    Senan, Norhalina
    Saeed, Faisal
    Ghaban, Wad
    Ibrahim, Noraini
    Aamir, Muhammad
    Sharif, Wareesa
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [27] Modified Pointwise Mutual Information-Based Feature Selection for Text Classification
    Georgieva-Trifonova, Tsvetanka
    PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2021, VOL 2, 2022, 359 : 333 - 353
  • [28] A Review on Feature Selection and Feature Extraction for Text Classification
    Shah, Foram P.
    Patel, Vibha
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 2264 - 2268
  • [29] Multi-Objective Optimization of Feature Selection Procedure for EEG Signals Classification
    Cimpanu, Corina
    Ferariu, Lavinia
    Dumitriu, Tiberius
    Ungureanu, Florina
    2017 IEEE INTERNATIONAL CONFERENCE ON E-HEALTH AND BIOENGINEERING CONFERENCE (EHB), 2017, : 434 - 437
  • [30] MULTI-OBJECTIVE HEURISTIC FEATURE SELECTION FOR SPEECH-BASED MULTILINGUAL EMOTION RECOGNITION
    Brester, Christina
    Semenkin, Eugene
    Sidorov, Maxim
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2016, 6 (04) : 243 - 253