Feature selection based on genetic algorithm and hybrid model for sentiment polarity classification

被引:4
|
作者
Kalaivani, P. [1 ]
Shunmuganathan, K. L. [2 ]
机构
[1] Sathyabama Univ, Dept Comp Sci & Engn, St Josephs Coll Engn, Madras, Tamil Nadu, India
[2] RMK Engn Coll, Dept Comp Sci & Engn, Madras, Tamil Nadu, India
关键词
sentiment classification; supervised machine learning algorithm; feature selection; genetic algorithm; review; information gain; bagging;
D O I
10.1504/IJDMMM.2016.081242
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment classification is to find the polarity of product or user reviews. Supervised machine learning algorithms is used for opinion mining such as naive Bayes, K-nearest neighbour, decision trees, maximum entropy and hidden Markov model and support vector machine. KNN is a simple algorithm, but a less efficient classification algorithm. In this paper, we propose an improved KNN algorithm. An optimised feature selection, genetic algorithm that incorporates the information gain for feature selection and combined with bagging technique and KNN for improving the accuracy of sentiment classification. Specifically, we compared two approaches and traditional KNN for sentiment classification of movie reviews and product reviews. The same approach has been applied to other machine learning algorithms such as support vector machine and naive Bayes and the result is compared with POS-based feature set method. The proposed method is evaluated and experimental results using information gain, genetic algorithm with bagging technique indicate higher performance result with accuracy of 87.50% of the movie reviews and exhibits better performance in terms of accuracy, precision and recall for movie, DVD, electronics and kitchen reviews.
引用
收藏
页码:315 / 329
页数:15
相关论文
共 50 条
  • [41] Classification Algorithm Based on Feature Selection and Samples Selection
    Xu, Yitian
    Zhen, Ling
    Yang, Liming
    Wang, Laisheng
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 631 - 638
  • [42] Deep learning-based hybrid sentiment analysis with feature selection using optimization algorithm
    D. Anand Joseph Daniel
    M. Janaki Meena
    Multimedia Tools and Applications, 2023, 82 : 43273 - 43296
  • [43] Deep learning-based hybrid sentiment analysis with feature selection using optimization algorithm
    Daniel, D. Anand Joseph
    Meena, M. Janaki
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (28) : 43273 - 43296
  • [44] A PSO Based Hybrid Feature Selection Algorithm for High-Dimensional Classification
    Binh Tran
    Zhang, Mengjie
    Xue, Bing
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 3801 - 3808
  • [45] Study on the Method of Feature Selection Based on Hybrid Model for Text Classification
    Li, Runzhi
    Zhang, Yangsen
    MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 2881 - 2886
  • [46] Water potability classification based on hybrid stacked model and feature selection
    Ahmed M. Elshewey
    Rasha Y. Youssef
    Hazem M. El-Bakry
    Ahmed M. Osman
    Environmental Science and Pollution Research, 2025, 32 (13) : 7933 - 7949
  • [47] Hybrid genetic algorithm for feature selection with hyperspectral data
    Pal, Mahesh
    REMOTE SENSING LETTERS, 2013, 4 (07) : 619 - 628
  • [48] A novel feature selection approach by hybrid genetic algorithm
    Huang, Jinjie
    Lv, Ning
    Li, Wenlong
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 721 - 729
  • [49] HIERARCHICAL POLARIMETRIC SAR IMAGE CLASSIFICATION BASED ON FEATURE SELECTION AND GENETIC ALGORITHM
    Wang, Yunyan
    Zhuo, Tong
    Zhang, Yu
    Liao, Mingsheng
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 764 - 768
  • [50] A tribe competition-based genetic algorithm for feature selection in pattern classification
    Ma, Benteng
    Xia, Yong
    APPLIED SOFT COMPUTING, 2017, 58 : 328 - 338