A Sentiment Classification Model Using Group Characteristics of Writing Style Features

Cited by: 6
Authors
Zhao, Huan [1 ]
Zhang, Xixiang [1 ]
Li, Keqin [2 ]
Affiliations
[1] Hunan Univ, Sch Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[2] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
Keywords
IMDb; machine learning; sentiment classification; writing style; WORDS;
DOI
10.1142/S021800141756016X
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sentiment analysis is becoming increasingly important mainly because of the growth of web comments. Sentiment polarity classification is a popular process in this field. Writing style features, such as lexical and word-based features, are often used in the authorship identification and gender classification of online messages. However, writing style features were only used in feature selection for sentiment classification. This research presents an exploratory study of the group characteristics of writing style features on the Internet Movie Database (IMDb) movie sentiment data set. Furthermore, this study utilizes the specific group characteristics of writing style in improving the performance of sentiment classification. We determine the optimum clustering number of user reviews based on writing style features distribution. According to the classification model trained on a training subset with specific writing style clustering tags, we determine that the model trained on the data set of a specific writing style group has an optimal effect on the classification accuracy, which is better than the model trained on the entire data set in a particular positive or negative polarity. Through the polarity characteristics of specific writing style groups, we propose a general model in improving the performance of the existing classification approach. Results of the experiments on sentiment classification using the IMDb data set demonstrate that the proposed model improves the performance in terms of classification accuracy.
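The grouping step described in the abstract (cluster reviews by writing-style features, then train on each style group) can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration, not the authors' method: the four features in `style_features`, the standardization step, the plain k-means with deterministic initialization, and the helper name `group_by_style` are all hypothetical. The abstract does not say how the optimum number of clusters is chosen, so `k` is left as a caller-supplied parameter.

```python
import math
import string

def style_features(text):
    """Return a small, hypothetical writing-style vector for one review."""
    words = text.split()
    n_chars = max(len(text), 1)
    n_words = max(len(words), 1)
    avg_word_len = sum(len(w.strip(string.punctuation)) for w in words) / n_words
    punct_ratio = sum(ch in string.punctuation for ch in text) / n_chars
    upper_ratio = sum(ch.isupper() for ch in text) / n_chars
    return [float(len(words)), avg_word_len, punct_ratio, upper_ratio]

def standardize(rows):
    """Scale each feature column to zero mean and unit variance."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    # A constant column has zero deviation; fall back to 1.0 to avoid dividing by zero.
    stds = [math.sqrt(sum((x - m) ** 2 for x in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [[(x - m) / s for x, m, s in zip(row, means, stds)] for row in rows]

def kmeans(points, k, iters=20):
    """Plain Lloyd's k-means; centers start at the first k points (an assumption)."""
    centers = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point goes to its nearest center.
        labels = [min(range(k), key=lambda j: math.dist(p, centers[j]))
                  for p in points]
        # Update step: move each center to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old center if a cluster is empty
                centers[j] = [sum(c) / len(members) for c in zip(*members)]
    return labels, centers

def group_by_style(reviews, k):
    """Map cluster id -> list of (text, polarity) pairs sharing a writing style."""
    feats = standardize([style_features(text) for text, _ in reviews])
    labels, _ = kmeans(feats, k)
    groups = {}
    for review, lab in zip(reviews, labels):
        groups.setdefault(lab, []).append(review)
    return groups
```

Each value in the dictionary returned by `group_by_style` could then serve as the training subset for a separate sentiment classifier, whose accuracy is compared against a classifier trained on the full data set. Which classifier to use, and how a new review is routed to a group at prediction time, are left open in this sketch.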
Pages: 19
Related papers
50 in total
  • [31] Classification of Writing-Skill Features using Embodied Expertise Onomatopoeias
    Hojo, Hiroki
    Isogai, Junji
    Nakamura, Tsuyoshi
    Kanoh, Masayoshi
    Yamada, Koji
    Tomoto, Yutaro
    2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014, : 994 - 999
  • [32] Sentiment Classification Using Neural Networks with Sentiment Centroids
    Wang, Maoquan
    Chen, Shiyun
    He, Liang
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT I, 2018, 10937 : 56 - 67
  • [33] Sentiment Classification of Tweets using Hierarchical Classification
    Baqapuri, Afroze Ibrahim
    Saleh, Saad
    Ilyas, Muhammad U.
    Khan, Muhammad Murtaza
    Qamar, Ali Mustafa
    2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [34] Enhancing Text Sentiment Classification with Hybrid CNN-BiLSTM Model on WhatsApp Group
    Susandri, Susandri
    Defit, Sarjon
    Tajuddin, Muhammad
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (03) : 355 - 363
  • [35] SENTIMENT CLASSIFICATION USING TF-IDF FEATURES AND EXTENDED SPACE FOREST ENSEMBLE
    Cao, Nieqing
    Cao, Jingjing
    Lu, Haili
    Li, Bing
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOL. 2, 2015, : 526 - 532
  • [36] A text sentiment classification model using double word embedding methods
    Zhou, Mingqiang
    Liu, Dan
    Zheng, Yanhui
    Zhu, Qingsheng
    Guo, Ping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (14) : 18993 - 19012
  • [37] Text Classification of Flu-related Tweets Using FastText with Sentiment and Keyword Features
    Alessa, Ali
    Faezipour, Miad
    Alhassan, Zakhriya
    2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2018, : 366 - 367
  • [38] A text sentiment classification model using double word embedding methods
    Mingqiang Zhou
    Dan Liu
    Yanhui Zheng
    Qingsheng Zhu
    Ping Guo
    Multimedia Tools and Applications, 2022, 81 : 18993 - 19012
  • [39] Sentiment Classification of Tweets with Non-Language Features
    Akilandeswari, J.
    Jothi, G.
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 426 - 433
  • [40] Improving sentiment classification using a RoBERTa-based hybrid model
    Semary, Noura A.
    Ahmed, Wesam
    Amin, Khalid
    Plawiak, Pawel
    Hammad, Mohamed
    FRONTIERS IN HUMAN NEUROSCIENCE, 2023, 17