Exploiting effective features for Chinese sentiment classification

Cited by: 65
Authors
Zhai, Zhongwu [1 ]
Xu, Hua [1 ]
Kang, Bada [2 ]
Jia, Peifa [1 ]
Affiliations
[1] Tsinghua Univ, State Key Lab Intelligent Technol & Syst, Tsinghua Natl Lab Informat Sci & Technol, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Univ So Calif, Viterbi Sch Engn, Los Angeles, CA 90089 USA
Funding
National Natural Science Foundation of China;
Keywords
Sentiment classification; Substring features; Substring-group; Suffix tree;
DOI
10.1016/j.eswa.2011.01.047
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Features play a fundamental role in sentiment classification. How to effectively select different types of features to improve sentiment classification performance is the primary topic of this paper. N-gram features are commonly employed in text classification tasks; in this paper, sentiment words, substrings, substring-groups, and key-substring-groups, which have never before been considered in the sentiment classification area, are also extracted as features. The extracted features are then compared and analyzed. To demonstrate generality, we use two authoritative Chinese data sets in different domains to conduct our experiments. Our statistical analysis of the experimental results indicates the following: (1) different types of features possess different discriminative capabilities in Chinese sentiment classification; (2) character bigram features perform the best among the N-gram features; (3) substring-group features have greater potential to improve the performance of sentiment classification by combining substrings of different lengths; (4) sentiment words or phrases extracted from existing sentiment lexicons are not effective for sentiment classification; (5) effective features are usually of varying lengths rather than a fixed length. (C) 2011 Elsevier Ltd. All rights reserved.
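As a rough illustration of the character bigram features mentioned in the abstract, the following minimal sketch extracts overlapping character n-grams from a Chinese string. It is not the authors' implementation; the function names `char_ngrams` and `ngram_counts` and the sample review text are assumptions made for this example only.

```python
from collections import Counter


def char_ngrams(text, n=2):
    """Return the overlapping character n-grams of `text` (hypothetical helper)."""
    # A string shorter than n yields an empty list.
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def ngram_counts(text, n=2):
    """Count how often each character n-gram occurs in `text` (hypothetical helper)."""
    return Counter(char_ngrams(text, n))


# Hypothetical review text, not taken from the paper's data sets.
review = "房间很干净"
print(char_ngrams(review))  # ['房间', '间很', '很干', '干净']
```

Working at the character level avoids the need for Chinese word segmentation, which is one reason such features are convenient for this kind of task; the counts returned by `ngram_counts` could serve as a simple bag-of-bigrams vector.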
Pages: 9139-9146
Number of pages: 8