Extracting Polarity Shifting Patterns from Any Corpus Based on Natural Annotation

被引:2
|
作者
Xu, Ge [1 ,2 ,3 ]
Yang, Xiaoyan [1 ]
Cai, Yuanzheng [1 ]
Ruan, Zhiqiang [1 ]
Wang, Tao [1 ]
Liao, Xiangwen [2 ,4 ]
机构
[1] Minjiang Univ, Coll Comp & Control Engn, 200 Xiyuangong Rd, Fuzhou 350108, Fujian, Peoples R China
[2] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, 2 Xueyuan Rd, Fuzhou 350108, Fujian, Peoples R China
[3] Internet Innovat Res Ctr Humanities & Social Sci, 200 Xiyuangong Rd, Fuzhou 350108, Fujian, Peoples R China
[4] Fuzhou Univ, Coll Mathmet & Comp Sci, Fuzhou, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; natural annotation; polarity shifting; sequence mining; prior polarity;
D O I
10.1145/3345518
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, online sentiment texts are generated by users in various domains and in different languages. Binary polarity classification (positive or negative) on business sentiment texts can help both companies and customers to evaluate products or services. Sometimes, the polarity of sentiment texts can be modified, making the polarity classification difficult. In sentiment analysis, such modification of polarity is termed as polarity shifting, which shifts the polarity of a sentiment clue (emotion, evaluation, etc.). It is well known that detection of polarity shifting can help improve sentiment analysis in texts. However, to detect polarity shifting in corpora is challenging: (1) polarity shifting is normally sparse in texts. making human annotation difficult; (2) corpora with dense polarity shifting are few; we may need polarity shifting patterns from various corpora. In this article, an approach is presented to extract polarity shifting patterns from any text corpus. For the first time, we proposed to select texts rich in polarity shifting by the idea of natural annotation, which is used to replace human annotation. With a sequence mining algorithm, the selected texts are used to generate polarity shifting pattern candidates, and then we rank them by C-value before human annotation. The approach is tested on different corpora and different languages. The results show that our approach can capture various types of polarity shifting patterns, and some patterns are unique to specific corpora. Therefore, for better performance, it is reasonable to construct polarity shifting patterns directly from the given corpus.
引用
收藏
页数:16
相关论文
共 50 条