Extracting Polarity Shifting Patterns from Any Corpus Based on Natural Annotation

Cited by: 2

Authors:

Xu, Ge [1,2,3]

Yang, Xiaoyan [1]

Cai, Yuanzheng [1]

Ruan, Zhiqiang [1]

Wang, Tao [1]

Liao, Xiangwen [2,4]

Affiliations:

[1] Minjiang Univ, Coll Comp & Control Engn, 200 Xiyuangong Rd, Fuzhou 350108, Fujian, Peoples R China

[2] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, 2 Xueyuan Rd, Fuzhou 350108, Fujian, Peoples R China

[3] Internet Innovat Res Ctr Humanities & Social Sci, 200 Xiyuangong Rd, Fuzhou 350108, Fujian, Peoples R China

[4] Fuzhou Univ, Coll Mathmet & Comp Sci, Fuzhou, Fujian, Peoples R China

Source:

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING | 2020, Vol. 19, No. 2

Funding:

National Natural Science Foundation of China

Keywords:

Sentiment analysis; natural annotation; polarity shifting; sequence mining; prior polarity;

DOI:

10.1145/3345518

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In recent years, users have generated online sentiment texts in various domains and in different languages. Binary polarity classification (positive or negative) of business sentiment texts can help both companies and customers evaluate products or services. Sometimes the polarity of a sentiment text is modified, which makes polarity classification difficult. In sentiment analysis, such modification is termed polarity shifting: it shifts the polarity of a sentiment clue (emotion, evaluation, etc.). It is well known that detecting polarity shifting can help improve sentiment analysis of texts. However, detecting polarity shifting in corpora is challenging: (1) polarity shifting is normally sparse in texts, making human annotation difficult; (2) corpora with dense polarity shifting are few, so we may need polarity shifting patterns from various corpora. In this article, an approach is presented to extract polarity shifting patterns from any text corpus. For the first time, we propose to select texts rich in polarity shifting by the idea of natural annotation, which is used to replace human annotation. With a sequence mining algorithm, the selected texts are used to generate polarity shifting pattern candidates, which we then rank by C-value before human annotation. The approach is tested on different corpora and different languages. The results show that our approach can capture various types of polarity shifting patterns, and that some patterns are unique to specific corpora. Therefore, for better performance, it is reasonable to construct polarity shifting patterns directly from the given corpus.


Pages: 16
