Extracting Polarity Shifting Patterns from Any Corpus Based on Natural Annotation

被引:2
|
作者
Xu, Ge [1 ,2 ,3 ]
Yang, Xiaoyan [1 ]
Cai, Yuanzheng [1 ]
Ruan, Zhiqiang [1 ]
Wang, Tao [1 ]
Liao, Xiangwen [2 ,4 ]
机构
[1] Minjiang Univ, Coll Comp & Control Engn, 200 Xiyuangong Rd, Fuzhou 350108, Fujian, Peoples R China
[2] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, 2 Xueyuan Rd, Fuzhou 350108, Fujian, Peoples R China
[3] Internet Innovat Res Ctr Humanities & Social Sci, 200 Xiyuangong Rd, Fuzhou 350108, Fujian, Peoples R China
[4] Fuzhou Univ, Coll Mathmet & Comp Sci, Fuzhou, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; natural annotation; polarity shifting; sequence mining; prior polarity;
D O I
10.1145/3345518
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, online sentiment texts are generated by users in various domains and in different languages. Binary polarity classification (positive or negative) on business sentiment texts can help both companies and customers to evaluate products or services. Sometimes, the polarity of sentiment texts can be modified, making the polarity classification difficult. In sentiment analysis, such modification of polarity is termed as polarity shifting, which shifts the polarity of a sentiment clue (emotion, evaluation, etc.). It is well known that detection of polarity shifting can help improve sentiment analysis in texts. However, to detect polarity shifting in corpora is challenging: (1) polarity shifting is normally sparse in texts. making human annotation difficult; (2) corpora with dense polarity shifting are few; we may need polarity shifting patterns from various corpora. In this article, an approach is presented to extract polarity shifting patterns from any text corpus. For the first time, we proposed to select texts rich in polarity shifting by the idea of natural annotation, which is used to replace human annotation. With a sequence mining algorithm, the selected texts are used to generate polarity shifting pattern candidates, and then we rank them by C-value before human annotation. The approach is tested on different corpora and different languages. The results show that our approach can capture various types of polarity shifting patterns, and some patterns are unique to specific corpora. Therefore, for better performance, it is reasonable to construct polarity shifting patterns directly from the given corpus.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Extracting Temporal Patterns from Large-Scale Text Corpus
    Liu, Yu
    Hua, Wen
    Zhou, Xiaofang
    DATABASES THEORY AND APPLICATIONS (ADC 2019), 2019, 11393 : 17 - 30
  • [2] Adding value to, and extracting of value from, a signed language corpus through secondary processing: implications for annotation schemas and corpus creation
    Johnston, Trevor
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : A137 - A142
  • [3] Extracting Sentence Elements for the Natural Language Understanding Based on Slovak National Corpus
    Ondas, Stanislav
    Juhar, Jozef
    Cizmar, Anton
    ANALYSIS OF VERBAL AND NONVERBAL COMMUNICATION AND ENACTMENT: THE PROCESSING ISSUES, 2011, 6800 : 171 - 177
  • [4] Extracting answers to natural language questions from large-scale corpus
    Li, P
    Wang, XL
    Guan, Y
    Zhao, YM
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 690 - 694
  • [5] Search-Based Image Annotation: Extracting Semantics from Similar Images
    Budikova, Petra
    Batko, Michal
    Botorek, Jan
    Zezula, Pavel
    EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION, 2015, 9283 : 327 - 339
  • [6] SEWAR: A corpus-based N-gram approach for extracting semantically-related words from Arabic medical corpus
    AlMahmoud, Rana Husni
    Hammo, Bassam H.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [7] Extracting semantic relations from the Quranic Arabic based on Arabic conjunctive patterns
    Bentrcia, Rahima
    Zidat, Samir
    Marir, Farhi
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2018, 30 (03) : 382 - 390
  • [8] Extracting recent weighted-based patterns from uncertain temporal databases
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Chao, Han-Chieh
    Wu, Jimmy Ming-Tai
    Zhan, Justin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 61 : 161 - 172
  • [9] Research on Extracting Design Patterns from Source Code based on dynamic analysis
    Li Wen-jin
    Chen Guang-ping
    Pan Ju-long
    2011 INTERNATIONAL CONFERENCE ON COMPUTER APPLICATION AND EDUCATION TECHNOLOGY (ICCAET 2011), 2011, : 369 - 373
  • [10] Extracting Elements of Component-based Systems from Natural Language Requirements
    Lau, Kung-Kiu
    Nordin, Azlin
    Ng, Keng-Yap
    2011 37TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2011), 2011, : 39 - 46