Extracting Polarity Shifting Patterns from Any Corpus Based on Natural Annotation

被引:2
|
作者
Xu, Ge [1 ,2 ,3 ]
Yang, Xiaoyan [1 ]
Cai, Yuanzheng [1 ]
Ruan, Zhiqiang [1 ]
Wang, Tao [1 ]
Liao, Xiangwen [2 ,4 ]
机构
[1] Minjiang Univ, Coll Comp & Control Engn, 200 Xiyuangong Rd, Fuzhou 350108, Fujian, Peoples R China
[2] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, 2 Xueyuan Rd, Fuzhou 350108, Fujian, Peoples R China
[3] Internet Innovat Res Ctr Humanities & Social Sci, 200 Xiyuangong Rd, Fuzhou 350108, Fujian, Peoples R China
[4] Fuzhou Univ, Coll Mathmet & Comp Sci, Fuzhou, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; natural annotation; polarity shifting; sequence mining; prior polarity;
D O I
10.1145/3345518
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, online sentiment texts are generated by users in various domains and in different languages. Binary polarity classification (positive or negative) on business sentiment texts can help both companies and customers to evaluate products or services. Sometimes, the polarity of sentiment texts can be modified, making the polarity classification difficult. In sentiment analysis, such modification of polarity is termed as polarity shifting, which shifts the polarity of a sentiment clue (emotion, evaluation, etc.). It is well known that detection of polarity shifting can help improve sentiment analysis in texts. However, to detect polarity shifting in corpora is challenging: (1) polarity shifting is normally sparse in texts. making human annotation difficult; (2) corpora with dense polarity shifting are few; we may need polarity shifting patterns from various corpora. In this article, an approach is presented to extract polarity shifting patterns from any text corpus. For the first time, we proposed to select texts rich in polarity shifting by the idea of natural annotation, which is used to replace human annotation. With a sequence mining algorithm, the selected texts are used to generate polarity shifting pattern candidates, and then we rank them by C-value before human annotation. The approach is tested on different corpora and different languages. The results show that our approach can capture various types of polarity shifting patterns, and some patterns are unique to specific corpora. Therefore, for better performance, it is reasonable to construct polarity shifting patterns directly from the given corpus.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Naloxone distribution amidst shifting drug use patterns: Insights from a needs-based syringe services program
    Eger, William H.
    Paltin, Dafna
    Ross, Jacob D.
    Bailey, Katie
    Nguyen, Amanda, V
    Solomon, Eli M.
    Bartholomew, Tyler S.
    Han, Benjamin H.
    Bazzi, Angela R.
    DRUG AND ALCOHOL DEPENDENCE, 2025, 269
  • [32] Choline Chloride Based Natural Deep Eutectic Solvents as Extraction Media for Extracting Phenolic Compounds from Chokeberry (Aronia melanocarpa)
    Razborsek, Masa Islamcevic
    Ivanovic, Milena
    Krajnc, Peter
    Kolar, Mitja
    MOLECULES, 2020, 25 (07):
  • [33] A new green alternative solvent for extracting echinacoside and acteoside from Cistanche deserticola based on ternary natural deep eutectic solvent
    Nie, Fang
    Feng, Changyin
    Ahmad, Naveed
    Tian, Mengfei
    Liu, Qinglong
    Wang, Weihao
    Lin, Ziqi
    Li, Chunying
    Zhao, Chunjian
    JOURNAL OF INDUSTRIAL AND ENGINEERING CHEMISTRY, 2023, 118 : 499 - 510
  • [34] Shifting the natural deep eutectic solvent based liquid lipase extraction from batch to continuous for more efficient process performance
    Salic, Anita
    Ljubic, Anabela
    Marcinko, Tomislav
    Tusek, Ana Jurinjak
    Bubalo, Marina Cvjetko
    Tisma, Marina
    Zelic, Bruno
    JOURNAL OF CLEANER PRODUCTION, 2023, 405
  • [35] Feature-based molecular networking and network annotation propagation applied to natural antiviral compound research from tropical Euphorbiaceae.
    Remy, S.
    Olivon, F.
    Solis, D.
    Touboul, D.
    Litaudon, M.
    PLANTA MEDICA, 2019, 85 (18) : 1396 - 1396
  • [36] RCM-extractor: an automated NLP-based approach for extracting a semi formal representation model from natural language requirements
    Zaki-Ismail, Aya
    Osama, Mohamed
    Abdelrazek, Mohamed
    Grundy, John
    Ibrahim, Amani
    AUTOMATED SOFTWARE ENGINEERING, 2022, 29 (01)
  • [37] RCM-extractor: an automated NLP-based approach for extracting a semi formal representation model from natural language requirements
    Aya Zaki-Ismail
    Mohamed Osama
    Mohamed Abdelrazek
    John Grundy
    Amani Ibrahim
    Automated Software Engineering, 2022, 29
  • [38] METHODS FOR EXTRACTING TREATMENT PATTERNS FOR RENAL CELL CARCINOMA (RCC) FROM SOCIAL MEDIA (SM) FORUMS USING NATURAL LANGUAGE PROCESSING (NLP) AND MACHINE LEARNING (ML)
    Merinopoulou, E.
    Ramagopalan, S.
    Malcolm, B.
    Cox, A.
    VALUE IN HEALTH, 2017, 20 (09) : A402 - A402
  • [39] Machine Learning-Based Natural Language Processing for Automated Extraction and Standardized Annotation of IHC Results from Free Text Pathology Reports
    Kim, Young Suk
    Roehrl, Michael H. A.
    MODERN PATHOLOGY, 2019, 32
  • [40] Machine Learning-Based Natural Language Processing for Automated Extraction and Standardized Annotation of IHC Results from Free Text Pathology Reports
    Kim, Young Suk
    Roehrl, Michael H. A.
    LABORATORY INVESTIGATION, 2019, 99