PELESent: Cross-domain polarity classification using distant supervision

被引:5
|
作者
Correa, Edilson A., Jr. [1 ]
Marinho, Vanessa Q. [1 ]
dos Santos, Leandro B. [1 ]
Bertaglia, Thales F. C. [1 ]
Treviso, Marcos V. [1 ]
Brum, Henrico B. [1 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1109/BRACIS.2017.45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The enormous amount of texts published daily by Internet users has fostered the development of methods to analyze this content in several natural language processing areas, such as sentiment analysis. The main goal of this task is to classify the polarity of a message. Even though many approaches have been proposed for sentiment analysis, some of the most successful ones rely on the availability of large annotated corpus, which is an expensive and time-consuming process. In recent years, distant supervision has been used to obtain larger datasets. So, inspired by these techniques, in this paper we extend such approaches to incorporate popular graphic symbols used in electronic messages, the emojis, in order to create a large sentiment corpus for Portuguese. Trained on almost one million tweets, several models were tested in both same domain and cross-domain corpora. Our methods obtained very competitive results in five annotated corpora from mixed domains (Twitter and product reviews), which proves the domain-independent property of such approach. In addition, our results suggest that the combination of emoticons and emojis is able to properly capture the sentiment of a message.
引用
收藏
页码:49 / 54
页数:6
相关论文
共 50 条
  • [21] Representation Learning for Imbalanced Cross-Domain Classification
    Cheng, Lu
    Guo, Ruocheng
    Candan, K. Selcuk
    Liu, Huan
    PROCEEDINGS OF THE 2020 SIAM INTERNATIONAL CONFERENCE ON DATA MINING (SDM), 2020, : 478 - 486
  • [22] Knowledge Transformation for Cross-Domain Sentiment Classification
    Li, Tao
    Sindhwani, Vikas
    Ding, Chris
    Zhang, Yi
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 716 - 717
  • [23] Cross-Domain Knowledge Transfer Using Semi-supervised Classification
    Zhen, Yi
    Li, Chunping
    AI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5360 : 362 - +
  • [24] Unsupervised Cross-Domain White Blood Cells Classification Using DANN
    Zhang, Lixin
    Fu, Yining
    Yang, Yuhao
    Ding, Yongzheng
    Yu, Xuyao
    Li, Huanming
    Yu, Hui
    Chen, Chong
    2022 9TH INTERNATIONAL CONFERENCE ON BIOMEDICAL AND BIOINFORMATICS ENGINEERING, ICBBE 2022, 2022, : 17 - 21
  • [25] Cross-domain Informativeness Classification for Disaster Situations
    Graf, David
    Retschitzegger, Werner
    Schwinger, Wieland
    Proell, Birgit
    Kapsammer, Elisabeth
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON MANAGEMENT OF DIGITAL ECOSYSTEMS (MEDES'18), 2018, : 183 - 190
  • [26] A unified framework for cross-domain sentiment classification
    Wu, Qiong
    Liu, Yue
    Shen, Huawei
    Zhang, Jin
    Xu, Hongbo
    Cheng, Xueqi
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2013, 50 (08): : 1683 - 1689
  • [27] Cross-Domain Topic Classification for Political Texts
    Osnabruegge, Moritz
    Ash, Elliott
    Morelli, Massimo
    POLITICAL ANALYSIS, 2023, 31 (01) : 59 - 80
  • [28] CROSS-DOMAIN CNN FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Lee, Hyungtae
    Eum, Sungmin
    Kwon, Heesung
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 3627 - 3630
  • [29] Iterative Reinforcement Cross-Domain Text Classification
    Zhang, Di
    Xue, Gui-Rong
    Yu, Yong
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 282 - 293
  • [30] Cross-domain knowledge distillation for text classification
    Zhang, Shaokang
    Jiang, Lei
    Tan, Jianlong
    NEUROCOMPUTING, 2022, 509 : 11 - 20