PELESent: Cross-domain polarity classification using distant supervision

被引:5
|
作者
Correa, Edilson A., Jr. [1 ]
Marinho, Vanessa Q. [1 ]
dos Santos, Leandro B. [1 ]
Bertaglia, Thales F. C. [1 ]
Treviso, Marcos V. [1 ]
Brum, Henrico B. [1 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1109/BRACIS.2017.45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The enormous amount of texts published daily by Internet users has fostered the development of methods to analyze this content in several natural language processing areas, such as sentiment analysis. The main goal of this task is to classify the polarity of a message. Even though many approaches have been proposed for sentiment analysis, some of the most successful ones rely on the availability of large annotated corpus, which is an expensive and time-consuming process. In recent years, distant supervision has been used to obtain larger datasets. So, inspired by these techniques, in this paper we extend such approaches to incorporate popular graphic symbols used in electronic messages, the emojis, in order to create a large sentiment corpus for Portuguese. Trained on almost one million tweets, several models were tested in both same domain and cross-domain corpora. Our methods obtained very competitive results in five annotated corpora from mixed domains (Twitter and product reviews), which proves the domain-independent property of such approach. In addition, our results suggest that the combination of emoticons and emojis is able to properly capture the sentiment of a message.
引用
收藏
页码:49 / 54
页数:6
相关论文
共 50 条
  • [1] An Ensemble Model for Cross-Domain Polarity Classification on Twitter
    Tsakalidis, Adam
    Papadopoulos, Symeon
    Kompatsiaris, Ioannis
    WEB INFORMATION SYSTEMS ENGINEERING, PT II, 2014, 8787 : 168 - 177
  • [2] Cross-domain sentiment classification initiated with Polarity Detection Task
    Kansal, Nancy
    Goel, Lipika
    Gupta, Sonam
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2021, 8 (30): : 1 - 17
  • [3] Cross-domain polarity classification using a knowledge-enhanced meta-classifier
    Franco-Salvador, Marc
    Cruz, Fermin L.
    Troyano, Jose A.
    Rosso, Paolo
    KNOWLEDGE-BASED SYSTEMS, 2015, 86 : 46 - 56
  • [4] Cross-Domain Labeled LDA for Cross-Domain Text Classification
    Jing, Baoyu
    Lu, Chenwei
    Wang, Deqing
    Zhuang, Fuzhen
    Niu, Cheng
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 187 - 196
  • [5] Cross-domain sentiment classification-feature divergence, polarity divergence or both?
    Zhang, Yuhong
    Hu, Xuegang
    Li, Peipei
    Li, Lei
    Wu, Xindong
    PATTERN RECOGNITION LETTERS, 2015, 65 : 44 - 50
  • [6] A semantic approach based on domain knowledge for polarity shift detection using distant supervision
    Ayeste, Zahra
    Noferesti, Samira
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2022, 11 (02) : 169 - 180
  • [7] A semantic approach based on domain knowledge for polarity shift detection using distant supervision
    Zahra Ayeste
    Samira Noferesti
    Progress in Artificial Intelligence, 2022, 11 : 169 - 180
  • [8] Leveraging ParsBERT for cross-domain polarity sentiment classification of Persian social media comments
    Nigjeh, Mahnaz Panahandeh
    Ghanbari, Shirin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10677 - 10694
  • [9] Cross-domain Network Traffic Classification Using Unsupervised Domain Adaptation
    Li, Dongpu
    Yuan, Qifeng
    Li, Tan
    Chen, Shuangwu
    Yang, Jian
    2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020), 2020, : 245 - +
  • [10] Leveraging ParsBERT for cross-domain polarity sentiment classification of Persian social media comments
    Mahnaz Panahandeh Nigjeh
    Shirin Ghanbari
    Multimedia Tools and Applications, 2024, 83 : 10677 - 10694