Translation Is Not Enough: Comparing Lexicon-based Methods for Sentiment Analysis in Persian

被引:0
|
作者
Basiri, Mohammad Ehsan [1 ]
Kabiri, Arman [1 ]
机构
[1] Shahrekord Univ, Dept Comp Engn, Shahrekord, Iran
关键词
component; Sentiment Analysis; Natural Language Processing; Persian Language; Lexicon-based approach; Opinion mining; Data Mining; SOCIAL MEDIA;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Sentiment analysis is a subfield of data mining and natural language processing with the aim of extracting people's opinion and appraisals from their comments on the Web. Contrary to machine learning approach, lexicon-based methods have some important advantages like domain-independency and being needless of a large annotated training corpus and hence are faster. This makes lexicon-based approach prevalent in the sentiment analysis community. However, for Persian language, in contrast to English, using lexicon-based method is a new discipline. There are limited lexicons available for sentiment analysis in Persian, almost all of them are directly translated from English. In the current study, four lexicons are compared to show the importance of lexicons in the performance of document-level sentiment analysis. Specifically, the Persian version of NRC lexicon, SentiStrength, CNRC, and Adjectives are compared in a pure lexicon-based scenario. Experiments are carried out on the document-level edition of SPerSent dataset. Results show that direct translation used in NRC leads the poorest performance while pre-processing and refining lexicons used in SentiStrength and CNRC improves the performance. Also, the results show that using just adjectives leads to higher results in comparison to using NRC.
引用
收藏
页码:36 / 41
页数:6
相关论文
共 50 条
  • [31] Comprehensive Study on Lexicon-based Ensemble Classification Sentiment Analysis
    Augustyniak, Lukasz
    Szymanski, Piotr
    Kajdanowicz, Tomasz
    Tuliglowicz, Wlodzimierz
    ENTROPY, 2016, 18 (01)
  • [32] A polarity calculation approach for lexicon-based Turkish sentiment analysis
    Yurtalan, Gokhan
    Koyuncu, Murat
    Turhan, Cigdem
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1325 - 1339
  • [33] An Experimental Study of Lexicon-based Sentiment Analysis on Bahasa Indonesia
    Pamungkas, Endang Wahyu
    Putri, Divi Galih Prasetyo
    2016 6TH INTERNATIONAL ANNUAL ENGINEERING SEMINAR (INAES), 2016, : 28 - 31
  • [34] Lexicon-Based Sentiment Analysis of Facebook Comments in Vietnamese Language
    Son Trinh
    Luu Nguyen
    Minh Vo
    Phuc Do
    RECENT DEVELOPMENTS IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2016, 642 : 263 - 276
  • [35] Aspect-Oriented Lexicon-Based Sentiment Analysis of Students' Feedback
    Kathuria, Abhinav
    Gupta, Anu
    Singla, R. K.
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (03)
  • [36] Lexicon-Based Sentiment Convolutional Neural Networks for Online Review Analysis
    Huang, Minghui
    Xie, Haoran
    Rao, Yanghui
    Liu, Yuwei
    Poon, Leonard K. M.
    Wang, Fu Lee
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (03) : 1337 - 1348
  • [37] Aspect-Oriented Lexicon-Based Sentiment Analysis of Students' Feedback
    Kathuria, Abhinav
    Gupta, Anu
    Singla, R. K.
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023,
  • [38] FWLSA-score: French and Wolof Lexicon-based for Sentiment Analysis
    Kande, Demba
    Camara, Fode
    Ndiaye, Samba
    Guirassy, Fode M. L.
    5TH INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT (ICIM 2019), 2019, : 215 - 220
  • [39] Lexi-Augmenter: Lexicon-Based Model for Tweets Sentiment Analysis
    Alashri, Saud
    Alzahrani, Sultan
    Alhoshan, Muneera
    Alkhanen, Imaan
    Alghunaim, Sara
    Alhassoun, Manal
    2019 22ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (IEEE CSE 2019) AND 17TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (IEEE EUC 2019), 2019, : 7 - 10
  • [40] Automated measures of sentiment via transformer- and lexicon-based sentiment analysis (TLSA)
    Zhao, Xinyan
    Wong, Chau-Wai
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2024, 7 (01): : 145 - 170