Natural Language Processing and Sentiment Analysis on Bangla Social Media Comments on Russia-Ukraine War Using Transformers

Cited by: 8
Authors
Hasan, Mahmud [1 ]
Islam, Labiba [1 ]
Jahan, Ismat [1 ]
Meem, Sabrina Mannan [1 ]
Rahman, Rashedur M. [1 ]
Affiliation
[1] North South Univ, Dept Elect & Comp Engn, Dhaka 1229, Bangladesh
Keywords
Natural language processing; sentiment analysis; transformers; Russia-Ukraine war
DOI
10.1142/S2196888823500021
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The Bangla language ranks seventh among the most spoken languages in the world, with 265 million native and non-native speakers, and is the second most spoken Indo-Aryan language after Hindi. However, research on tasks such as sentiment analysis (SA) in Bangla has grown slowly compared with SA in English, because there are not enough high-quality, publicly available datasets for training language models on Bangla text classification tasks. In this paper, we propose an annotated Bangla dataset for sentiment analysis on the ongoing Ukraine-Russia war. The dataset was developed by collecting Bangla comments from various videos of three prominent YouTube TV news channels of Bangladesh reporting on the ongoing conflict. A total of 10,861 Bangla comments were collected and labeled with three sentiment polarities: Neutral, Pro-Ukraine (Positive), and Pro-Russia (Negative). A benchmark classifier was developed by experimenting with several transformer-based language models, all pre-trained on unlabeled Bangla corpora. The models were fine-tuned on our collected dataset. Hyperparameter optimization was performed on all five transformer language models: BanglaBERT, XLM-RoBERTa-base, XLM-RoBERTa-large, Distil-mBERT and mBERT. Each model was evaluated and analyzed using several metrics: F1 score, accuracy, and AIC (Akaike Information Criterion). The best-performing model achieved an accuracy of 86% with an F1 score of 0.82. Based on accuracy, F1 score and AIC, BanglaBERT outperforms the baseline and all the other transformer-based classifiers.
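The three metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code: the abstract does not say which F1 averaging was used or how the likelihood and parameter count for AIC were obtained, so macro averaging, the function names, and the toy labels below are all assumptions made for illustration.

```python
# Illustrative sketch only; function names and the macro-averaging choice are assumptions.
LABELS = ["Neutral", "Pro-Ukraine", "Pro-Russia"]  # the three classes named in the abstract


def accuracy(y_true, y_pred):
    """Fraction of comments whose predicted label matches the annotated label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred, labels=LABELS):
    """Unweighted mean of per-class F1 scores (assumed averaging scheme)."""
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def aic(log_likelihood, num_params):
    """Akaike Information Criterion: AIC = 2k - 2 ln(L). Lower is better."""
    return 2 * num_params - 2 * log_likelihood


# Toy example with made-up labels, not data from the paper.
y_true = ["Neutral", "Pro-Ukraine", "Pro-Russia", "Neutral"]
y_pred = ["Neutral", "Pro-Ukraine", "Neutral", "Neutral"]
print(accuracy(y_true, y_pred))
print(macro_f1(y_true, y_pred))
```

Accuracy and F1 reward correct predictions, while AIC additionally penalizes model size through the parameter count, which is why a smaller model can be preferred at similar accuracy.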
Pages: 329-356
Number of pages: 28
Related papers
50 in total
  • [21] Sentiment Analysis: Automated Evaluation Using Natural Language Processing
    Novak, Michal
    CREATING GLOBAL COMPETITIVE ECONOMIES: 2020 VISION PLANNING & IMPLEMENTATION, VOLS 1-3, 2013, : 973 - 975
  • [22] Natural Language Processing for the Analysis Sentiment using a LSTM Model
    Berrajaa, Achraf
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 777 - 785
  • [23] Sentiment Analysis of Social Media Comments Using Machine Learning Algorithms
    Taghiyeva, Laman
    Hasanova, Narmin
    Omarova, Masuda
    Rustamov, Samir
    2023 5th International Conference on Problems of Cybernetics and Informatics, PCI 2023, 2023,
  • [24] Special Issue on Natural Language Processing for Social Media Analysis
    Mporas, Iosif
    Simaki, Vasiliki
    Paradis, Carita
    Kerren, Andreas
    Paraskevas, Michael
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2020, 29 (02)
  • [25] ACTIVITY OF BRAZILIAN TOURISM AGENCIES IN SOCIAL MEDIA: AN ANALYSIS USING NATURAL LANGUAGE PROCESSING
    Guedes, Danillo Magno Duarte
    Gosling, Marlusa de Sevilha
    PERSPECTIVAS EM CIENCIA DA INFORMACAO, 2023, 28
  • [26] Emotion Detection from Text and Sentiment Analysis of Ukraine Russia War using Machine Learning Technique
    Al Maruf, Abdullah
    Ziyad, Zakaria Masud
    Haque, Md Mahmudul
    Khanam, Fahima
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 868 - 882
  • [27] Sentiment Analysis of Russian-Language Social Media Posts Discussing the 2022 Russian Invasion of Ukraine
    Dean, Matthew C.
    Porter, Ben
    ARMED FORCES & SOCIETY, 2024,
  • [28] Urgency Detection in Social Media Texts Using Natural Language Processing
    Makkena, Navya
    Islam, A. B. M. Rezbaul
    Varol, Cihan
    An, Min Kyung
    18TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC 2024, 2024, : 156 - 163
  • [30] A Study of Profanity Effect in Sentiment Analysis on Natural Language Processing Using ANN
    Kim, Cheong-Ghil
    Hwang, Young-Jun
    Kamyod, Chayapol
    JOURNAL OF WEB ENGINEERING, 2022, 21 (03) : 751 - 766