Natural Language Processing and Sentiment Analysis on Bangla Social Media Comments on Russia-Ukraine War Using Transformers

被引:8
|
作者
Hasan, Mahmud [1 ]
Islam, Labiba [1 ]
Jahan, Ismat [1 ]
Meem, Sabrina Mannan [1 ]
Rahman, Rashedur M. [1 ]
机构
[1] North South Univ, Dept Elect & Comp Engn, Dhaka 1229, Bangladesh
关键词
Natural language processing; sentiment analysis; transformers; Russia-Ukraine war;
D O I
10.1142/S2196888823500021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Bangla Language ranks seventh in the list of most spoken languages with 265 native and non-native speakers around the world and the second Indo-Aryan language after Hindi. However, the growth of research for tasks such as sentiment analysis (SA) in Bangla is relatively low compared to SA in the English language. It is because there are not enough high-quality publically available datasets for training language models for text classification tasks in Bangla. In this paper, we propose a Bangla annotated dataset for sentiment analysis on the ongoing Ukraine-Russia war. The dataset was developed by collecting Bangla comments from various videos of three prominent YouTube TV news channels of Bangladesh covering their report on the ongoing conflict. A total of 10,861 Bangla comments were collected and labeled with three polarity sentiments, namely Neutral, Pro-Ukraine (Positive), and Pro-Russia (Negative). A benchmark classifier was developed by experimenting with several transformer-based language models all pre-trained on unlabeled Bangla corpus. The models were fine-tuned using our procured dataset. Hyperparameter optimization was performed on all 5 transformer language models which include: BanglaBERT, XLM-RoBERTa-base, XLM-RoBERTa-large, Distil-mBERT and mBERT. Each model was evaluated and analyzed using several evaluation metrics which include: F1 score, accuracy, and AIC (Akaike Information Criterion). The best-performing model achieved the highest accuracy of 86% with 0.82 F1 score. Based on accuracy, F1 score and AIC, BanglaBERT outperforms baseline and all the other transformer-based classifiers.
引用
收藏
页码:329 / 356
页数:28
相关论文
共 50 条
  • [31] Telugu Movie Review Sentiment Analysis Using Natural Language Processing Approach
    Badugu, Srinivasu
    DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 685 - 695
  • [32] Sentiment Analysis and Comprehensive Evaluation of Supervised Machine Learning Models Using Twitter Data on Russia–Ukraine War
    Wadhwani G.K.
    Varshney P.K.
    Gupta A.
    Kumar S.
    SN Computer Science, 4 (4)
  • [33] Deep Learning-Based Natural Language Processing Methods for Sentiment Analysis in Social Networks
    Li, Yan
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [34] Microvascular Decompression and Trigeminal Neuralgia: Patient Sentiment Analysis Using Natural Language Processing
    Niazi, Farbod
    Elkaim, Lior M.
    Khomami, Nima Mehdy Zadeh
    Levett, Jordan J.
    Weil, Alexander G.
    Hodaie, Mojgan
    Alotaibi, Naif M.
    WORLD NEUROSURGERY, 2023, 180 : E528 - E536
  • [35] Public discourse and sentiment during Mpox outbreak: an analysis using natural language processing
    Anoop, V. S.
    Sreelakshmi, S.
    PUBLIC HEALTH, 2023, 218 : 114 - 120
  • [36] Contextual Analysis of Social Media: The Promise and Challenge of Eliciting Context in Social Media Posts with Natural Language Processing
    Patton, Desmond U.
    Frey, William R.
    McGregor, Kyle A.
    Lee, Fei-Tzin
    McKeown, Kathleen
    Moss, Emanuel
    PROCEEDINGS OF THE 3RD AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY AIES 2020, 2020, : 337 - 342
  • [37] Personality analysis for social media users using Arabic language and its effect on sentiment analysis
    Dandash, Mokhaiber
    Asadpour, Masoud
    SOCIAL NETWORK ANALYSIS AND MINING, 2025, 15 (01)
  • [38] Sentiment Analysis of Social Media Content in Pashto Language using Deep Learning Algorithms
    Iqbal, Saqib
    Khan, Farhad
    Khan, Hikmat Ullah
    Iqba, Tassawar
    Shah, Jamal Hussain
    JOURNAL OF INTERNET TECHNOLOGY, 2022, 23 (07): : 1669 - 1677
  • [39] Do social media create revolutions? Using Twitter sentiment analysis for predicting the Maidan Revolution in Ukraine
    Sabatovych, Iana
    GLOBAL MEDIA AND COMMUNICATION, 2019, 15 (03) : 275 - 283
  • [40] The Perspectives of Individuals with Comorbidities Towards COVID-19 Booster Vaccine Shots in Twitter: A Social Media Analysis Using Natural Language Processing, Sentiment Analysis and Topic Modeling
    Praveen, S. V.
    Sundar, R.
    Vajrobol, Vajratiya
    Ittamalla, Rajesh
    Srividya, K.
    Farahat, Ramadan Abdelmoez
    Chopra, Hitesh
    Rehman, Ebad Ur
    Rehman, Mohammad Ebad Ur
    Chakraborty, Chiranjib
    Dhama, Kuldeep
    JOURNAL OF PURE AND APPLIED MICROBIOLOGY, 2023, 17 (01): : 567 - 575