Transfer learning and sentiment analysis of Bahraini dialects sequential text data using multilingual deep learning approach

Cited by: 14
Authors
Omran, Thuraya M. [1]
Sharef, Baraa T. [2]
Grosan, Crina [3]
Li, Yongmin [1]
Affiliations
[1] Brunel Univ London, Dept Comp Sci, Uxbridge, England
[2] Ahlia Univ, Coll Informat Technol, Dept Informat Technol, Manama, Bahrain
[3] Kings Coll London, Div Appl Technol Clin Care, London, England
Keywords
Bahraini dialects; Deep learning; Long Short Term Memory; Modern standard Arabic; Sentiment analysis; Transfer learning; Word embedding; IDENTIFICATION
DOI
10.1016/j.datak.2022.102106
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sentiment analysis is a crucial Natural Language Processing (NLP) task that analyzes users' emotions and opinions towards events, entities, services, or products. Arabic NLP faces numerous challenges, including: (1) the scarcity of resources, especially in Modern Standard Arabic and Arabic dialects, particularly the Bahraini one; (2) a lack of multilingual deep learning models; and (3) insufficient transfer learning studies on Arabic dialects in general and Bahraini dialects specifically. This research aims to create a balanced dataset of product reviews in Bahraini dialects by translating English Amazon product reviews into Modern Standard Arabic and then converting the translations into Bahraini dialects. A second aim is to provide a reusable multilingual deep learning Long Short-Term Memory (LSTM) model to analyze the parallel dataset of English, Modern Standard Arabic, and Bahraini dialects, which differ in their linguistic properties. Many experiments were conducted using a train-validate-test split and k-fold cross-validation, and model performance was evaluated with accuracy, F1 score, and AUC. When an augmentation technique was used, the model's average accuracy across all datasets ranged from 96.72% to 97.04%, its F1 score from 97.91% to 97.93%, and its AUC from 98.46% to 98.7%. Moreover, a pre-trained LSTM model was created to transfer the knowledge gained from analyzing product reviews in Bahraini dialects to sentiment analysis on a small dataset of movie comments in the same dialects. The pre-trained model achieved 96.97% accuracy, a 96.65% F1 score, and a 97.94% AUC.
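The abstract names three evaluation metrics (accuracy, F1 score, AUC) and k-fold cross-validation but this record does not show how they were computed. The following is a minimal, dependency-free sketch of those standard definitions for binary sentiment labels. It is not the authors' code: the function names (accuracy, f1_score, roc_auc, kfold_indices), the 0/1 label encoding, and the toy data are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Harmonic mean of precision and recall for the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random negative.

    Ties count as half a win. Assumes both classes are present.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def kfold_indices(n: int, k: int) -> List[Tuple[List[int], List[int]]]:
    """Split indices 0..n-1 into k (train, test) pairs; each index is tested once."""
    splits = []
    for fold in range(k):
        test = list(range(fold, n, k))  # every k-th index, offset by the fold number
        held_out = set(test)
        train = [i for i in range(n) if i not in held_out]
        splits.append((train, test))
    return splits


if __name__ == "__main__":
    # Toy data (assumed): 1 = positive review, 0 = negative review.
    y_true = [1, 0, 1, 1, 0, 0, 1, 0]
    scores = [0.9, 0.2, 0.8, 0.4, 0.3, 0.6, 0.7, 0.1]
    y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold
    print(accuracy(y_true, y_pred), f1_score(y_true, y_pred), roc_auc(y_true, scores))
    print(kfold_indices(len(y_true), 4))
```

In practice a library such as scikit-learn would normally be used for these metrics and for stratified or shuffled folds; the interleaved split here is only the simplest deterministic variant.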
Pages: 19