Transfer learning and sentiment analysis of Bahraini dialects sequential text data using multilingual deep learning approach

被引:14
|
作者
Omran, Thuraya M. [1 ]
Sharef, Baraa T. [2 ]
Grosan, Crina [3 ]
Li, Yongmin [1 ]
机构
[1] Brunel Univ London, Dept Comp Sci, Uxbridge, England
[2] Ahlia Univ, Coll Informat Technol, Dept Informat Technol, Manama, Bahrain
[3] Kings Coll London, Div Appl Technol Clin Care, London, England
关键词
Bahraini dialects; Deep learning; Long Short Term Memory; Modern standard Arabic; Sentiment analysis; Transfer learning; Word embedding; IDENTIFICATION;
D O I
10.1016/j.datak.2022.102106
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is a crucial Natural Language Processing task to analyze the user's emotions and opinions towards events, entities, services, or products. Arabic NLP faces numerous challenges, some of which include: (1) the scarcity of resources, especially in modern standard Arabic and Arabic dialects, particularly the Bahraini one; (2) lack of multilingual deep learning models; and (3) insufficient transfer learning studies on Arabic dialects in general and Bahraini dialects specifically. This research aims to create a balanced dataset of Bahraini dialects that covers product reviews by translating English Amazon product reviews to modern standard Arabic, which were then converted to Bahraini dialects. Another aim of this research is to provide a reusable multilingual deep learning long short term memory model to analyze the parallel dataset of English, modern standard Arabic, and Bahraini dialects, which differ in linguistic properties. Many experiments were conducted using train-validate-test split and k-fold cross-validation to evaluate the model performance using accuracy, F1 score, and AUC metrics. The model runs average accuracy on all datasets ranging from 96.72% to 97.04%, 97.91% to 97.93% in F1 score, while in AUC was 98.46% to 98.7% when utilizing an augmentation technique. Moreover, a pre-trained Long Short Term Memory model was created to exploit and transfer the knowledge gained from analyzing the product reviews in Bahraini dialects to perform sentiment analysis on a small dataset of movie comments in the same dialects. The Pre-trained model performance was 96.97% accuracy, 96.65% F1 score, and 97.94% AUC.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Sentiment Analysis using Machine Learning and Deep Learning
    Chandra, Yogesh
    Jana, Antoreep
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM-2020), 2019, : 1 - 4
  • [22] Categorizing Text Data Using Deep Learning: A Novel Approach
    Roul, Rajendra Kumar
    Sahay, Sanjay Kumar
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, 2019, 711 : 793 - 805
  • [23] Prediction of Sentiment Analysis on Educational Data based on Deep Learning Approach
    Sultana, Jabeen
    Sultana, Nasreen
    Yadav, Kusum
    AlFayez, Fayez
    2018 21ST SAUDI COMPUTER SOCIETY NATIONAL COMPUTER CONFERENCE (NCC), 2018,
  • [24] Morphological evaluation and sentiment analysis of Punjabi text using deep learning classification
    Singh, Jaspreet
    Singh, Gurvinder
    Singh, Rajinder
    Singh, Prithvipal
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (05) : 508 - 517
  • [25] A Deep Learning Approach to Sentiment Analysis in Turkish
    Ciftci, Basri
    Apaydin, Mehmet Serkan
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [26] A machine learning approach for urdu text sentiment analysis
    Akhtar, Muhammad
    Shoukat, Rana Saud
    Rehman, Saif Ur
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2023, 42 (02) : 75 - 87
  • [27] Sentiment Analysis of Financial Textual data Using Machine Learning and Deep Learning Models
    Ahmad H.O.
    Umar S.U.
    Informatica (Slovenia), 2023, 47 (05): : 153 - 158
  • [28] Text sentiment analysis based on CBOW model and deep learning in big data environment
    Liu, Bing
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (02) : 451 - 458
  • [29] Text sentiment analysis based on CBOW model and deep learning in big data environment
    Bing Liu
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 451 - 458
  • [30] Weibo Text Sentiment Analysis Based on BERT and Deep Learning
    Li, Hongchan
    Ma, Yu
    Ma, Zishuai
    Zhu, Haodong
    APPLIED SCIENCES-BASEL, 2021, 11 (22):