ArabBert-LSTM: improving Arabic sentiment analysis based on transformer model and Long Short-Term Memory

被引:0
|
作者
Alosaimi, Wael [1 ]
Saleh, Hager [2 ,3 ,4 ]
Hamzah, Ali A. [5 ]
El-Rashidy, Nora [6 ]
Alharb, Abdullah [1 ]
Elaraby, Ahmed [7 ]
Mostafa, Sherif [2 ]
机构
[1] Taif Univ, Coll Comp & Informat Technol, Dept Informat Technol, Taif, Saudi Arabia
[2] South Valley Univ, Fac Comp & Artificial Intelligence, Hurghada, Egypt
[3] Galway Univ, Data Sci Inst, Galway, Ireland
[4] Atlantic Technol Univ, Letterkenny, Ireland
[5] Ahram Canadian Univ, 6th Of October City, Egypt
[6] Kafrelsheiksh Univ, Fac Artificial Intelligence, ML & Informat Retrieval Dept, Kafrelsheiksh, Egypt
[7] South Valley Univ, Fac Comp & Informat, Dept Comp Sci, Qena, Egypt
来源
关键词
sentiment analysis; transformer models; deep learning; machine learning; Arabic sentiment analysis; Long Short-Term Memory;
D O I
10.3389/frai.2024.1408845
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis also referred to as opinion mining, plays a significant role in automating the identification of negative, positive, or neutral sentiments expressed in textual data. The proliferation of social networks, review sites, and blogs has rendered these platforms valuable resources for mining opinions. Sentiment analysis finds applications in various domains and languages, including English and Arabic. However, Arabic presents unique challenges due to its complex morphology characterized by inflectional and derivation patterns. To effectively analyze sentiment in Arabic text, sentiment analysis techniques must account for this intricacy. This paper proposes a model designed using the transformer model and deep learning (DL) techniques. The word embedding is represented by Transformer-based Model for Arabic Language Understanding (ArabBert), and then passed to the AraBERT model. The output of AraBERT is subsequently fed into a Long Short-Term Memory (LSTM) model, followed by feedforward neural networks and an output layer. AraBERT is used to capture rich contextual information and LSTM to enhance sequence modeling and retain long-term dependencies within the text data. We compared the proposed model with machine learning (ML) algorithms and DL algorithms, as well as different vectorization techniques: term frequency-inverse document frequency (TF-IDF), ArabBert, Continuous Bag-of-Words (CBOW), and skipGrams using four Arabic benchmark datasets. Through extensive experimentation and evaluation of Arabic sentiment analysis datasets, we showcase the effectiveness of our approach. The results underscore significant improvements in sentiment analysis accuracy, highlighting the potential of leveraging transformer models for Arabic Sentiment Analysis. The outcomes of this research contribute to advancing Arabic sentiment analysis, enabling more accurate and reliable sentiment analysis in Arabic text. The findings reveal that the proposed framework exhibits exceptional performance in sentiment classification, achieving an impressive accuracy rate of over 97%.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Sentiment Analysis Using Word2vec And Long Short-Term Memory (LSTM) For Indonesian Hotel Reviews
    Muhammad, Putra Fissabil
    Kusumaningrum, Retno
    Wibowo, Adi
    5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 728 - 735
  • [22] Malicious Traffic classification Using Long Short-Term Memory (LSTM) Model
    Thapa, K. Naresh Kumar
    Duraipandian, N.
    WIRELESS PERSONAL COMMUNICATIONS, 2021, 119 (03) : 2707 - 2724
  • [23] Malicious Traffic classification Using Long Short-Term Memory (LSTM) Model
    K. Naresh Kumar Thapa
    N. Duraipandian
    Wireless Personal Communications, 2021, 119 : 2707 - 2724
  • [24] Long Short-Term Memory (LSTM) model for Indian sign language recognition
    Nihalani R.
    Chouhan S.S.
    Mittal D.
    Vadula J.
    Thakur S.
    Chakraborty S.
    Patel R.K.
    Singh U.P.
    Ghosh R.
    Singh P.
    Saxena A.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 11185 - 11203
  • [25] A short-term prediction model of global ionospheric VTEC based on the combination of long short-term memory and convolutional long short-term memory
    Peng Chen
    Rong Wang
    Yibin Yao
    Hao Chen
    Zhihao Wang
    Zhiyuan An
    Journal of Geodesy, 2023, 97
  • [26] Long Short-Term Memory Spatial Transformer Network
    Feng, Shiyang
    Chen, Tianyue
    Sun, Hao
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 239 - 242
  • [27] A short-term prediction model of global ionospheric VTEC based on the combination of long short-term memory and convolutional long short-term memory
    Chen, Peng
    Wang, Rong
    Yao, Yibin
    Chen, Hao
    Wang, Zhihao
    An, Zhiyuan
    JOURNAL OF GEODESY, 2023, 97 (05)
  • [28] Bidirectional Long Short-Term Memory for Sentiment Analysis of Chinese Product Reviews
    Zhang, Kai
    Song, Wei
    Liu, Lizhen
    Zhao, Xinlei
    Du, Chao
    PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, : 665 - 668
  • [29] A method for behaviour recognition based on long short-term memory (LSTM) and splines
    Andersson, Maria
    COUNTERTERRORISM, CRIME FIGHTING, FORENSICS, AND SURVEILLANCE TECHNOLOGIES V, 2021, 11869
  • [30] Metaheuristics based long short term memory optimization for sentiment analysis
    Suddle, Mehtab Kiran
    Bashir, Maryam
    APPLIED SOFT COMPUTING, 2022, 131