Empirical study on imbalanced learning of Arabic sentiment polarity with neural word embedding

被引:7
|
作者
El-Alfy, El-Sayed M. [1 ]
Al-Azani, Sadam [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
关键词
Social network; sentiment analysis; polarity detection; word embedding; machine learning; imbalanced dataset; Arabic tweets; CLASSIFICATION; SMOTE;
D O I
10.3233/JIFS-179703
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the proliferation of social media and mobile technology, huge amount of unstructured data is posted daily online. Consequently, sentiment analysis has gained increasing importance as a tool to understand the opinions of certain groups of people on contemporary political, cultural, social or commercial issues. Unlike western languages, the research on sentiment analysis for dialectical Arabic language is still in its early stages with several challenges to be addressed. The main goal of this study is twofold. First, it compares the performance of core machine learning algorithms for detecting the polarity in imbalanced Arabic tweet datasets using neural word embedding as a feature extractor rather than hand-crafted or traditional features. Second, it examines the impact of using various oversampling techniques to handle the highly-imbalanced nature of the sentiment data. Intensive empirical analysis of nine machine learning methods and six oversampling methods has been conducted and the results have been discussed in terms of a wide range of performance measures.
引用
收藏
页码:6211 / 6222
页数:12
相关论文
共 50 条
  • [1] Using Word Embedding and Ensemble Learning for Highly Imbalanced Data Sentiment Analysis in Short Arabic Text
    Al-Azani, Sadam
    El-Alfy, El-Sayed M.
    8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 : 359 - 366
  • [2] A Comparative Analysis of Word Embedding and Deep Learning for Arabic Sentiment Classification
    Sabbeh, Sahar F.
    Fasihuddin, Heba A.
    ELECTRONICS, 2023, 12 (06)
  • [3] Exploring Word Embedding for Arabic Sentiment Analysis
    Gayed, Sana
    Mallat, Souheyl
    Zrigui, Mounir
    RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, 2022, 1716 : 92 - 101
  • [4] Improving the Polarity of Text through word2vec Embedding for Primary Classical Arabic Sentiment Analysis
    Nour Elhouda Aoumeur
    Zhiyong Li
    Eissa M. Alshari
    Neural Processing Letters, 2023, 55 : 2249 - 2264
  • [5] Improving the Polarity of Text through word2vec Embedding for Primary Classical Arabic Sentiment Analysis
    Aoumeur, Nour Elhouda
    Li, Zhiyong
    Alshari, Eissa M. M.
    NEURAL PROCESSING LETTERS, 2023, 55 (03) : 2249 - 2264
  • [6] Text Sentiment Polarity Classification Method Based on Word Embedding
    Sun, Xiaojie
    Du, Menghao
    Shi, Hua
    Huang, Wenming
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND SYSTEMS (ICACS 2018), 2018, : 99 - 104
  • [7] Empirical Evaluation of Word Representations on Arabic Sentiment Analysis
    Gridach, Mourad
    Haddad, Hatem
    Mulki, Hala
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, 2018, 782 : 147 - 158
  • [8] Probabilistic Neural Network and Word Embedding for Sentiment Analysis
    Alam, Saqib
    Yao, Nianmin
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (07) : 48 - 53
  • [9] Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification
    Tang, Duyu
    Wei, Furu
    Yang, Nan
    Zhou, Ming
    Liu, Ting
    Qin, Bing
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1555 - 1565
  • [10] Hybrid Deep Learning for Sentiment Polarity Determination of Arabic Microblogs
    Al-Azani, Sadam
    El-Alfy, El-Sayed M.
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 491 - 500