Empirical study on imbalanced learning of Arabic sentiment polarity with neural word embedding

被引:7
|
作者
El-Alfy, El-Sayed M. [1 ]
Al-Azani, Sadam [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
关键词
Social network; sentiment analysis; polarity detection; word embedding; machine learning; imbalanced dataset; Arabic tweets; CLASSIFICATION; SMOTE;
D O I
10.3233/JIFS-179703
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the proliferation of social media and mobile technology, huge amount of unstructured data is posted daily online. Consequently, sentiment analysis has gained increasing importance as a tool to understand the opinions of certain groups of people on contemporary political, cultural, social or commercial issues. Unlike western languages, the research on sentiment analysis for dialectical Arabic language is still in its early stages with several challenges to be addressed. The main goal of this study is twofold. First, it compares the performance of core machine learning algorithms for detecting the polarity in imbalanced Arabic tweet datasets using neural word embedding as a feature extractor rather than hand-crafted or traditional features. Second, it examines the impact of using various oversampling techniques to handle the highly-imbalanced nature of the sentiment data. Intensive empirical analysis of nine machine learning methods and six oversampling methods has been conducted and the results have been discussed in terms of a wide range of performance measures.
引用
收藏
页码:6211 / 6222
页数:12
相关论文
共 50 条
  • [41] An Integrated Word Embedding-Based Dual-Task Learning Method for Sentiment Analysis
    Yanping Fu
    Yun Liu
    Sheng-Lung Peng
    Arabian Journal for Science and Engineering, 2020, 45 : 2571 - 2586
  • [42] Software Sentiment Analysis using Deep-learning Approach with Word-Embedding Techniques
    Mula, Venkata Krishna Chandra
    Kumar, Lov
    Murthy, Lalita Bhanu
    Krishna, Aneesh
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 873 - 882
  • [43] An Integrated Word Embedding-Based Dual-Task Learning Method for Sentiment Analysis
    Fu, Yanping
    Liu, Yun
    Peng, Sheng-Lung
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2020, 45 (04) : 2571 - 2586
  • [44] Word2Sent: A new learning sentiment-embedding model with low dimension for sentence level sentiment classification
    Kasri, Mohammed
    Birjali, Marouane
    Beni-Hssane, Abderrahim
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (09):
  • [45] Word Embedding Dropout and Variable-Length Convolution Window in Convolutional Neural Network for Sentiment Classification
    Sun, Shangdi
    Gu, Xiaodong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 40 - 48
  • [46] Enhancing Sentiment Analysis: A Comparative Study of Word Embedding Integration in Mobile Phone Reviews
    Yang, Zichen
    Wang, Hanrui
    Wang, Zhiliang
    Chen, Jiaxin
    2024 IEEE INTERNATIONAL CONFERENCE ON SMART INTERNET OF THINGS, SMARTIOT 2024, 2024, : 540 - 546
  • [47] Opinion Mining and Sentiment Study of Tweets Polarity Using Machine Learning
    Mridula, A.
    Kavitha, C. R.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 621 - 626
  • [48] Thai sentiment analysis with deep learning techniques: A comparative study based on word embedding, POS-tag, and sentic features
    Pasupa, Kitsuchart
    Ayutthaya, Thititorn Seneewong Na
    SUSTAINABLE CITIES AND SOCIETY, 2019, 50
  • [49] Three Convolutional Neural Network-based Models for Learning Sentiment Word Vectors towards Sentiment Analysis
    Lan, Man
    Zhang, Zhihua
    Lu, Yue
    Wu, Ju
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3172 - 3179
  • [50] A new word embedding model integrated with medical knowledge for deep learning-based sentiment classification
    Khine, Aye Hninn
    Wettayaprasit, Wiphada
    Duangsuwan, Jarunee
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 148