Empirical study on imbalanced learning of Arabic sentiment polarity with neural word embedding

Cited by: 7
Authors
El-Alfy, El-Sayed M. [1 ]
Al-Azani, Sadam [1 ]
Affiliations
[1] King Fahd Univ Petr & Minerals, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
Keywords
Social network; sentiment analysis; polarity detection; word embedding; machine learning; imbalanced dataset; Arabic tweets; CLASSIFICATION; SMOTE;
DOI
10.3233/JIFS-179703
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the proliferation of social media and mobile technology, a huge amount of unstructured data is posted online daily. Consequently, sentiment analysis has gained increasing importance as a tool for understanding the opinions of particular groups of people on contemporary political, cultural, social, or commercial issues. In contrast to Western languages, research on sentiment analysis for dialectal Arabic is still in its early stages, with several challenges yet to be addressed. The goal of this study is twofold. First, it compares the performance of core machine learning algorithms for detecting polarity in imbalanced Arabic tweet datasets, using neural word embedding as a feature extractor rather than hand-crafted or traditional features. Second, it examines the impact of various oversampling techniques for handling the highly imbalanced nature of the sentiment data. An intensive empirical analysis of nine machine learning methods and six oversampling methods was conducted, and the results are discussed in terms of a wide range of performance measures.
Pages: 6211-6222
Page count: 12