AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets

Cited by: 84
Authors
Al-Twairesh, Nora [1]
Al-Khalifa, Hend [1]
Al-Salman, AbdulMalik [1]
Al-Ohali, Yousef [1]
Affiliations
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
Source
ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017) | 2017 / Vol. 117
Keywords
Sentiment Analysis; Arabic NLP; Corpus Sentiment Annotation; Arabic tweets; Saudi Dialect; RESOURCES
DOI
10.1016/j.procs.2017.10.094
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Arabic Sentiment Analysis is an active research area these days. However, the Arabic language still lacks sufficient language resources to enable the tasks of sentiment analysis. In this paper, we present the details of collecting and constructing a large dataset of Arabic tweets. The techniques used in cleaning and pre-processing the collected dataset are explained. A corpus of Arabic tweets annotated for sentiment analysis was extracted from this dataset. The corpus consists mainly of tweets written in Modern Standard Arabic and the Saudi dialect. The corpus was manually annotated for sentiment. The annotation process is explained in detail and the challenges during the annotation are highlighted. The corpus contains 17,573 tweets labelled with four labels for sentiment: positive, negative, neutral and mixed. Baseline experiments were conducted to provide benchmark results for future work. © 2017 The Authors. Published by Elsevier B.V.
Pages: 63-72
Page count: 10