Urdu Sentiment Analysis

被引:5
|
作者
Rehman, Iffraah [1 ]
Soomro, Tariq Rahim [1 ]
机构
[1] Inst Business Management IoBM, CCSIS, Karachi, Pakistan
关键词
Machine learning algorithms; sentiment analysis; Tweepy; WEKA; TEXT;
D O I
10.2478/acss-2022-0004
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The world is heading towards more modernized and digitalized data and therefore a significant growth is observed in the active number of social media users with each passing day. Each post and comment can give an insight into valuable information about a certain topic or issue, a product or a brand, etc. Similarly, the process to uncover the underlying information from the opinion that a person keeps about any entity is called a sentiment analysis. The analysis can be carried out through two main approaches, i.e., either lexicon-based or machine learning algorithms. A significant amount of work in the different domains has been done in numerous languages for sentiment analysis, but minimal research has been conducted on the national language of Pakistan, which is Urdu. Twitter users who are familiar with Urdu update the tweets in two different textual formats either in Urdu Script (Nastaleeq) or in Roman Urdu. Thus, the paper is an attempt to perform the sentiment analysis on the Urdu language by extracting the tweets (Nastaleeq and Roman Urdu both) from Twitter using Tweepy APL A machine learning-based approach has been adopted for this study and the tool opted for the purpose is WEKA. The best algorithm was identified based on evaluation metrics, which comprise the number of correctly and incorrectly classified instances, accuracy, precision, and recall. SMO was found to be the most suitable machine learning algorithm for performing the sentiment analysis on Urdu (Nastaleeq) tweets, while the Roman Urdu Random Forest algorithm was identified as the best one.
引用
收藏
页码:30 / 42
页数:13
相关论文
共 50 条
  • [21] Effective lexicon-based approach for Urdu sentiment analysis
    Mukhtar, Neelam
    Khan, Mohammad Abid
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (04) : 2521 - 2548
  • [22] Resource Creation and Evaluation of Aspect Based Sentiment Analysis in Urdu
    Rani, Sadaf
    Anwar, Muhammad Waqas
    AACL-IJCNLP 2020: THE 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2020, : 72 - 77
  • [23] Sentiment analysis with word-based Urdu speech recognition
    Shaik, Riyaz
    Venkatramaphanikumar, S.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (5) : 2511 - 2531
  • [24] Sentiment Analysis for a Resource Poor Language-Roman Urdu
    Mehmood, Khawar
    Essam, Daryl
    Shafi, Kamran
    Malik, Muhammad Kamran
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (01)
  • [25] Urdu Sentiment Analysis Using Supervised Machine Learning Approach
    Mukhtar, Neelam
    Khan, Mohammad Abid
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (02)
  • [26] Discriminative Feature Spamming Technique for Roman Urdu Sentiment Analysis
    Mehmood, Khawar
    Essam, Daryl
    Shafi, Kamran
    Malik, Muhammad Kamran
    IEEE ACCESS, 2019, 7 : 47991 - 48002
  • [27] Opinion within Opinion: Segmentation Approach for Urdu Sentiment Analysis
    Hassan, Muhammad
    Shoaib, Muhammad
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (01) : 21 - 28
  • [28] Sentiment Analysis for Urdu News Tweets Using Decision Tree
    Bibi, Raheela
    Qamar, Usman
    Ansar, Munazza
    Shaheen, Asma
    2019 IEEE/ACIS 17TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS (SERA), 2019, : 66 - 70
  • [29] Sentiment analysis with word-based Urdu speech recognition
    Riyaz Shaik
    S. Venkatramaphanikumar
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 2511 - 2531
  • [30] Identification and handling of intensifiers for enhancing accuracy of Urdu sentiment analysis
    Mukhtar, Neelam
    Khan, Mohammad Abid
    Chiragh, Nadia
    Nazir, Shah
    EXPERT SYSTEMS, 2018, 35 (06)