Urdu Sentiment Analysis

被引:5
|
作者
Rehman, Iffraah [1 ]
Soomro, Tariq Rahim [1 ]
机构
[1] Inst Business Management IoBM, CCSIS, Karachi, Pakistan
关键词
Machine learning algorithms; sentiment analysis; Tweepy; WEKA; TEXT;
D O I
10.2478/acss-2022-0004
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The world is heading towards more modernized and digitalized data and therefore a significant growth is observed in the active number of social media users with each passing day. Each post and comment can give an insight into valuable information about a certain topic or issue, a product or a brand, etc. Similarly, the process to uncover the underlying information from the opinion that a person keeps about any entity is called a sentiment analysis. The analysis can be carried out through two main approaches, i.e., either lexicon-based or machine learning algorithms. A significant amount of work in the different domains has been done in numerous languages for sentiment analysis, but minimal research has been conducted on the national language of Pakistan, which is Urdu. Twitter users who are familiar with Urdu update the tweets in two different textual formats either in Urdu Script (Nastaleeq) or in Roman Urdu. Thus, the paper is an attempt to perform the sentiment analysis on the Urdu language by extracting the tweets (Nastaleeq and Roman Urdu both) from Twitter using Tweepy APL A machine learning-based approach has been adopted for this study and the tool opted for the purpose is WEKA. The best algorithm was identified based on evaluation metrics, which comprise the number of correctly and incorrectly classified instances, accuracy, precision, and recall. SMO was found to be the most suitable machine learning algorithm for performing the sentiment analysis on Urdu (Nastaleeq) tweets, while the Roman Urdu Random Forest algorithm was identified as the best one.
引用
收藏
页码:30 / 42
页数:13
相关论文
共 50 条
  • [31] Improving Hate Speech Detection of Urdu Tweets Using Sentiment Analysis
    Ali, Muhammad Z.
    Ehsan-Ul-Haq
    Rauf, Sahar
    Javed, Kashif
    Hussain, Sarmad
    IEEE ACCESS, 2021, 9 : 84296 - 84305
  • [32] Sentiment analysis for Urdu online reviews using deep learning models
    Safder, Iqra
    Mehmood, Zainab
    Sarwar, Raheem
    Hassan, Saeed-Ul
    Zaman, Farooq
    Nawab, Rao Muhammad Adeel
    Bukhari, Faisal
    Abbasi, Rabeeh Ayaz
    Alelyani, Salem
    Aljohani, Naif Radi
    Nawaz, Raheel
    EXPERT SYSTEMS, 2021, 38 (08)
  • [33] Sentiment Analysis of Urdu Language: Handling Phrase-Level Negation
    Syed, Afraz Zahra
    Aslam, Muhammad
    Maria Martinez-Enriquez, Ana
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PT I, 2011, 7094 : 382 - +
  • [34] Sentiment Analysis of Reviews in Natural Language: Roman Urdu as a Case Study
    Qureshi, Muhammad Aasim
    Asif, Muhammad
    Hassan, Mohd Fadzil
    Abid, Adnan
    Kamal, Asad
    Safdar, Sohail
    Akber, Rehan
    IEEE ACCESS, 2022, 10 : 24945 - 24954
  • [35] Roman Urdu Slang Dictionary Development for Facebook Comment Sentiment Analysis
    1600, Institute of Electrical and Electronics Engineers Inc.
  • [36] Associating targets with SentiUnits: a step forward in sentiment analysis of Urdu text
    Syed, Afraz Z.
    Aslam, Muhammad
    Maria Martinez-Enriquez, Ana
    ARTIFICIAL INTELLIGENCE REVIEW, 2014, 41 (04) : 535 - 561
  • [37] UTSA: Urdu Text Sentiment Analysis Using Deep Learning Methods
    Naqvi, Uzma
    Majid, Abdul
    Abbas, Syed Ali
    IEEE ACCESS, 2021, 9 : 114085 - 114094
  • [38] Associating targets with SentiUnits: a step forward in sentiment analysis of Urdu text
    Afraz Z. Syed
    Muhammad Aslam
    Ana Maria Martinez-Enriquez
    Artificial Intelligence Review, 2014, 41 : 535 - 561
  • [39] UTSA: Urdu Text Sentiment Analysis Using Deep Learning Methods
    Naqvi, Uzma
    Majid, Abdul
    Abbas, Syed Ali
    IEEE Access, 2021, 9 : 114085 - 114094
  • [40] Deep Learning-Based Sentiment Analysis for Roman Urdu Text
    Ghulam, Hussain
    Zeng, Feng
    Li, Wenjia
    Xiao, Yutong
    2018 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS, 2019, 147 : 131 - 135