Urdu Sentiment Analysis

Cited by: 5

Authors:

Rehman, Iffraah [1]

Soomro, Tariq Rahim [1]

Affiliations:

[1] Inst Business Management IoBM, CCSIS, Karachi, Pakistan

Source:

APPLIED COMPUTER SYSTEMS | 2022 / Vol. 27 / No. 01

Keywords:

Machine learning algorithms; sentiment analysis; Tweepy; WEKA; TEXT;

DOI:

10.2478/acss-2022-0004

Chinese Library Classification (CLC) number:

TP301 [Theory and Methods];

Subject classification code:

081202 ;

Abstract:

The world is heading towards more modernized and digitalized data, and a significant growth is therefore observed in the number of active social media users with each passing day. Each post and comment can give valuable insight into a certain topic or issue, a product or a brand, etc. The process of uncovering the underlying information from the opinion that a person holds about any entity is called sentiment analysis. The analysis can be carried out through two main approaches, i.e., either lexicon-based or machine learning algorithms. A significant amount of work on sentiment analysis has been done in different domains and in numerous languages, but minimal research has been conducted on the national language of Pakistan, which is Urdu. Twitter users who are familiar with Urdu post tweets in two different textual formats, either in Urdu script (Nastaleeq) or in Roman Urdu. Thus, the paper is an attempt to perform sentiment analysis on the Urdu language by extracting tweets (both Nastaleeq and Roman Urdu) from Twitter using the Tweepy API. A machine learning-based approach has been adopted for this study, and the tool chosen for the purpose is WEKA. The best algorithm was identified based on evaluation metrics, which comprise the number of correctly and incorrectly classified instances, accuracy, precision, and recall. SMO was found to be the most suitable machine learning algorithm for performing sentiment analysis on Urdu (Nastaleeq) tweets, while for Roman Urdu the Random Forest algorithm was identified as the best one.
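The evaluation metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' WEKA workflow: the function name `evaluate`, the label strings, and the sample predictions are all hypothetical, and the sketch scores a single class as the positive label, whereas a tool such as WEKA also reports figures for every class.

```python
def evaluate(y_true, y_pred, positive="positive"):
    """Count correct/incorrect instances and compute accuracy,
    precision, and recall, treating `positive` as the target class."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    total = len(pairs)
    correct = sum(t == p for t, p in pairs)

    # Confusion counts for the chosen positive class.
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)

    return {
        "correct": correct,
        "incorrect": total - correct,
        "accuracy": correct / total if total else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Hypothetical gold labels and classifier outputs for five tweets.
y_true = ["positive", "negative", "positive", "negative", "positive"]
y_pred = ["positive", "positive", "positive", "negative", "negative"]

metrics = evaluate(y_true, y_pred)
print(metrics)
```

Comparing two classifiers on the same labelled tweets then amounts to calling `evaluate` once per classifier and comparing the resulting dictionaries.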

Pages: 30-42

Number of pages: 13

50 entries in total

[41] Contextually Enriched Meta-Learning Ensemble Model for Urdu Sentiment Analysis
Ahmed, Kanwal
Nadeem, Muhammad Imran
Li, Dun
Zheng, Zhiyun
Al-Kahtani, Nouf
Alkahtani, Hend Khalid
Mostafa, Samih M.
Mamyrbayev, Orken
SYMMETRY-BASEL, 2023, 15 (03):
[42] Sentiment Analysis Based on Urdu Reviews Using Hybrid Deep Learning Models
Singh, Neha
Jaiswal, Umesh Chandra
APPLIED COMPUTER SYSTEMS, 2023, 28 (02) : 258 - 265
[43] An Intelligent Unsupervised Approach for Handling Context-Dependent Words in Urdu Sentiment Analysis
Mukhtar, Neelam
Khan, Mohammad Abid
Chiragh, Nadia
Nazir, Shah
Jan, Asim Ullah
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
[44] Aspect-based sentiment analysis in Urdu language: resource creation and evaluation
Altaf, Amna
Anwar, Muhammad Waqas
Jamal, Muhammad Hasan
Bajwa, Usama Ijaz
Rani, Sadaf
Neural Computing and Applications, 2024, 36 (34) : 21365 - 21381
[45] Effective Use of Evaluation Measures for the Validation of Best Classifier in Urdu Sentiment Analysis
Neelam Mukhtar
Mohammad Abid Khan
Nadia Chiragh
Cognitive Computation, 2017, 9 : 446 - 456
[46] A dataset of Roman Urdu text with spelling variations for sentence level sentiment analysis
Soomro, Mudasar Ahmed
Memon, Rafia Naz
Chandio, Asghar Ali
Leghari, Mehwish
Soomro, Muhammad Hanif
DATA IN BRIEF, 2024, 57
[47] Multi-class sentiment analysis of urdu text using multilingual BERT
Lal Khan
Ammar Amjad
Noman Ashraf
Hsien-Tsung Chang
Scientific Reports, 12
[48] Lexical Variation and Sentiment Analysis of Roman Urdu Sentences with Deep Neural Networks
Manzoor, Muhammad Arslan
Mamoon, Saqib
Tao, Song Kei
Zakir, Ali
Adil, Muhammad
Lu, Jianfeng
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (02) : 719 - 726
[49] Effective Use of Evaluation Measures for the Validation of Best Classifier in Urdu Sentiment Analysis
Mukhtar, Neelam
Khan, Mohammad Abid
Chiragh, Nadia
COGNITIVE COMPUTATION, 2017, 9 (04) : 446 - 456
[50] Roman Urdu Sentiment Analysis Using Pre-trained DistilBERT and XLNet
Azhar, Nikhar
Latif, Seemab
2022 FIFTH INTERNATIONAL CONFERENCE OF WOMEN IN DATA SCIENCE AT PRINCE SULTAN UNIVERSITY (WIDS-PSU 2022), 2022, : 75 - 78
