A Review of Urdu Sentiment Analysis with Multilingual Perspective: A Case of Urdu and Roman Urdu Language

被引:19
|
作者
Khan, Ihsan Ullah [1 ]
Khan, Aurangzeb [1 ,2 ]
Khan, Wahab [1 ]
Su'ud, Mazliham Mohd [2 ]
Alam, Muhammad Mansoor [3 ]
Subhan, Fazli [2 ,4 ]
Asghar, Muhammad Zubair [5 ]
机构
[1] Univ Sci & Technol, Dept Comp Sci, Bannu 28100, Pakistan
[2] Multimedia Univ, Fac Comp & Informat, Kuala Lumpur 50050, Malaysia
[3] Riphah Int Univ, Rawalpindi 74400, Pakistan
[4] Natl Univ Modern Languages NUML, Fac Engn & Comp Sci, Islamabad 44000, Pakistan
[5] Gomal Univ, Inst Comp & Informat Technol, Dera Ismail Khan 29050, Pakistan
关键词
preprocessing; feature extraction; classification;
D O I
10.3390/computers11010003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Research efforts in the field of sentiment analysis have exponentially increased in the last few years due to its applicability in areas such as online product purchasing, marketing, and reputation management. Social media and online shopping sites have become a rich source of user-generated data. Manufacturing, sales, and marketing organizations are progressively turning their eyes to this source to get worldwide feedback on their activities and products. Millions of sentences in Urdu and Roman Urdu are posted daily on social sites, such as Facebook, Instagram, Snapchat, and Twitter. Disregarding people's opinions in Urdu and Roman Urdu and considering only resource-rich English language leads to the vital loss of this vast amount of data. Our research focused on collecting research papers related to Urdu and Roman Urdu language and analyzing them in terms of preprocessing, feature extraction, and classification techniques. This paper contains a comprehensive study of research conducted on Roman Urdu and Urdu text for a product review. This study is divided into categories, such as collection of relevant corpora, data preprocessing, feature extraction, classification platforms and approaches, limitations, and future work. The comparison was made based on evaluating different research factors, such as corpus, lexicon, and opinions. Each reviewed paper was evaluated according to some provided benchmarks and categorized accordingly. Based on results obtained and the comparisons made, we suggested some helpful steps in a future study.
引用
收藏
页数:29
相关论文
共 50 条
  • [41] Aspect-based sentiment analysis in Urdu language: resource creation and evaluation
    Altaf, Amna
    Anwar, Muhammad Waqas
    Jamal, Muhammad Hasan
    Bajwa, Usama Ijaz
    Rani, Sadaf
    Neural Computing and Applications, 2024, 36 (34) : 21365 - 21381
  • [42] RUTUT: Roman Urdu to Urdu Translator Based on Character Substitution Rules and Unicode Mapping
    Shahroz, Mobeen
    Mushtaq, Muhammad Faheem
    Mehmood, Arif
    Ullah, Saleem
    Choi, Gyu Sang
    IEEE ACCESS, 2020, 8 : 189823 - 189841
  • [43] Sentiment Analysis on Urdu Tweets Using Markov Chains
    Nasim Z.
    Ghani S.
    SN Computer Science, 2020, 1 (5)
  • [44] A machine learning approach for urdu text sentiment analysis
    Akhtar, Muhammad
    Shoukat, Rana Saud
    Rehman, Saif Ur
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2023, 42 (02) : 75 - 87
  • [45] Medical assistant chatbot Urdu text sentiment analysis
    Syeda Haneen Ashfaq
    Muhammad Ameen Chhajro
    Shahbaz Khan
    Asif Ali Laghari
    Human-Intelligent Systems Integration, 2024, 6 (1) : 131 - 144
  • [46] Urdu Receptive Language Scale (URLS): Modification & development of protocol for administration in Urdu
    Butt, Ghazal Awais
    Mumtaz, Nazia
    Saqulain, Ghulam
    PAKISTAN JOURNAL OF MEDICAL SCIENCES, 2024, 40 (05) : 884 - 890
  • [47] A Precisely Xtreme-Multi Channel Hybrid Approach for Roman Urdu Sentiment Analysis
    Mehmood, Faiza
    Ghani, Muhammad Usman
    Ibrahim, Muhammad Ali
    Shahzadi, Rehab
    Mahmood, Waqar
    Asim, Muhammad Nabeel
    IEEE ACCESS, 2020, 8 : 192740 - 192759
  • [48] Sentiment Analysis of Roman Urdu on E-Commerce Reviews Using Machine Learning
    Chandio, Bilal
    Shaikh, Asadullah
    Bakhtyar, Maheen
    Alrizq, Mesfer
    Baber, Junaid
    Sulaiman, Adel
    Rajab, Adel
    Noor, Waheed
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2022, 131 (03): : 1263 - 1287
  • [49] Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu
    Chandio, Bilal Ahmed
    Imran, Ali Shariq
    Bakhtyar, Maheen
    Daudpota, Sher Muhammad
    Baber, Junaid
    APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [50] Complex Network of Urdu Language
    Khan, Nuzhat
    Bakht, Muhammad Paend
    Khan, Muhammad Junaid
    Samad, Abdul
    2019 13TH INTERNATIONAL CONFERENCE ON MATHEMATICS, ACTUARIAL SCIENCE, COMPUTER SCIENCE AND STATISTICS (MACS-13), 2019,