Opinion spam detection framework using hybrid classification scheme

Cited by: 1
Authors
Muhammad Zubair Asghar
Asmat Ullah
Shakeel Ahmad
Aurangzeb Khan
Affiliations
[1] Gomal University, Institute of Computing and Information Technology
[2] King Abdul Aziz University (KAU), Faculty of Computing and Information Technology at Rabigh (FCITR)
[3] University of Science and Technology, Department of Computer Science
Source
Soft Computing | 2020 / Vol. 24
Keywords
Opinion spam; Spammer; Spam detection; Fake reviews
DOI
Not available
Abstract
With the advent of social networking sites, opinion-mining applications have attracted the interest of online users who consult review sites to inform their purchase decisions. However, due to the increasing trend of posting spam (fake) reviews to promote target products or defame competitors' brands, Opinion Spam detection and classification has emerged as a hot issue in the opinion mining and sentiment analysis community. We investigate Opinion Spam detection using different combinations of entities, features, and their sentiment scores. We enrich the feature set of a baseline Spam detection method with Spam detection features (Opinion Spam, Opinion Spammer, Item Spam). Using a dataset of reviews from the Amazon site and sentences labeled for Spam detection, we evaluate the role of spamicity-related features in detecting and classifying spam (fake) clues and distinguishing them from genuine reviews. For this purpose, we introduce a rule-based feature weighting scheme and propose a method for tagging each review sentence as spam or non-spam. Experimental results show that spam-related features improve Spam detection in review sentences posted on product review sites. Adding a revised feature weighting scheme increased accuracy from 93% to 96%. Furthermore, a hybrid set of features is shown to improve the performance of Opinion Spam detection in terms of precision, recall, and F-measure. This work shows that combining spam-related features with a rule-based weighting scheme can improve the performance of even a baseline Spam detection method. This improvement can be of use to Opinion Spam detection systems, given the growing interest of individuals and companies in separating fake (spam) from genuine (non-spam) reviews about products.
The outcome of this work provides insight into spam-related features and feature weighting and will assist in developing more advanced applications for Opinion Spam detection. Previous state-of-the-art studies in Opinion Spam detection used fewer spamicity-related features and less efficient feature weighting schemes. In contrast, we provide a revised feature selection and a revised feature weighting scheme with a normalized spamicity score computation technique. Our contribution is therefore novel to the field because it provides a significant improvement over the compared methods.
Pages: 3475-3498
Page count: 23