Opinion spam detection framework using hybrid classification scheme

被引:1
|
作者
Muhammad Zubair Asghar
Asmat Ullah
Shakeel Ahmad
Aurangzeb Khan
机构
[1] Gomal University,Institute of Computing and Information Technology
[2] King Abdul Aziz University (KAU),Faculty of Computing and Information Technology at Rabigh (FCITR)
[3] University of Science and Technology,Department of Computer Science
来源
Soft Computing | 2020年 / 24卷
关键词
Opinion spam; Spammer; Spam detection; Fake reviews;
D O I
暂无
中图分类号
学科分类号
摘要
With the advent of social networking sites, opinion-mining applications have attracted the interest of the online community on review sites to know about products for their purchase decisions. However, due to increasing trend of posting spam (fake) reviews to promote the target products or defame the specific brands of competitors, Opinion Spam detection and classification has emerged as a hot issue in the community of opinion mining and sentiment analysis. We investigate the issue of Opinion Spam detection by using different combinations of entities, features, and their sentiment scores. We enrich the feature set of a baseline Spam detection method with Spam detection features (Opinion Spam, Opinion Spammer, Item Spam). Using a dataset of reviews from the Amazon site and sentences labeled for Spam detection, we evaluate the role of spamicity-related features in detecting and classifying spam (fake) clues and distinguishing them from genuine reviews. For this purpose, we introduce a rule-based feature weighting scheme and propose a method for tagging the review sentence as spam and non-spam. Experiments results depict that spam-related features improve Spam detection in review sentences posted on product review sites. Adding a revised feature weighting scheme achieved an accuracy increase from 93 to 96%. Furthermore, a hybrid set of features are shown to improve the performance of Opinion Spam detection in terms of better precision, recall, and F-measure values. This work shows that combining spam-related features with rule-based weighting scheme can improve the performance of even baseline Spam detection method. This improvement can be of use to Opinion Spam detection systems, due to the growing interest of individuals and companies in isolating fake (spam) and genuine (non-spam) reviews about products. The outcome of this work will provide an insight into spam-related features and feature weighting and will assist in developing more advanced applications for Opinion Spam detection. In the field of Opinion Spam detection, previous state-of-the-art studies used less number of spamicity-related features and less efficient feature weighting scheme. However, we provided a revised feature selection and a revised feature weighting scheme with normalized spamicity score computation technique. Therefore, our contribution is novel to the field because it provides a significant improvement over the comparing methods.
引用
收藏
页码:3475 / 3498
页数:23
相关论文
共 50 条
  • [31] Detection of Opinion Spam with Character n-grams
    Hernandez Fusilier, Donato
    Montes-y-Gomez, Manuel
    Rosso, Paolo
    Guzman Cabrera, Rafael
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT II, 2015, 9042 : 285 - 294
  • [32] Deceptive opinion spam detection approaches: a literature survey
    Maurya, Sushil Kumar
    Singh, Dinesh
    Maurya, Ashish Kumar
    APPLIED INTELLIGENCE, 2023, 53 (02) : 2189 - 2234
  • [33] Opinion Spam Detection Based on Heterogeneous Information Network
    Sun, Yingcheng
    Loparo, Kenneth
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1156 - 1163
  • [34] Learning Document Representation for Deceptive Opinion Spam Detection
    Li, Luyang
    Ren, Wenjing
    Qin, Bing
    Liu, Ting
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 393 - 404
  • [35] A comprehensive survey of various methods in opinion spam detection
    Mewada, Arvind
    Dewang, Rupesh Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 13199 - 13239
  • [36] An effective detection and classification of road damages using hybrid deep learning framework
    D. Deepa
    A. Sivasangari
    Multimedia Tools and Applications, 2023, 82 : 18151 - 18184
  • [37] An effective detection and classification of road damages using hybrid deep learning framework
    Deepa, D.
    Sivasangari, A.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) : 18151 - 18184
  • [38] Opinion spam detection: Using multi-iterative graph-based model
    Noekhah, Shirin
    Salim, Naomie Binti
    Zakaria, Nor Hawaniah
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (01)
  • [39] A Hybrid Approach for Spam Detection for Twitter
    Mateen, Malik
    Aleem, Muhammad
    Iqbal, Muhammad Azhar
    Islam, Muhammad Arshad
    PROCEEDINGS OF 2017 14TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2017, : 466 - 471
  • [40] Web Spam Detection Using Map Reduce Approach to Collective Classification
    Indyk, Wojciech
    Kajdanowicz, Tomasz
    Kazienko, Przemyslaw
    Plamowski, Slawomir
    INTERNATIONAL JOINT CONFERENCE CISIS'12 - ICEUTE'12 - SOCO'12 SPECIAL SESSIONS, 2013, 189 : 197 - +