Opinion spam detection framework using hybrid classification scheme

被引：1

作者：

Muhammad Zubair Asghar

Asmat Ullah

Shakeel Ahmad

Aurangzeb Khan

机构：

[1] Gomal University,Institute of Computing and Information Technology

[2] King Abdul Aziz University (KAU),Faculty of Computing and Information Technology at Rabigh (FCITR)

[3] University of Science and Technology,Department of Computer Science

来源：

Soft Computing | 2020年 / 24卷

关键词：

Opinion spam; Spammer; Spam detection; Fake reviews;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

With the advent of social networking sites, opinion-mining applications have attracted the interest of the online community on review sites to know about products for their purchase decisions. However, due to increasing trend of posting spam (fake) reviews to promote the target products or defame the specific brands of competitors, Opinion Spam detection and classification has emerged as a hot issue in the community of opinion mining and sentiment analysis. We investigate the issue of Opinion Spam detection by using different combinations of entities, features, and their sentiment scores. We enrich the feature set of a baseline Spam detection method with Spam detection features (Opinion Spam, Opinion Spammer, Item Spam). Using a dataset of reviews from the Amazon site and sentences labeled for Spam detection, we evaluate the role of spamicity-related features in detecting and classifying spam (fake) clues and distinguishing them from genuine reviews. For this purpose, we introduce a rule-based feature weighting scheme and propose a method for tagging the review sentence as spam and non-spam. Experiments results depict that spam-related features improve Spam detection in review sentences posted on product review sites. Adding a revised feature weighting scheme achieved an accuracy increase from 93 to 96%. Furthermore, a hybrid set of features are shown to improve the performance of Opinion Spam detection in terms of better precision, recall, and F-measure values. This work shows that combining spam-related features with rule-based weighting scheme can improve the performance of even baseline Spam detection method. This improvement can be of use to Opinion Spam detection systems, due to the growing interest of individuals and companies in isolating fake (spam) and genuine (non-spam) reviews about products. The outcome of this work will provide an insight into spam-related features and feature weighting and will assist in developing more advanced applications for Opinion Spam detection. In the field of Opinion Spam detection, previous state-of-the-art studies used less number of spamicity-related features and less efficient feature weighting scheme. However, we provided a revised feature selection and a revised feature weighting scheme with normalized spamicity score computation technique. Therefore, our contribution is novel to the field because it provides a significant improvement over the comparing methods.

引用

页码：3475 / 3498

页数：23

共 50 条

[31] Detection of Opinion Spam with Character n-grams
Hernandez Fusilier, Donato
Montes-y-Gomez, Manuel
Rosso, Paolo
Guzman Cabrera, Rafael
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT II, 2015, 9042 : 285 - 294
[32] Deceptive opinion spam detection approaches: a literature survey
Maurya, Sushil Kumar
Singh, Dinesh
Maurya, Ashish Kumar
APPLIED INTELLIGENCE, 2023, 53 (02) : 2189 - 2234
[33] Opinion Spam Detection Based on Heterogeneous Information Network
Sun, Yingcheng
Loparo, Kenneth
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1156 - 1163
[34] Learning Document Representation for Deceptive Opinion Spam Detection
Li, Luyang
Ren, Wenjing
Qin, Bing
Liu, Ting
CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 393 - 404
[35] A comprehensive survey of various methods in opinion spam detection
Mewada, Arvind
Dewang, Rupesh Kumar
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 13199 - 13239
[36] An effective detection and classification of road damages using hybrid deep learning framework
D. Deepa
A. Sivasangari
Multimedia Tools and Applications, 2023, 82 : 18151 - 18184
[37] An effective detection and classification of road damages using hybrid deep learning framework
Deepa, D.
Sivasangari, A.
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) : 18151 - 18184
[38] Opinion spam detection: Using multi-iterative graph-based model
Noekhah, Shirin
Salim, Naomie Binti
Zakaria, Nor Hawaniah
INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (01)
[39] A Hybrid Approach for Spam Detection for Twitter
Mateen, Malik
Aleem, Muhammad
Iqbal, Muhammad Azhar
Islam, Muhammad Arshad
PROCEEDINGS OF 2017 14TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2017, : 466 - 471
[40] Web Spam Detection Using Map Reduce Approach to Collective Classification
Indyk, Wojciech
Kajdanowicz, Tomasz
Kazienko, Przemyslaw
Plamowski, Slawomir
INTERNATIONAL JOINT CONFERENCE CISIS'12 - ICEUTE'12 - SOCO'12 SPECIAL SESSIONS, 2013, 189 : 197 - +

← 1 2 3 4 5 →