Understanding Large-Scale Network Effects in Detecting Review Spammers

被引:4
|
作者
Rout, Jitendra Kumar [1 ]
Sahoo, Kshira Sagar [2 ]
Dalmia, Anmol
Bakshi, Sambit [3 ]
Bilal, Muhammad [4 ]
Song, Houbing [5 ]
机构
[1] Natl Inst Technol Raipur, Dept Comp Sci & Engn, Raipur 492010, India
[2] Umea Univ, Dept Comp Sci, SE-90187 Umea, Sweden
[3] Natl Inst Technol Rourkela, Dept Comp Sci & Engn, Rourkela 769008, Odisha, India
[4] Hankuk Univ Foreign Studies, Dept Comp Engn, Yongin 17035, South Korea
[5] Univ Maryland Baltimore Cty UMBC, Dept Informat Syst, Baltimore, MD 21250 USA
关键词
Feature extraction; Behavioral sciences; Analytical models; Writing; Unsolicited e-mail; Sentiment analysis; Scalability; Online review spam; opinion spam; review graphs; spam detection; unlabeled review; FRAMEWORK;
D O I
10.1109/TCSS.2023.3243139
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Opinion spam detection is a challenge for online review systems and social forum operators. Opinion spamming costs businesses and people money since it deceives customers as well as automated opinion mining and sentiment analysis systems by bestowing undeserved positive opinions on target firms and/or bestowing fake negative opinions on others. One popular detection approach is to model a review system as a network of users, products, and reviews, for example using review graph models. In this article, we study the effects of network scale on network-based review spammer detection models, specifically on the trust model and the SpammerRank model. We then evaluate both network models using two large publicly available review datasets, namely: the Amazon dataset (containing 6 million reviews by more than 2 million reviewers) and the UCSD dataset (containing over 82 million reviews by 21 million reviewers). It has been observed thatSpammerRank model provides a better scaling time for applications requiring reviewer indicators and in case of trust model distributions are flattening out indicating variance of reviews with respect to spamming. Detailed observations on the scaling effects of these models are reported in the result section.
引用
收藏
页码:4994 / 5004
页数:11
相关论文
共 50 条
  • [41] Understanding Coarsening for Embedding Large-Scale Graphs
    Akyildiz, Taha Atahan
    Aljundi, Amro Alabsi
    Kaya, Kamer
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2937 - 2946
  • [42] Understanding the Context of Large-Scale IT Project Failures
    Rich, Eliot
    Nelson, Mark R.
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH, 2012, 5 (02) : 1 - 24
  • [43] Large-scale quantum effects
    Crane, Leah
    NEW SCIENTIST, 2022, 246 (3435) : 10 - 10
  • [44] Understanding Large-Scale Dynamic Purchase Behavior
    Jacobs, Bruno
    Fok, Dennis
    Donkers, Bas
    MARKETING SCIENCE, 2021, 40 (05) : 844 - 870
  • [45] Nazca: Detecting Malware Distribution in Large-Scale Networks
    Invernizzi, Luca
    Miskovic, Stanislav
    Torres, Ruben
    Saha, Sabyasachi
    Lee, Sung-Ju
    Mellia, Marco
    Kruegel, Christopher
    Vigna, Giovanni
    21ST ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2014), 2014,
  • [46] Detecting fabrication in large-scale molecular omics data
    Bradshaw, Michael S.
    Payne, Samuel H.
    PLOS ONE, 2021, 16 (11):
  • [47] Detecting Suspects by Large-Scale Trajectory Patterns in the City
    Jin, Cang-Hong
    Chen, Dong-Kai
    Zhu, Fan-Wei
    Wu, Ming-Hui
    MOBILE INFORMATION SYSTEMS, 2019, 2019
  • [48] A method for detecting large-scale environmental change with lichens
    Insarov, G
    MONITORING WITH LICHENS - MONITORING LICHENS, 2002, 7 : 399 - 403
  • [49] UNVEIL: A Large-Scale, Automated Approach to Detecting Ransomware
    Kharraz, Amin
    Arshad, Sajjad
    Mulliner, Collin
    Robertson, William
    Kirda, Engin
    PROCEEDINGS OF THE 25TH USENIX SECURITY SYMPOSIUM, 2016, : 757 - 772
  • [50] Technologies for Promptly Understanding Network Conditions When Large-scale System Failure Occurs
    Akashi K.
    Kanai S.
    NTT Technical Review, 2023, 21 (12): : 24 - 26