Understanding Large-Scale Network Effects in Detecting Review Spammers

被引:4
|
作者
Rout, Jitendra Kumar [1 ]
Sahoo, Kshira Sagar [2 ]
Dalmia, Anmol
Bakshi, Sambit [3 ]
Bilal, Muhammad [4 ]
Song, Houbing [5 ]
机构
[1] Natl Inst Technol Raipur, Dept Comp Sci & Engn, Raipur 492010, India
[2] Umea Univ, Dept Comp Sci, SE-90187 Umea, Sweden
[3] Natl Inst Technol Rourkela, Dept Comp Sci & Engn, Rourkela 769008, Odisha, India
[4] Hankuk Univ Foreign Studies, Dept Comp Engn, Yongin 17035, South Korea
[5] Univ Maryland Baltimore Cty UMBC, Dept Informat Syst, Baltimore, MD 21250 USA
关键词
Feature extraction; Behavioral sciences; Analytical models; Writing; Unsolicited e-mail; Sentiment analysis; Scalability; Online review spam; opinion spam; review graphs; spam detection; unlabeled review; FRAMEWORK;
D O I
10.1109/TCSS.2023.3243139
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Opinion spam detection is a challenge for online review systems and social forum operators. Opinion spamming costs businesses and people money since it deceives customers as well as automated opinion mining and sentiment analysis systems by bestowing undeserved positive opinions on target firms and/or bestowing fake negative opinions on others. One popular detection approach is to model a review system as a network of users, products, and reviews, for example using review graph models. In this article, we study the effects of network scale on network-based review spammer detection models, specifically on the trust model and the SpammerRank model. We then evaluate both network models using two large publicly available review datasets, namely: the Amazon dataset (containing 6 million reviews by more than 2 million reviewers) and the UCSD dataset (containing over 82 million reviews by 21 million reviewers). It has been observed thatSpammerRank model provides a better scaling time for applications requiring reviewer indicators and in case of trust model distributions are flattening out indicating variance of reviews with respect to spamming. Detailed observations on the scaling effects of these models are reported in the result section.
引用
收藏
页码:4994 / 5004
页数:11
相关论文
共 50 条
  • [21] Detecting Large-Scale Collusion in Procurement Auctions
    Kawai, Kei
    Nakabayashi, Jun
    JOURNAL OF POLITICAL ECONOMY, 2022, : 1364 - 1411
  • [22] Detecting fraud transactions in large-scale databases
    Pabarskaite, Zidrina
    Long, James Allen
    Proceedings of the ISAT International Scientific School, 2000, : 223 - 231
  • [23] Detecting Product Review Spammers using Activity Model
    Jiang, Bo
    Cao, Renhao
    Chen, Bi
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ELECTRONICS INFORMATION (ICACSEI 2013), 2013, 41 : 650 - 653
  • [24] Detecting collusive spammers with heterogeneous graph attention network
    Zhang, Fuzhi
    Wu, Jiayi
    Zhang, Peng
    Ma, Ru
    Yu, Hongtao
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [25] Detecting spammers on social network through clustering technique
    Jose, Teena
    Babu, Suvanam Sasidhar
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019,
  • [26] Measuring, understanding and projecting the effects of large-scale climatic variability on mammals
    Berteaux, Dominique
    Stenseth, Nils Chr.
    CLIMATE RESEARCH, 2006, 32 (02) : 95 - 97
  • [27] Understanding the Impact of Network Infrastructure Changes Using Large-Scale Measurement Platforms
    Bajpai, Vaibhav
    Schoenwaelder, Juergen
    EMERGING MANAGEMENT MECHANISMS FOR THE FUTURE INTERNET (AIMS 2013), 2013, 7943 : 41 - 44
  • [28] Detecting Singleton Review Spammers Using Semantic Similarity
    Sandulescu, Vlad
    Ester, Martin
    WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 971 - 976
  • [29] Large-scale network simulations with GTNets
    Riley, GR
    PROCEEDINGS OF THE 2003 WINTER SIMULATION CONFERENCE, VOLS 1 AND 2, 2003, : 676 - 684
  • [30] AN ANALYTICAL LARGE-SCALE NETWORK MODEL
    VIDOMENKO, VP
    AVTOMATIKA I VYCHISLITELNAYA TEKHNIKA, 1987, (04): : 36 - 41