Understanding Large-Scale Network Effects in Detecting Review Spammers

被引:4
|
作者
Rout, Jitendra Kumar [1 ]
Sahoo, Kshira Sagar [2 ]
Dalmia, Anmol
Bakshi, Sambit [3 ]
Bilal, Muhammad [4 ]
Song, Houbing [5 ]
机构
[1] Natl Inst Technol Raipur, Dept Comp Sci & Engn, Raipur 492010, India
[2] Umea Univ, Dept Comp Sci, SE-90187 Umea, Sweden
[3] Natl Inst Technol Rourkela, Dept Comp Sci & Engn, Rourkela 769008, Odisha, India
[4] Hankuk Univ Foreign Studies, Dept Comp Engn, Yongin 17035, South Korea
[5] Univ Maryland Baltimore Cty UMBC, Dept Informat Syst, Baltimore, MD 21250 USA
关键词
Feature extraction; Behavioral sciences; Analytical models; Writing; Unsolicited e-mail; Sentiment analysis; Scalability; Online review spam; opinion spam; review graphs; spam detection; unlabeled review; FRAMEWORK;
D O I
10.1109/TCSS.2023.3243139
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Opinion spam detection is a challenge for online review systems and social forum operators. Opinion spamming costs businesses and people money since it deceives customers as well as automated opinion mining and sentiment analysis systems by bestowing undeserved positive opinions on target firms and/or bestowing fake negative opinions on others. One popular detection approach is to model a review system as a network of users, products, and reviews, for example using review graph models. In this article, we study the effects of network scale on network-based review spammer detection models, specifically on the trust model and the SpammerRank model. We then evaluate both network models using two large publicly available review datasets, namely: the Amazon dataset (containing 6 million reviews by more than 2 million reviewers) and the UCSD dataset (containing over 82 million reviews by 21 million reviewers). It has been observed thatSpammerRank model provides a better scaling time for applications requiring reviewer indicators and in case of trust model distributions are flattening out indicating variance of reviews with respect to spamming. Detailed observations on the scaling effects of these models are reported in the result section.
引用
收藏
页码:4994 / 5004
页数:11
相关论文
共 50 条
  • [31] Evaluating IM on a large-scale network
    Shiau, Wen-Lung
    Li, Yu-Feng
    Chao, Han-Chieh
    Hsu, Ping-Yu
    COMPUTER COMMUNICATIONS, 2006, 29 (16) : 3113 - 3121
  • [32] ON LARGE-SCALE NONLINEAR NETWORK OPTIMIZATION
    TOINT, PL
    TUYTTENS, D
    MATHEMATICAL PROGRAMMING, 1990, 48 (01) : 125 - 159
  • [33] LARGE-SCALE NETWORK TOPOLOGICAL OPTIMIZATION
    BOORSTYN, RR
    FRANK, H
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1977, 25 (01) : 29 - 47
  • [34] A network architecture for large-scale science
    Gorinsky, Sergey
    Jechlitschek, Christoph
    Rao, Nageswara S. V.
    2007 HIGH-SPEED NETWORKS WORKSHOP, 2007, : 1 - +
  • [35] A LARGE-SCALE NETWORK STORAGE FACILITY
    DAKIN, RJ
    LEDERER, BR
    PARKER, KR
    SOFTWARE-PRACTICE & EXPERIENCE, 1985, 15 (09): : 889 - 899
  • [36] Understanding Large-Scale Software - A Hierarchical View
    Levy, Omer
    Feitelson, Dror G.
    2019 IEEE/ACM 27TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2019), 2019, : 283 - 293
  • [37] Understanding Source Code Comments at Large-Scale
    He, Hao
    ESEC/FSE'2019: PROCEEDINGS OF THE 2019 27TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2019, : 1217 - 1219
  • [38] Advances in the understanding of the large-scale gap test
    Burley, S. J.
    Bourne, N. K.
    Fung, V.
    Hollands, R.
    Millett, J. C. F.
    Milne, A. M.
    Wood, A.
    Shock Compression of Condensed Matter - 2005, Pts 1 and 2, 2006, 845 : 944 - 947
  • [39] The Case for a Large-scale Occultation Network
    Rice, Malena
    Laughlin, Gregory
    ASTRONOMICAL JOURNAL, 2019, 158 (01):
  • [40] Large-Scale Signaling Network Reconstruction
    Hashemikhabir, Seyedsasan
    Ayaz, Eyup Serdar
    Kavurucu, Yusuf
    Can, Tolga
    Kahveci, Tamer
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (06) : 1696 - 1708