An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews

被引:60
|
作者
Dong, Lu-yu [1 ]
Ji, Shu-juan [2 ,3 ]
Zhang, Chun-jin [4 ]
Zhang, Qi [1 ]
Chiu, DicksonK. W. [5 ]
Qiu, Li-Qing [1 ]
Li, Da [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Informat Sci & Engn, Qingdao, Peoples R China
[2] Shandong Univ Sci & Technol, Key Lab Wisdom Mine Informat Technol Shandong Pro, Qingdao, Peoples R China
[3] Shandong Normal Univ, Shandong Prov Key Lab Novel Distributed Comp Soft, Jinan, Shandong, Peoples R China
[4] Shandong Univ Sci & Technol, Network Informat Ctr NIC, Qingdao, Peoples R China
[5] Univ Hong Kong, Fac Educ, Hong Kong, Hong Kong, Peoples R China
关键词
Deceptive review detection; Topic-sentiment joint probabilistic model; Latent dirichlet allocation; Gibbs sampling; REPUTATION;
D O I
10.1016/j.eswa.2018.07.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In electronic commerce, online reviews play very important roles in customers' purchasing decisions. Unfortunately, malicious sellers often hire buyers to fabricate fake reviews to improve their reputation. In order to detect deceptive reviews and mine the topics and sentiments from the reviews, in this paper, we propose an unsupervised topic-sentiment joint probabilistic model (UTSJ) based on Latent Dirichlet Allocation (LDA) model. This model first employs Gibbs sampling algorithm to approximate parameters of maximum likelihood function offline and obtain topic-sentiment joint probabilistic distribution vector for each review. Secondly, a Random Forest classifier and a SVM (Support Vector Machine) classifier are trained offline, respectively. Experimental results on real-life datasets show that our proposed model is better than baseline models such as n-grams, character n-grams in token, POS (part-of-speech), LDA, and JST (Joint Sentiment/Topic). Moreover, our UTSJ model outperforms or performs similarly to benchmark models in detecting deceptive reviews over balanced dataset and unbalanced dataset in different domains. Particularly, our UTSJ model is good at dealing with real-life unbalanced big data, which makes it very suitable for being applied in e-commerce environment. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:210 / 223
页数:14
相关论文
共 50 条
  • [21] An Unsupervised Cross-Lingual Topic Model Framework for Sentiment Classification
    Lin, Zheng
    Jin, Xiaolong
    Xu, Xueke
    Wang, Yuanzhuo
    Cheng, Xueqi
    Wang, Weiping
    Meng, Dan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (03) : 432 - 444
  • [22] SENTIMENT ANALYSIS OF MICROBLOG TEXT BASED ON JOINT SENTIMENT-TOPIC MODEL
    Zhang, Hui
    Liu, Yiqun
    Ma, Shaoping
    2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 46 - 54
  • [23] Large-Scale Joint Topic, Sentiment & User Preference Analysis for Online Reviews
    Yu, Xinli
    Chen, Zheng
    Yang, Wei-Shih
    Hu, Xiaohua
    Yan, Erjia
    Li, Guangrong
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 847 - 856
  • [24] Joint multi-grain topic sentiment: modeling semantic aspects for online reviews
    Alam, Md Hijbul
    Ryu, Woo-Jong
    Lee, SangKeun
    INFORMATION SCIENCES, 2016, 339 : 206 - 223
  • [25] Bayesian game model based unsupervised sentiment analysis of product reviews
    Punetha, Neha
    Jain, Goonjan
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 214
  • [26] Joint Sentiment Part Topic Regression Model for Multimodal Analysis
    Li, Mengyao
    Zhu, Yonghua
    Gao, Wenjing
    Cao, Meng
    Wang, Shaoxiu
    INFORMATION, 2020, 11 (10) : 1 - 16
  • [27] STC: A Joint Sentiment-Topic Model for Community Identification
    Yang, Baoguo
    Manandhar, Suresh
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2014, 8643 : 535 - 548
  • [28] Weakly supervised topic sentiment joint model with word embeddings
    Fu, Xianghua
    Sun, Xudong
    Wu, Haiying
    Cui, Laizhong
    Huang, Joshua Zhexue
    KNOWLEDGE-BASED SYSTEMS, 2018, 147 : 43 - 54
  • [29] Study on Spatio-Temporal Topic-Sentiment Synergy Model and Visualization of Online Public Opinion on Public Health Emergency
    Lu, Yuhan
    Zeng, Ziming
    E-BUSINESS: DIGITAL EMPOWERMENT FOR AN INTELLIGENT FUTURE, PT I, WHICEB 2023, 2023, 480 : 338 - 349
  • [30] Sentiment Classification of Crowdsourcing Participants' Reviews Text Based on LDA Topic Model
    Huang, Yanrong
    Wang, Rui
    Huang, Bin
    Wei, Bo
    Zheng, Shu Li
    Chen, Min
    IEEE ACCESS, 2021, 9 : 108131 - 108143