An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews

被引:60
|
作者
Dong, Lu-yu [1 ]
Ji, Shu-juan [2 ,3 ]
Zhang, Chun-jin [4 ]
Zhang, Qi [1 ]
Chiu, DicksonK. W. [5 ]
Qiu, Li-Qing [1 ]
Li, Da [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Informat Sci & Engn, Qingdao, Peoples R China
[2] Shandong Univ Sci & Technol, Key Lab Wisdom Mine Informat Technol Shandong Pro, Qingdao, Peoples R China
[3] Shandong Normal Univ, Shandong Prov Key Lab Novel Distributed Comp Soft, Jinan, Shandong, Peoples R China
[4] Shandong Univ Sci & Technol, Network Informat Ctr NIC, Qingdao, Peoples R China
[5] Univ Hong Kong, Fac Educ, Hong Kong, Hong Kong, Peoples R China
关键词
Deceptive review detection; Topic-sentiment joint probabilistic model; Latent dirichlet allocation; Gibbs sampling; REPUTATION;
D O I
10.1016/j.eswa.2018.07.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In electronic commerce, online reviews play very important roles in customers' purchasing decisions. Unfortunately, malicious sellers often hire buyers to fabricate fake reviews to improve their reputation. In order to detect deceptive reviews and mine the topics and sentiments from the reviews, in this paper, we propose an unsupervised topic-sentiment joint probabilistic model (UTSJ) based on Latent Dirichlet Allocation (LDA) model. This model first employs Gibbs sampling algorithm to approximate parameters of maximum likelihood function offline and obtain topic-sentiment joint probabilistic distribution vector for each review. Secondly, a Random Forest classifier and a SVM (Support Vector Machine) classifier are trained offline, respectively. Experimental results on real-life datasets show that our proposed model is better than baseline models such as n-grams, character n-grams in token, POS (part-of-speech), LDA, and JST (Joint Sentiment/Topic). Moreover, our UTSJ model outperforms or performs similarly to benchmark models in detecting deceptive reviews over balanced dataset and unbalanced dataset in different domains. Particularly, our UTSJ model is good at dealing with real-life unbalanced big data, which makes it very suitable for being applied in e-commerce environment. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:210 / 223
页数:14
相关论文
共 50 条
  • [31] A weakly-supervised graph-based joint sentiment topic model for multi-topic sentiment analysis
    Zhou, Tao
    Law, Kris
    Creighton, Douglas
    INFORMATION SCIENCES, 2022, 609 : 1030 - 1051
  • [32] A Sentiment-Aware Topic Model for Extracting Failures from Product Reviews
    Tutubalina, Elena
    TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 37 - 45
  • [33] An Unsupervised Fine-grained Sentiment Analysis Model for Chinese Online Reviews
    Shi, Hanxiao
    Zhou, Guodong
    Qian, Peide
    Li, Xiaojun
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (10): : 4277 - 4294
  • [34] A Joint Model for Sentiment-Aware Topic Detection on Social Media
    Xu, Kang
    Qi, Guilin
    Huang, Junheng
    Wu, Tianxing
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 338 - 346
  • [35] Dynamic non-parametric joint sentiment topic mixture model
    Fu, Xianghua
    Yang, Kun
    Huang, Joshua Zhexue
    Cui, Laizhong
    KNOWLEDGE-BASED SYSTEMS, 2015, 82 : 102 - 114
  • [36] Opinion Mining Using Enriched Joint Sentiment-Topic Model
    Osmani, Amjad
    Mohasefi, Jamshid Bagherzadeh
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2023, 22 (01) : 313 - 375
  • [37] Analysing user sentiment of Indian movie reviews: A probabilistic committee selection model
    Trivedi, Shrawan Kumar
    Dey, Shubhamoy
    ELECTRONIC LIBRARY, 2018, 36 (04): : 590 - 606
  • [38] Unsupervised sentiment analysis of Hindi reviews using MCDM and game model optimization techniques
    NEHA PUNETHA
    GOONJAN JAIN
    Sādhanā, 48
  • [39] Unsupervised sentiment analysis of Hindi reviews using MCDM and game model optimization techniques
    Punetha, Neha
    Jain, Goonjan
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2023, 48 (04):
  • [40] SBTM: A joint sentiment and behaviour topic model for online course discussion forums
    Peng, Xian
    Xu, Qinmei
    Gan, Wenbin
    JOURNAL OF INFORMATION SCIENCE, 2021, 47 (04) : 517 - 532