An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews

被引:60
|
作者
Dong, Lu-yu [1 ]
Ji, Shu-juan [2 ,3 ]
Zhang, Chun-jin [4 ]
Zhang, Qi [1 ]
Chiu, DicksonK. W. [5 ]
Qiu, Li-Qing [1 ]
Li, Da [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Informat Sci & Engn, Qingdao, Peoples R China
[2] Shandong Univ Sci & Technol, Key Lab Wisdom Mine Informat Technol Shandong Pro, Qingdao, Peoples R China
[3] Shandong Normal Univ, Shandong Prov Key Lab Novel Distributed Comp Soft, Jinan, Shandong, Peoples R China
[4] Shandong Univ Sci & Technol, Network Informat Ctr NIC, Qingdao, Peoples R China
[5] Univ Hong Kong, Fac Educ, Hong Kong, Hong Kong, Peoples R China
关键词
Deceptive review detection; Topic-sentiment joint probabilistic model; Latent dirichlet allocation; Gibbs sampling; REPUTATION;
D O I
10.1016/j.eswa.2018.07.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In electronic commerce, online reviews play very important roles in customers' purchasing decisions. Unfortunately, malicious sellers often hire buyers to fabricate fake reviews to improve their reputation. In order to detect deceptive reviews and mine the topics and sentiments from the reviews, in this paper, we propose an unsupervised topic-sentiment joint probabilistic model (UTSJ) based on Latent Dirichlet Allocation (LDA) model. This model first employs Gibbs sampling algorithm to approximate parameters of maximum likelihood function offline and obtain topic-sentiment joint probabilistic distribution vector for each review. Secondly, a Random Forest classifier and a SVM (Support Vector Machine) classifier are trained offline, respectively. Experimental results on real-life datasets show that our proposed model is better than baseline models such as n-grams, character n-grams in token, POS (part-of-speech), LDA, and JST (Joint Sentiment/Topic). Moreover, our UTSJ model outperforms or performs similarly to benchmark models in detecting deceptive reviews over balanced dataset and unbalanced dataset in different domains. Particularly, our UTSJ model is good at dealing with real-life unbalanced big data, which makes it very suitable for being applied in e-commerce environment. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:210 / 223
页数:14
相关论文
共 50 条
  • [41] U-RSNet: An unsupervised probabilistic model for joint registration and segmentation
    Qiu, Liang
    Ren, Hongliang
    NEUROCOMPUTING, 2021, 450 : 264 - 274
  • [42] Detecting Topic and Sentiment Trends in Physician Rating Websites: Analysis of Online Reviews Using 3-Wave Datasets
    Shah, Adnan Muhammad
    Naqvi, Rizwan Ali
    Jeong, Ok-Ran
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (09)
  • [43] LJST: A Semi-supervised Joint Sentiment-Topic Model for Short Texts
    Sengupta A.
    Roy S.
    Ranjan G.
    SN Computer Science, 2021, 2 (4)
  • [44] Fake Review Detection Based on Joint Topic and Sentiment Pre-Training Model
    Zhang D.
    Huang L.
    Zhang R.
    Xue H.
    Lin J.
    Lu Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (07): : 1385 - 1394
  • [45] Sentiment Recognition of Online Chinese Micro Movie Reviews Using Multiple Probabilistic Reasoning Model
    Xu, Wei
    Liu, Zhi
    Wang, Tai
    Liu, Sanya
    JOURNAL OF COMPUTERS, 2013, 8 (08) : 1906 - 1911
  • [46] Weighted Joint Sentiment-Topic Model for Sentiment Analysis Compared to ALGA: Adaptive Lexicon Learning Using Genetic Algorithm
    Osmani, Amjad
    Mohasefi, Jamshid Bagherzadeh
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [47] Probabilistic Topic Model based Approach for Detecting Bursty Events from Social Media Data
    Li, Chunshan
    Chu, Dianhui
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 701 - 706
  • [48] A Hybrid Deep Learning Model to Predict Business Closure from Reviews and User Attributes Using Sentiment Aligned Topic Model
    Thazhackal, Sharun S.
    Devi, V. Susheela
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 397 - 404
  • [49] Retraction Note to: A joint model for analyzing topic and sentiment dynamics from large-scale online news
    Peng Liu
    Jon Atle Gulla
    Lemei Zhang
    World Wide Web, 2019, 22 : 417 - 417
  • [50] RETRACTED ARTICLE: A joint model for analyzing topic and sentiment dynamics from large-scale online news
    Peng Liu
    Jon Atle Gulla
    Lemei Zhang
    World Wide Web, 2018, 21 : 1117 - 1139