An unsupervised topic-sentiment joint probabilistic model for detecting deceptive reviews

被引:60
|
作者
Dong, Lu-yu [1 ]
Ji, Shu-juan [2 ,3 ]
Zhang, Chun-jin [4 ]
Zhang, Qi [1 ]
Chiu, DicksonK. W. [5 ]
Qiu, Li-Qing [1 ]
Li, Da [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Informat Sci & Engn, Qingdao, Peoples R China
[2] Shandong Univ Sci & Technol, Key Lab Wisdom Mine Informat Technol Shandong Pro, Qingdao, Peoples R China
[3] Shandong Normal Univ, Shandong Prov Key Lab Novel Distributed Comp Soft, Jinan, Shandong, Peoples R China
[4] Shandong Univ Sci & Technol, Network Informat Ctr NIC, Qingdao, Peoples R China
[5] Univ Hong Kong, Fac Educ, Hong Kong, Hong Kong, Peoples R China
关键词
Deceptive review detection; Topic-sentiment joint probabilistic model; Latent dirichlet allocation; Gibbs sampling; REPUTATION;
D O I
10.1016/j.eswa.2018.07.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In electronic commerce, online reviews play very important roles in customers' purchasing decisions. Unfortunately, malicious sellers often hire buyers to fabricate fake reviews to improve their reputation. In order to detect deceptive reviews and mine the topics and sentiments from the reviews, in this paper, we propose an unsupervised topic-sentiment joint probabilistic model (UTSJ) based on Latent Dirichlet Allocation (LDA) model. This model first employs Gibbs sampling algorithm to approximate parameters of maximum likelihood function offline and obtain topic-sentiment joint probabilistic distribution vector for each review. Secondly, a Random Forest classifier and a SVM (Support Vector Machine) classifier are trained offline, respectively. Experimental results on real-life datasets show that our proposed model is better than baseline models such as n-grams, character n-grams in token, POS (part-of-speech), LDA, and JST (Joint Sentiment/Topic). Moreover, our UTSJ model outperforms or performs similarly to benchmark models in detecting deceptive reviews over balanced dataset and unbalanced dataset in different domains. Particularly, our UTSJ model is good at dealing with real-life unbalanced big data, which makes it very suitable for being applied in e-commerce environment. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:210 / 223
页数:14
相关论文
共 50 条
  • [1] A Joint Model for Topic-Sentiment Modeling from Text
    Dermouche, Mohamed
    Kouas, Leila
    Velcin, Julien
    Loudcher, Sabine
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 819 - 824
  • [2] A Joint Model for Topic-Sentiment Evolution over Time
    Dermouche, Mohamed
    Velcin, Julien
    Khouas, Leila
    Loudcher, Sabine
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 773 - 778
  • [3] Segment-Level Joint Topic-Sentiment Model for Online Review Analysis
    Yang, Qinjuan
    Rao, Yanghui
    Xie, Haoran
    Wang, Jiahai
    Wang, Fu Lee
    Chan, Wai Hong
    IEEE INTELLIGENT SYSTEMS, 2019, 34 (01) : 43 - 50
  • [4] Segment-Level Joint Topic-Sentiment Model for Online Review Analysis (vol 34, pg 43, 2019)
    Xie, Haoran
    IEEE INTELLIGENT SYSTEMS, 2019, 34 (02) : 82 - 82
  • [5] Unsupervised Sentiment Classification: A Hybrid Sentiment-Topic Model Approach
    Blair, Stuart J.
    Bi, Yaxin
    Mulvenna, Maurice D.
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 453 - 460
  • [6] A deceptive detection model based on topic, sentiment, and sentence structure information
    Du, Xiaodong
    Zhu, Ruiqi
    Zhao, Fuqiang
    Zhao, Fangzhou
    Han, Ping
    Zhu, Zhengyu
    APPLIED INTELLIGENCE, 2020, 50 (11) : 3868 - 3881
  • [7] A deceptive detection model based on topic, sentiment, and sentence structure information
    Xiaodong Du
    Ruiqi Zhu
    Fuqiang Zhao
    Fangzhou Zhao
    Ping Han
    Zhengyu Zhu
    Applied Intelligence, 2020, 50 : 3868 - 3881
  • [8] Clustering-Based Joint Topic-Sentiment Modeling of Social Media Data: A Neural Networks Approach
    Hanny, David
    Resch, Bernd
    INFORMATION, 2024, 15 (04)
  • [9] Dynamic Joint Sentiment-Topic Model
    He, Yulan
    Lin, Chenghua
    Gao, Wei
    Wong, Kam-Fai
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 5 (01)
  • [10] Topic-sentiment evolution over time: a manifold learning-based model for online news
    Yuemei Xu
    Yang Li
    Ye Liang
    Lianqiao Cai
    Journal of Intelligent Information Systems, 2020, 55 : 27 - 49