An experimental study on the performance of collaborative filtering based on user reviews for large-scale datasets

被引:2
|
作者
Al-Ghuribi, Sumaia [1 ,2 ]
Noah, Shahrul Azman Mohd [1 ]
Mohammed, Mawal [3 ]
机构
[1] Univ Kebangsaan Malaysia, Ctr Artificial Intelligence Technol, Bangi, Selangor, Malaysia
[2] Taiz Univ, Fac Appl Sci, Dept Comp Sci, Taizi, Yemen
[3] Prince Sattam Bin Abdulaziz Univ, Dept Software Engn, Alkharj, Saudi Arabia
关键词
Collaborative filtering; Recommender systems; User reviews; Sentiment analysis; RECOMMENDER SYSTEMS; SENTIMENT ANALYSIS;
D O I
10.7717/peerj-cs.1525
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Collaborative filtering (CF) approaches generate user recommendations based on user similarities. These similarities are calculated based on the overall (explicit) user ratings. However, in some domains, such ratings may be sparse or unavailable. User reviews can play a significant role in such cases, as implicit ratings can be derived from the reviews using sentiment analysis, a natural language processing technique. However, most current studies calculate the implicit ratings by simply aggregating the scores of all sentiment words appearing in reviews and, thus, ignoring the elements of sentiment degrees and aspects of user reviews. This study addresses this issue by calculating the implicit rating differently, leveraging the rich information in user reviews by using both sentiment words and aspect-sentiment word pairs to enhance the CF performance. It proposes four methods to calculate the implicit ratings on large-scale datasets: the first considers the degree of sentiment words, while the second exploits the aspects by extracting aspect-sentiment word pairs to calculate the implicit ratings. The remaining two methods combine explicit ratings with the implicit ratings generated by the first two methods. The generated ratings are then incorporated into different CF rating prediction algorithms to evaluate their effectiveness in enhancing the CF performance. Evaluative experiments of the proposed methods are conducted on two large-scale datasets: Amazon and Yelp. Results of the experiments show that the proposed ratings improved the accuracy of CF rating prediction algorithms and outperformed the explicit ratings in terms of three predictive accuracy metrics.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] A Collaborative Filtering Approach based on User's Reviews
    D'Addio, Rafael Martins
    Manzato, Marcelo Garcia
    2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 204 - 209
  • [2] Experimental performance study of a user intensive and large-scale digital library framework
    Liao, Xiangwen
    Fang, Binxing
    Luo, Weihua
    Wang, Bin
    SECOND INTERNATIONAL CONFERENCE ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2006, : 332 - +
  • [3] Study on Collaborative Filtering Recommendation Model Fusing User Reviews
    Wang, Heyong
    Hong, Ming
    Lan, Jinjiong
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2019, 23 (05) : 864 - 873
  • [4] Multistage strategy for ground point filtering on large-scale datasets
    Paredes, Diego Teijeiro
    Lopez, Margarita Amor
    Bujan, Sandra
    Richter, Rico
    Doellner, Juergen
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (18): : 25974 - 26001
  • [5] Large-scale parallel collaborative filtering for the Netflix Prize
    Zhou, Yunhong
    Wilkinson, Dennis
    Schreiber, Robert
    Pan, Rong
    ALGORITHMIC ASPECTS IN INFORMATION AND MANAGEMENT, PROCEEDINGS, 2008, 5034 : 337 - 348
  • [6] Towards Matching User Mobility Traces in Large-Scale Datasets
    Kondor, Daniel
    Hashemian, Behrooz
    de Montjoye, Yves-Alexandre
    Ratti, Carlo
    IEEE TRANSACTIONS ON BIG DATA, 2020, 6 (04) : 714 - 726
  • [7] Additive Regression Applied to a Large-Scale Collaborative Filtering Problem
    Frank, Eibe
    Hall, Mark
    AI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5360 : 435 - +
  • [8] Neural Binary Representation Learning for Large-Scale Collaborative Filtering
    Zhang, Yujia
    Wu, Jun
    Wang, Haishuai
    IEEE ACCESS, 2019, 7 : 60752 - 60763
  • [9] Fast Nonparametric Matrix Factorization for Large-scale Collaborative Filtering
    Yu, Kai
    Zhu, Shenghuo
    Lafferty, John
    Gong, Yihong
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 211 - 218
  • [10] User-based Collaborative Filtering: Sparsity and Performance
    Redpath, Jennifer
    Glass, David H.
    McClean, Sally
    Chen, Luke
    STAIRS 2010: PROCEEDINGS OF THE FIFTH STARTING AI RESEARCHERS' SYMPOSIUM, 2011, 222 : 264 - 276