The Halo Effect in Multicomponent Ratings and Its Implications for Recommender Systems: The Case of Yahoo! Movies

被引:63
|
作者
Sahoo, Nachiketa [1 ]
Krishnan, Ramayya [2 ]
Duncan, George [2 ]
Callan, Jamie
机构
[1] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Heinz Coll, Pittsburgh, PA 15213 USA
关键词
collaborative filtering; multicomponent rating; halo effect; Bayesian network; mixture model; expectation maximization; recommender system; SELF-RATINGS; ERROR; SUPERIOR;
D O I
10.1287/isre.1100.0336
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Collaborative filtering algorithms learn from the ratings of a group of users on a set of items to find personalized recommendations for each user. Traditionally they have been designed to work with one-dimensional ratings. With interest growing in recommendations based on multiple aspects of items, we present an algorithm for using multicomponent rating data. The presented mixture model-based algorithm uses the component rating dependency structure discovered by a structure learning algorithm. The structure is supported by the psychometric literature on the halo effect. This algorithm is compared with a set of model-based and instance-based algorithms for single-component ratings and their variations for multicomponent ratings. We evaluate the algorithms using data from Yahoo! Movies. Use of multiple components leads to significant improvements in recommendations. However, we find that the choice of algorithm depends on the sparsity of the training data. It also depends on whether the task of the algorithm is to accurately predict ratings or to retrieve relevant items. In our experiments a model-based multicomponent rating algorithm is able to better retrieve items when training data are sparse. However, if the training data are not sparse, or if we are trying to predict the rating values accurately, then the instance-based multicomponent rating collaborative filtering algorithms perform better. Beyond generating recommendations we show that the proposed model can fill in missing rating components. Theories in psychometric literature and the empirical evidence suggest that rating specific aspects of a subject is difficult. Hence, filling in the missing component values leads to the possibility of a rater support system to facilitate gathering of multicomponent ratings.
引用
收藏
页码:231 / 246
页数:16
相关论文
共 24 条
  • [22] Examining the distance-decay effect on obsidian lithic technological organization and its implications for raw material transportation: A case study from the Upper Paleolithic of Northeast Asia
    Hou, Zhe
    Xu, Ting
    Obie, Michael
    Guo, Beiheng
    Zhao, Yuchao
    Gao, Xing
    Seong, Chuntaek
    JOURNAL OF ARCHAEOLOGICAL SCIENCE-REPORTS, 2024, 57
  • [23] The effect of inflation on the morphology-derived rheological parameters of lava flows and its implications for interpreting remote sensing data - A case study on the 2014/2015 eruption at Holuhraun, Iceland
    Kolzenburg, S.
    Jaenicke, J.
    Muenzer, U.
    Dingwell, D. B.
    JOURNAL OF VOLCANOLOGY AND GEOTHERMAL RESEARCH, 2018, 357 : 200 - 212
  • [24] The Empirical Testing for the Effect of Organizational Commitment and Leadership Style on the Implementation Success of Enterprise Resource Planning (ERP) Systems and its Implications on the Quality of Accounting Information (Survey on State-Owned Enterprises in Bandung, West Java']Java, Indonesia)
    Mulyani, Sri
    Endraria
    Putra, Donny Maha
    Sulcmadilaga, Citra
    Rozak, Yuhanis Ladewi
    SUSTAINABLE ECONOMIC GROWTH, EDUCATION EXCELLENCE, AND INNOVATION MANAGEMENT THROUGH VISION 2020, VOLS I-VII, 2017, : 807 - 822