KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems

被引:42
|
作者
Gao, Chongming [1 ]
Li, Shijun [1 ]
Lei, Wenqiang [2 ]
Chen, Jiawei [3 ]
Li, Biao [4 ]
Jiang, Peng [4 ]
He, Xiangnan [1 ]
Mao, Jiaxin [5 ]
Chua, Tat-Seng [6 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Sichuan Univ, Chengdu, Sichuan, Peoples R China
[3] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[4] Kuaishou Technol Co Ltd, Beijing, Peoples R China
[5] Renmin Univ China, Beijing, Peoples R China
[6] Natl Univ Singapore, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Fully-observed data; Recommendation; Evaluation; User simulation;
D O I
10.1145/3511808.3557220
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The progress of recommender systems is hampered mainly by evaluation as it requires real-time interactions between humans and systems, which is too laborious and expensive. This issue is usually approached by utilizing the interaction history to conduct offline evaluation. However, existing datasets of user-item interactions are partially observed, leaving it unclear how and to what extent the missing interactions will influence the evaluation. To answer this question, we collect a fully-observed dataset from Kuaishou's online environment, where almost all 1, 411 users have been exposed to all 3, 327 items. To the best of our knowledge, this is the first real-world fully-observed data with millions of user-item interactions. With this unique dataset, we conduct a preliminary analysis of how the two factors - data density and exposure bias - affect the evaluation results of multi-round conversational recommendation. Our main discoveries are that the performance ranking of different methods varies with the two factors, and this effect can only be alleviated in certain cases by estimating missing interactions for user simulation. This demonstrates the necessity of the fully-observed dataset. We release the dataset and the pipeline implementation for evaluation at https://kuairec.com.
引用
收藏
页码:540 / 550
页数:11
相关论文
共 50 条
  • [1] How Will I Argue? A Dataset for Evaluating Recommender Systems for Argumentations
    Brenneis, Markus
    Behrendt, Maike
    Harmeling, Stefan
    SIGDIAL 2021: 22ND ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2021), 2021, : 360 - 367
  • [2] Global analytic solution of fully-observed variational bayesian matrix factorization
    Nakajima, Shinichi
    Sugiyama, Masashi
    Babacan, S. Derin
    Tomioka, Ryota
    Journal of Machine Learning Research, 2013, 14 (01) : 1 - 37
  • [3] Global Analytic Solution of Fully-observed Variational Bayesian Matrix Factorization
    Nakajima, Shinichi
    Sugiyama, Masashi
    Babacan, S. Derin
    Tomioka, Ryota
    JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 1 - 37
  • [4] Evaluating Recommender Systems
    Zaier, Zied
    Godin, Robert
    Faucher, Luc
    FOURTH INTERNATIONAL CONFERENCE ON AUTOMATED SOLUTIONS FOR CROSS MEDIA CONTENT AND MULTI-CHANNEL DISTRIBUTION, PROCEEDINGS, 2008, : 211 - 217
  • [5] An Educational News Dataset for Recommender Systems
    Xing, Yujie
    Mohallick, Itishree
    Gulla, Jon Atle
    Ozgobek, Ozlem
    Zhang, Lemei
    ECML PKDD 2020 WORKSHOPS, 2020, 1323 : 562 - 570
  • [6] Evaluating content novelty in recommender systems
    Mendoza, Marcelo
    Torres, Nicolas
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2020, 54 (02) : 297 - 316
  • [7] Evaluating Hybrid Music Recommender Systems
    Hornung, Thomas
    Ziegler, Cai-Nicolas
    Franz, Simon
    Przyjaciel-Zablocki, Martin
    Schaetzle, Alexander
    Lausen, Georg
    2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2013, : 57 - 64
  • [8] Exploratory Methods for Evaluating Recommender Systems
    De Pauw, Joey
    RECSYS 2020: 14TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2020, : 782 - 786
  • [9] Evaluating collaborative filtering recommender systems
    Herlocker, JL
    Konstan, JA
    Terveen, K
    Riedl, JT
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2004, 22 (01) : 5 - 53
  • [10] Evaluating Recommender Systems: Survey and Framework
    Zangerle, Eva
    Bauer, Christine
    ACM COMPUTING SURVEYS, 2023, 55 (08)