KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems

被引:42
|
作者
Gao, Chongming [1 ]
Li, Shijun [1 ]
Lei, Wenqiang [2 ]
Chen, Jiawei [3 ]
Li, Biao [4 ]
Jiang, Peng [4 ]
He, Xiangnan [1 ]
Mao, Jiaxin [5 ]
Chua, Tat-Seng [6 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Sichuan Univ, Chengdu, Sichuan, Peoples R China
[3] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[4] Kuaishou Technol Co Ltd, Beijing, Peoples R China
[5] Renmin Univ China, Beijing, Peoples R China
[6] Natl Univ Singapore, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Fully-observed data; Recommendation; Evaluation; User simulation;
D O I
10.1145/3511808.3557220
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The progress of recommender systems is hampered mainly by evaluation as it requires real-time interactions between humans and systems, which is too laborious and expensive. This issue is usually approached by utilizing the interaction history to conduct offline evaluation. However, existing datasets of user-item interactions are partially observed, leaving it unclear how and to what extent the missing interactions will influence the evaluation. To answer this question, we collect a fully-observed dataset from Kuaishou's online environment, where almost all 1, 411 users have been exposed to all 3, 327 items. To the best of our knowledge, this is the first real-world fully-observed data with millions of user-item interactions. With this unique dataset, we conduct a preliminary analysis of how the two factors - data density and exposure bias - affect the evaluation results of multi-round conversational recommendation. Our main discoveries are that the performance ranking of different methods varies with the two factors, and this effect can only be alleviated in certain cases by estimating missing interactions for user simulation. This demonstrates the necessity of the fully-observed dataset. We release the dataset and the pipeline implementation for evaluation at https://kuairec.com.
引用
收藏
页码:540 / 550
页数:11
相关论文
共 50 条
  • [21] Evaluating Decision-Aware Recommender Systems
    Mesas, Rus M.
    Bellogin, Alejandro
    PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, : 74 - 78
  • [22] Evaluating Recommender Systems in Feature Model Configuration
    Uta, Mathias
    Felfernig, Alexander
    Le, Viet-Man
    Popescu, Andrei
    Tran, Thi Ngoc Trang
    Helic, Denis
    SPLC '21: PROCEEDINGS OF THE 25TH ACM INTERNATIONAL SYSTEMS AND SOFTWARE PRODUCT LINE CONFERENCE, VOL A, 2021,
  • [23] On Challenges of Evaluating Recommender Systems in an Offline Setting
    Sun, Aixin
    PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 1284 - 1285
  • [24] Evaluating the Pros and Cons of Recommender Systems Explanations
    Wardatzky, Kathrin
    PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 1302 - 1307
  • [26] Evaluating the intrusion cost of recommending in recommender systems
    Hernandez-Del-Olmo, F
    Gaudioso, E
    Boticario, JG
    USER MODELING 2005, PROCEEDINGS, 2005, 3538 : 342 - 346
  • [27] Introducing CSP Dataset: A Dataset Optimized for the Study of the Cold Start Problem in Recommender Systems
    Herce-Zelaya, Julio
    Porcel, Carlos
    Tejeda-Lorente, Alvaro
    Bernabe-Moreno, Juan
    Herrera-Viedma, Enrique
    INFORMATION, 2023, 14 (01)
  • [28] A Large Multilingual and Multi-domain Dataset for Recommender Systems
    Di Tommaso, Giorgia
    Faralli, Stefano
    Velardi, Paola
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2806 - 2813
  • [29] New Insights Towards Developing Recommender Systems
    Taghavi, Mona
    Bentahar, Jamal
    Bakhtiyari, Kaveh
    Hanachi, Chihab
    COMPUTER JOURNAL, 2018, 61 (03): : 319 - 348
  • [30] Evaluating Teachers' Perceptions of Learning Design Recommender Systems
    Karga, Soultana
    Satratzemi, Maya
    TRANSFORMING LEARNING WITH MEANINGFUL TECHNOLOGIES, EC-TEL 2019, 2019, 11722 : 98 - 111