KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems

被引:42
|
作者
Gao, Chongming [1 ]
Li, Shijun [1 ]
Lei, Wenqiang [2 ]
Chen, Jiawei [3 ]
Li, Biao [4 ]
Jiang, Peng [4 ]
He, Xiangnan [1 ]
Mao, Jiaxin [5 ]
Chua, Tat-Seng [6 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Sichuan Univ, Chengdu, Sichuan, Peoples R China
[3] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[4] Kuaishou Technol Co Ltd, Beijing, Peoples R China
[5] Renmin Univ China, Beijing, Peoples R China
[6] Natl Univ Singapore, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Fully-observed data; Recommendation; Evaluation; User simulation;
D O I
10.1145/3511808.3557220
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The progress of recommender systems is hampered mainly by evaluation as it requires real-time interactions between humans and systems, which is too laborious and expensive. This issue is usually approached by utilizing the interaction history to conduct offline evaluation. However, existing datasets of user-item interactions are partially observed, leaving it unclear how and to what extent the missing interactions will influence the evaluation. To answer this question, we collect a fully-observed dataset from Kuaishou's online environment, where almost all 1, 411 users have been exposed to all 3, 327 items. To the best of our knowledge, this is the first real-world fully-observed data with millions of user-item interactions. With this unique dataset, we conduct a preliminary analysis of how the two factors - data density and exposure bias - affect the evaluation results of multi-round conversational recommendation. Our main discoveries are that the performance ranking of different methods varies with the two factors, and this effect can only be alleviated in certain cases by estimating missing interactions for user simulation. This demonstrates the necessity of the fully-observed dataset. We release the dataset and the pipeline implementation for evaluation at https://kuairec.com.
引用
收藏
页码:540 / 550
页数:11
相关论文
共 50 条
  • [41] X-Wines: A Wine Dataset for Recommender Systems and Machine Learning
    de Azambuja, Rogerio Xavier
    Morais, A. Jorge
    Filipe, Vitor
    BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (01)
  • [42] Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems
    Yuan, Guanghu
    Yuan, Fajie
    Li, Yudong
    Kong, Beibei
    Li, Shujie
    Chen, Lei
    Yang, Min
    Yu, Chenyun
    Hu, Bo
    Li, Zang
    Xu, Yu
    Qie, Xiaohu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [43] Effects of Binary Similarity Metrics in Recommender Systems for Jester Jokes Dataset
    Senyurek, Edip
    Kevric, Jasmin
    NEW TECHNOLOGIES, DEVELOPMENT AND APPLICATION VII, VOL 2, NT-2024, 2024, 1070 : 404 - 412
  • [44] A flexible framework for evaluating user and item fairness in recommender systems
    Yashar Deldjoo
    Vito Walter Anelli
    Hamed Zamani
    Alejandro Bellogín
    Tommaso Di Noia
    User Modeling and User-Adapted Interaction, 2021, 31 : 457 - 511
  • [45] Evaluating facial recognition services as interaction technique for recommender systems
    Toon De Pessemier
    Ine Coppens
    Luc Martens
    Multimedia Tools and Applications, 2020, 79 : 23547 - 23570
  • [46] Evaluating recommender systems for AI-driven biomedical informatics
    Cava, William La
    Williams, Heather
    Fu, Weixuan
    Vitale, Steve
    Srivatsan, Durga
    Moore, Jason H.
    BIOINFORMATICS, 2021, 37 (02) : 250 - 256
  • [47] Evaluating Recommender Systems for Technology Enhanced Learning: A Quantitative Survey
    Erdt, Mojisola
    Fernandez, Alejandro
    Rensing, Christoph
    IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2015, 8 (04): : 326 - 344
  • [48] A flexible framework for evaluating user and item fairness in recommender systems
    Deldjoo, Yashar
    Anelli, Vito Walter
    Zamani, Hamed
    Bellogin, Alejandro
    Di Noia, Tommaso
    USER MODELING AND USER-ADAPTED INTERACTION, 2021, 31 (03) : 457 - 511
  • [49] Using and Evaluating Quantum Computing for Information Retrieval and Recommender Systems
    Dacrema, Maurizio Ferrari
    Pasin, Andrea
    Cremonesi, Paolo
    Ferro, Nicola
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 3017 - 3020
  • [50] SynEvaRec: A Framework for Evaluating Recommender Systems on Synthetic Data Classes
    Provalov, Vladimir
    Stavinova, Elizaveta
    Chunaev, Petr
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 55 - 64