Towards reproducibility in recommender-systems research

被引:40
|
作者
Beel, Joeran [1 ,5 ]
Breitinger, Corinna [1 ,2 ]
Langer, Stefan [1 ,3 ]
Lommatzsch, Andreas [4 ]
Gipp, Bela [1 ,5 ]
机构
[1] Docear, Constance, Germany
[2] Linnaeus Univ, Sch Comp Sci Phys & Math, S-35195 Vaxjo, Sweden
[3] Otto Von Guericke Univ, Dept Comp Sci, D-39106 Magdeburg, Germany
[4] Tech Univ Berlin, DAI Lab, Ernst Reuter Pl 7, D-10587 Berlin, Germany
[5] Univ Konstanz, Dept Informat Sci, Universitatsstr 10, D-78464 Constance, Germany
关键词
Recommender systems; Evaluation; Experimentation; Reproducibility;
D O I
10.1007/s11257-016-9174-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Numerous recommendation approaches are in use today. However, comparing their effectiveness is a challenging task because evaluation results are rarely reproducible. In this article, we examine the challenge of reproducibility in recommender-system research. We conduct experiments using Plista's news recommender system, and Docear's research-paper recommender system. The experiments show that there are large discrepancies in the effectiveness of identical recommendation approaches in only slightly different scenarios, as well as large discrepancies for slightly different approaches in identical scenarios. For example, in one news-recommendation scenario, the performance of a content-based filtering approach was twice as high as the second-best approach, while in another scenario the same content-based filtering approach was the worst performing approach. We found several determinants that may contribute to the large discrepancies observed in recommendation effectiveness. Determinants we examined include user characteristics (gender and age), datasets, weighting schemes, the time at which recommendations were shown, and user-model size. Some of the determinants have interdependencies. For instance, the optimal size of an algorithms' user model depended on users' age. Since minor variations in approaches and scenarios can lead to significant changes in a recommendation approach's performance, ensuring reproducibility of experimental results is difficult. We discuss these findings and conclude that to ensure reproducibility, the recommender-system community needs to (1) survey other research fields and learn from them, (2) find a common understanding of reproducibility, (3) identify and understand the determinants that affect reproducibility, (4) conduct more comprehensive experiments, (5) modernize publication practices, (6) foster the development and use of recommendation frameworks, and (7) establish best-practice guidelines for recommender-systems research.
引用
收藏
页码:69 / 101
页数:33
相关论文
共 50 条
  • [1] Towards reproducibility in recommender-systems research
    Joeran Beel
    Corinna Breitinger
    Stefan Langer
    Andreas Lommatzsch
    Bela Gipp
    User Modeling and User-Adapted Interaction, 2016, 26 : 69 - 101
  • [2] Improving accountability in recommender systems research through reproducibility
    Alejandro Bellogín
    Alan Said
    User Modeling and User-Adapted Interaction, 2021, 31 : 941 - 977
  • [3] A Troubling Analysis of Reproducibility and Progress in Recommender Systems Research
    Dacrema, Maurizio Ferrari
    Boglio, Simone
    Cremonesi, Paolo
    Jannach, Dietmar
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2021, 39 (02)
  • [4] Improving accountability in recommender systems research through reproducibility
    Bellogin, Alejandro
    Said, Alan
    USER MODELING AND USER-ADAPTED INTERACTION, 2021, 31 (05) : 941 - 977
  • [5] Reproducibility of Experiments in Recommender Systems Evaluation
    Polatidis, Nikolaos
    Kapetanakis, Stelios
    Pimenidis, Elias
    Kosmidis, Konstantinos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018, 2018, 519 : 401 - 409
  • [6] Enabling Reproducibility in Group Recommender Systems
    Dario Silveira, Joaquin
    Salamo, Maria
    Boratto, Ludovico
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2022, 356 : 115 - 124
  • [7] Towards Cognitive Recommender Systems
    Beheshti, Amin
    Yakhchi, Shahpar
    Mousaeirad, Salman
    Ghafari, Seyed Mohssen
    Goluguri, Srinivasa Reddy
    Edrisi, Mohammad Amin
    ALGORITHMS, 2020, 13 (08)
  • [8] Towards Conversational Recommender Systems
    Christakopoulou, Konstantina
    Radlinski, Filip
    Hofmann, Katja
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 815 - 824
  • [9] Towards Persuasive Recommender Systems
    Alslaity, Alaa
    Tran, Thomas
    2019 IEEE 2ND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTER TECHNOLOGIES (ICICT), 2019, : 143 - 148
  • [10] Towards A Benchmark for OSS Recommender Systems
    Sbai, Nesrine
    Ben Sassi, Sihem
    Ben Ghezala, Henda Hajjami
    VISION 2020: SUSTAINABLE ECONOMIC DEVELOPMENT, INNOVATION MANAGEMENT, AND GLOBAL GROWTH, VOLS I-IX, 2017, 2017, : 5093 - 5104