Scalable Neural Contextual Bandit for Recommender Systems

Cited by: 0
Authors
Zhu, Zheqing [1]
Van Roy, Benjamin [2]
Affiliations
[1] Stanford Univ, Meta AI, Menlo Pk, CA 94025 USA
[2] Stanford Univ, Stanford, CA USA
Keywords
Recommender Systems; Contextual Bandits; Reinforcement Learning; Exploration vs Exploitation; Decision Making under Uncertainty; MODEL;
DOI
10.1145/3583780.3615048
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
High-quality recommender systems ought to deliver both innovative and relevant content through effective and exploratory interactions with users. Yet, supervised learning-based neural networks, which form the backbone of many existing recommender systems, only leverage recognized user interests, falling short when it comes to efficiently uncovering unknown user preferences. While there has been some progress with neural contextual bandit algorithms towards enabling online exploration through neural networks, their onerous computational demands hinder widespread adoption in real-world recommender systems. In this work, we propose a scalable sample-efficient neural contextual bandit algorithm for recommender systems. To do this, we design an epistemic neural network architecture, Epistemic Neural Recommendation (ENR), that enables Thompson sampling at a large scale. In two distinct large-scale experiments with real-world tasks, ENR significantly boosts click-through rates and user ratings by at least 9% and 6% respectively compared to state-of-the-art neural contextual bandit algorithms. Furthermore, it achieves equivalent performance with at least 29% fewer user interactions compared to the best-performing baseline algorithm. Remarkably, while accomplishing these improvements, ENR demands orders of magnitude fewer computational resources than neural contextual bandit baseline algorithms.
Pages: 3636-3646
Page count: 11
Related papers
50 items in total
  • [21] Bandit Algorithms for e-Commerce Recommender Systems Extended Abstract
    Broden, Bjorn
    Hammar, Mikael
    Nilsson, Bengt J.
    Paraschakis, Dimitris
    PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, : 349 - 349
  • [22] Quantum contextual bandits and recommender systems for quantum data
    Brahmachari, Shrigyan
    Lumbreras, Josep
    Tomamichel, Marco
    QUANTUM MACHINE INTELLIGENCE, 2024, 6 (02)
  • [23] Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
    Wang, Hao
    Ma, Yifei
    Ding, Hao
    Wang, Yuyang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8539 - 8547
  • [24] Selective contextual information acquisition in travel recommender systems
    Braunhofer, M.
    Ricci, F.
    Information Technology & Tourism, 2017, 17 (1) : 5 - 29
  • [25] Exploiting Contextual Information for Recommender Systems Oriented to Tourism
    Sanchez, Pablo
    RECSYS 2019: 13TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2019, : 601 - 605
  • [26] Contextual Bandits for Multi-Objective Recommender Systems
    Lacerda, Anisio
    2015 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2015), 2015, : 68 - 73
  • [27] Improving Recommender Systems by Incorporating Social Contextual Information
    Ma, Hao
    Zhou, Tom Chao
    Lyu, Michael R.
    King, Irwin
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2011, 29 (02)
  • [28] Integrating contextual sentiment analysis in collaborative recommender systems
    Osman, Nurul Aida
    Noah, Shahrul Azman Mohd
    Darwich, Mohammad
    Mohd, Masnizah
    PLOS ONE, 2021, 16 (03)
  • [29] Encrypted Linear Contextual Bandit
    Garcelon, Evrard
    Perchet, Vianney
    Pirotta, Matteo
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [30] Scalable Neural Data Server: A Data Recommender for Transfer Learning
    Cao, Tianshi
    Doubov, Sasha
    Acuna, David
    Fidler, Sanja
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34