Model-Free Preference Elicitation

被引:0
|
作者
Martinet, Carlos [2 ,6 ]
Boutilieri, Craig [1 ]
Meshil, Ofer [1 ]
Sandholm, Tuomas [2 ,3 ,4 ,5 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Strategy Robot Inc, Pittsburgh, PA USA
[4] Optimized Markets Inc, Pittsburgh, PA USA
[5] Strateg Machine Inc, Pittsburgh, PA USA
[6] Google, Mountain View, CA 94043 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
学科分类号
摘要
In recommender systems, preference elicitation (PE) is an effective way to learn about a user's preferences to improve recommendation quality. Expected value of information (EVOI), a Bayesian technique that computes expected gain in user utility, has proven to be effective in selecting useful PE queries. Most EVOI methods use probabilistic models of user preferences and query responses to compute posterior utilities. By contrast, we develop model-free variants of EVOI that rely on function approximation to obviate the need for specific modeling assumptions. Specifically, we learn user response and utility models from existing data (often available in real-world recommender systems), which are used to estimate EVOI rather than relying on explicit probabilistic inference. We augment our approach by using online planning, specifically, Monte Carlo tree search, to further enhance our elicitation policies. We show that our approach offers significant improvement in recommendation quality over standard baselines on several PE tasks.
引用
收藏
页码:3493 / 3503
页数:11
相关论文
共 50 条
  • [1] Model-Free Preference-Based Reinforcement Learning
    Wirth, Christian
    Fuernkranz, Johannes
    Neumann, Gerhard
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2222 - 2228
  • [2] Model-Free or Not?
    Zumpfe, Kai
    Smith, Albert A.
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2021, 8
  • [3] Model-Free Model Reconciliation
    Sreedharan, Sarath
    Hernandez, Alberto Olmo
    Mishra, Aditya Prasad
    Kambhampati, Subbarao
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 587 - 594
  • [4] Model-free CPPI
    Schied, Alexander
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2014, 40 : 84 - 94
  • [5] Model-free sampling
    Beer, Michael
    STRUCTURAL SAFETY, 2007, 29 (01) : 49 - 65
  • [6] Model-free control
    Fliess, Michel
    Join, Cedric
    INTERNATIONAL JOURNAL OF CONTROL, 2013, 86 (12) : 2228 - 2252
  • [7] SUPPORTING PREFERENCE ELICITATION - THE FAW PREFERENCE ELICITATION TOOL
    KAMPKE, T
    RADERMACHER, FJ
    WOLF, P
    DECISION SUPPORT SYSTEMS, 1993, 9 (04) : 381 - 391
  • [8] Model-free metacognition
    Carruthers, Peter
    Williams, David M.
    COGNITION, 2022, 225
  • [9] Set theory formulation of the model-free problem and the diffusion seeded model-free paradigm
    d'Auvergne, Edward J.
    Gooley, Paul R.
    MOLECULAR BIOSYSTEMS, 2007, 3 (07) : 483 - 494
  • [10] Cooperative Adaptive Model-Free Control With Model-Free Estimation and Online Gain Tuning
    Safaei, Ali
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 8642 - 8654