A survey of preference-based reinforcement learning methods

被引:0
|
作者
机构
[1] Wirth, Christian
[2] Akrour, Riad
[3] Neumann, Gerhard
[4] Fürnkranz, Johannes
来源
| 1600年 / Microtome Publishing卷 / 18期
关键词
105;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [1] A Survey of Preference-Based Reinforcement Learning Methods
    Wirth, Christian
    Akrour, Riad
    Neumann, Gerhard
    Fuernkranz, Johannes
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [2] Learning state importance for preference-based reinforcement learning
    Zhang, Guoxi
    Kashima, Hisashi
    MACHINE LEARNING, 2023, 113 (4) : 1885 - 1901
  • [3] Learning state importance for preference-based reinforcement learning
    Guoxi Zhang
    Hisashi Kashima
    Machine Learning, 2024, 113 : 1885 - 1901
  • [4] Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning
    Cheng, Weiwei
    Fuernkranz, Johannes
    Huellermeier, Eyke
    Park, Sang-Hyeun
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2011, 6911 : 312 - 327
  • [5] Model-Free Preference-Based Reinforcement Learning
    Wirth, Christian
    Fuernkranz, Johannes
    Neumann, Gerhard
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2222 - 2228
  • [6] Dueling Posterior Sampling for Preference-Based Reinforcement Learning
    Novoseller, Ellen R.
    Wei, Yibing
    Sui, Yanan
    Yue, Yisong
    Burdick, Joel W.
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 1029 - 1038
  • [7] Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm
    Róbert Busa-Fekete
    Balázs Szörényi
    Paul Weng
    Weiwei Cheng
    Eyke Hüllermeier
    Machine Learning, 2014, 97 : 327 - 351
  • [8] Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm
    Busa-Fekete, Robert
    Szoerenyi, Balazs
    Weng, Paul
    Cheng, Weiwei
    Huellermeier, Eyke
    MACHINE LEARNING, 2014, 97 (03) : 327 - 351
  • [9] Preference-based online learning with dueling bandits: A survey
    Bengs, Viktor
    Busa-Fekete, Robert
    Mesaoudi-Paul, Adil El
    Hullermeier, Eyke
    Journal of Machine Learning Research, 2021, 22
  • [10] Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
    Ren, Zhizhou
    Liu, Anji
    Liang, Yitao
    Peng, Jian
    Ma, Jianzhu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,